Preface


The basic sciences, and physics in particular, are of prime
importance today in the training of engineers for the various
branches of the national economy. This has led to fundamental
changes in recent years in the teaching of physics in engineering
institutes and in other educational institutions whose students do
not major in physics. The scope and scientific level of physics
courses have been substantially extended and now cover the main
trends in the development of modern physics. Consequently, the
physics textbooks for engineering students have inevitably become
three-volume editions of almost fifteen hundred pages. The need has
arisen, in this connection, for a concise handbook on this subject.
The aim of the authors was to fulfill this need.

In scope and depth this handbook includes all of the definitions,
formulas and information covered in the most comprehensive and
up-to-date physics courses of engineering institutes and the physics
departments of universities and colleges. Physical laws are concisely
formulated, all the necessary explanations are given and, in many
cases, derivations as well. Though it plays a vital role in the
teaching of physics, experimental material has been omitted. This is
due only to a lack of space. All the units and symbols comply with
the requirements of the SI. Units of physical quantities and systems
of units are listed and dealt with in a short appendix.

This handbook is designed primarily for engineering students, as well
as college and university students studying, but not majoring in,
physics. It can be used to advantage by engineers and graduate
students, as well as by instructors and lecturers of intermediate
schools and colleges.


The mathematical knowledge required in using the handbook is within
the scope of the ordinary mathematics courses of engineering
institutes. The detailed index and the numerous cross references,
indicating the chapter, section and subsection, are of aid in finding
any required information. The authors are sincerely grateful to the
translator, N. Weinstein, and the science editor, T. Baranovskaya,
for their comprehensive and expert work in the publication of the
English edition of this handbook, as well as for their valuable
suggestions, which should facilitate its use by the reader.


B.  Yavorsky 
A.  Detlaf 


Parts  One,  Four,  Five  and  the  appendices  were  written  by 
Prof.  A.  A.  Detlaf;  Parts  Two,  Three,  Six,  Seven  and  Eight, 
by  Prof.  B.  M.  Yavorsky. 


PART  ONE  MECHANICS 


CHAPTER  1  KINEMATICS 

1.1  Mechanical  Motion.  The  Subject  Matter 
of  Mechanics 

1.1.1 The simplest form of motion in nature is mechanical motion. It
consists of changes in the relative positions of bodies or their
parts in space in the course of time. The branch of physics dealing
with the laws of mechanical motion is called mechanics. In a narrower
sense of the word, this is often understood to mean classical
mechanics, which deals with the motion of macroscopic bodies
travelling at velocities a great many times less than that of light
in free space. Classical mechanics is based on Newton's laws and is
therefore frequently referred to as Newtonian mechanics. The laws of
motion of bodies at velocities close to that of light in free space
are taken up in relativistic mechanics (5.1.1), and the laws of
motion of microparticles (for example, the electrons in atoms,
molecules, crystals, etc.) are the subject matter of quantum
mechanics (38.1.1).

1.1.2 Classical mechanics comprises three basic branches: statics,
kinematics and dynamics. Statics deals with the laws for the
composition of forces and the conditions for the equilibrium of
bodies. Kinematics provides a mathematical description of all
possible kinds of mechanical motion, regardless of what causes each
particular kind of motion. Dynamics investigates the effect of the
interaction between bodies on their mechanical motion.

1.1.3 The mechanical properties of bodies are determined by their
chemical nature, internal structure and state. These are beyond the
scope of mechanics and are studied in other branches of physics.
Hence, in accordance with the conditions of each specific problem at
hand, various simplified models are employed to describe real bodies
in motion. These include the particle, the perfectly rigid body, the
perfectly elastic body, the perfectly inelastic body and others.


A material point, or particle, is a body whose shape and size are of
no consequence under the conditions of the problem being considered.
For example, in a first approximation, the motion of a ship from one
point to another can be dealt with as the motion of a particle. If,
however, it becomes necessary to take into account such details of
this motion as the pitching and rolling of the ship by swells or
heavy seas, the ship should be dealt with as an extended body of
definite shape. In the literature the word particle is more
frequently used than material point.

Any extended body or system of such bodies, making up the mechanical
system being investigated, can be regarded as a system of particles.
For this purpose, all the bodies of the system can be conceived of as
being broken down into such a large number of parts that the size of
each part is negligibly small compared to the sizes of the bodies
themselves.

1.1.4 A perfectly rigid body, or simply rigid body, is one whose
deformation can be neglected under the conditions of the given
problem. The distance between any two points of a rigid body remains
constant no matter what action the body is subjected to. A perfectly
rigid body can be regarded as a system of particles rigidly connected
to one another.

A perfectly elastic body is one whose deformation obeys Hooke's law
(40.3.4). After the external forces applied to such a body have been
removed, its initial size and shape are completely restored.

A  perfectly  inelastic  body  is  one  that  completely  retains  its 
deformed  state,  caused  by  external  forces  applied  to  the  body, 
when  these  forces  have  been  removed. 

1.2  Frames  of  Reference.  Path,  Path  Length 
and  Displacement  Vector  of  a  Particle 

1.2.1 The position of a body in space can be specified only in
reference to other bodies. It makes sense, for instance, to speak of
the location of the planets with respect to the sun, or that of an
air liner or steamship with respect to the earth, but we cannot
indicate their position in space "in general" without reference to
some specific body. A perfectly rigid body in which a coordinate
system is rigidly fixed, furnished with a "clock" and employed to
determine the position of the bodies and particles being investigated
in space at various instants of time, is called a frame of reference.
Sometimes the term frame of reference is applied to the coordinate
system itself, provided it is furnished with a "clock"; the solid in
which the system is rigidly fixed is then called the body of
reference. In each specific problem the frame of reference is
selected so as to simplify the problem to the maximum extent.
Inertial frames of reference (2.1.2) are the most commonly used in
physics.

1.2.2 Most extensively employed is the Cartesian rectangular system
of coordinates (Fig. 1.1), whose orthonormal (normal orthogonal)
basis is formed by the three vectors i, j and k, which are unit
vectors with respect to their magnitudes, mutually orthogonal and
pass through the origin O of coordinates. The position of an
arbitrary point M is specified by the radius vector r, which connects
the origin O of coordinates to point M.

Fig. 1.1

Vector r can be resolved along the axes i, j and k of the basis as
follows:

$$\mathbf{r} = x\mathbf{i} + y\mathbf{j} + z\mathbf{k},$$

where xi, yj and zk are the components of vector r along the
coordinate axes. The coefficients of resolution, x, y and z, are the
Cartesian coordinates of point M and, owing to the orthogonality of
the vectors of the basis, the projections of radius vector r on the
corresponding coordinate axes.

The motion of a particle is completely specified if three continuous
and single-valued functions of the time t,

$$x = x(t), \quad y = y(t) \quad\text{and}\quad z = z(t),$$

are given to describe the changes in the coordinates of the particle
with time.

These equations are said to be the kinematic equations of motion of a
particle. They are equivalent to the single vector equation
$\mathbf{r} = \mathbf{r}(t)$ for a moving particle.

1.2.3 The line described by a moving particle in space is called its
path. The kinematic equations of motion of a particle specify the
equation of its path in parametric form (the parameter being the
time t). Depending upon the shape of the path, distinction is made
between rectilinear and curvilinear motion. The motion of a particle
is said to be plane if its path lies completely in a single plane.
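The parametric description of a path can be illustrated numerically.
The following is a minimal sketch, not taken from the handbook: the
helical motion and the names position, sample_path, radius, omega and
climb are assumptions chosen only for illustration. Stepping the
parameter t through the kinematic equations yields points of the
path.

```python
import math

def position(t, radius=1.0, omega=1.0, climb=0.5):
    """Radius vector r(t) = (x(t), y(t), z(t)) for an assumed helical motion."""
    x = radius * math.cos(omega * t)
    y = radius * math.sin(omega * t)
    z = climb * t
    return (x, y, z)

def sample_path(t_start, t_end, steps):
    """Points of the path obtained by stepping the parameter t uniformly."""
    dt = (t_end - t_start) / steps
    return [position(t_start + i * dt) for i in range(steps + 1)]

if __name__ == "__main__":
    for point in sample_path(0.0, 2.0 * math.pi, 4):
        print(point)
```

Setting climb to zero confines the sampled path to the plane z = 0,
which is the case of plane motion.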

The mechanical motion of a body is relative, i.e. the nature of the
motion and, in particular, the kind of path travelled by the points
of the body depend upon the chosen frame of reference.

1.2.4 In the general case, the path of a moving particle is a
spatial, rather than plane, curve. Introduced for such curves is the
concept of an osculating plane. The osculating plane at the arbitrary
point M of a curve is the limiting position of a plane passing
through any three points of the curve when these points approach
point M without limit.

An osculating circle at point M of a curve is the limit of a circle
passing through three points of the curve being considered when these
points approach point M without limit. The osculating circle lies in
the osculating plane. The centre of the osculating circle and its
radius are called the centre of curvature and radius of curvature, at
point M, of the curve being considered. The straight line connecting
point M to the centre of curvature is called the principal normal to
the curve at point M. The tangent to the curve at point M is
perpendicular to the principal normal and also lies in the osculating
plane.

Fig. 1.2

1.2.5 The path length is the sum of the lengths of all the portions
of the path over which the particle passes in the given interval of
time. The instant of time $t = t_0$, from which we begin to consider
the motion of a particle, is called the initial instant of time, and
the position of the particle at this instant (point A in Fig. 1.2) is
called its initial position. Owing to the arbitrary nature of the
time reference point it is usually assumed that $t_0 = 0$. The path
length s, travelled by the particle from its initial position, is a
scalar function of time: $s = s(t)$. As is obvious from its
definition, the path length of a particle cannot be a negative
quantity. If the particle travels along an arc of the path AB
(Fig. 1.2), always in the same direction, and is at point M at the
instant of time t, then $s(t) = \overset{\frown}{AM}$. If the
particle travels along the path in a more complicated way, for
example if it travels from A to B, reaching B at the instant of time
$t_1 < t$, and then reverses, returning to point M at instant t, then
$s(t) = \overset{\frown}{AB} + \overset{\frown}{BM}$.

1.2.6 The displacement vector of a moving particle during the time
interval from $t = t_1$ to $t = t_2$ is the vector from the position
of the particle at the instant of time $t_1$ to its position at
instant $t_2$. It is equal to the increment in the radius vector of
the particle during the time interval being considered:

$$\mathbf{r}_2 - \mathbf{r}_1 = \mathbf{r}(t_2) - \mathbf{r}(t_1).$$

The displacement vector is always directed along the chord subtending
the corresponding portion of the path.

Shown in Fig. 1.2 is the displacement vector of the particle for the
time interval from $t_0$ to t. It is equal to
$\mathbf{r} - \mathbf{r}_0 = \mathbf{r}(t) - \mathbf{r}(t_0)$.

The displacement vector of a moving particle for the time interval
from t to $t + \Delta t$ equals

$$\Delta\mathbf{r} = \mathbf{r}(t + \Delta t) - \mathbf{r}(t) = \Delta x\,\mathbf{i} + \Delta y\,\mathbf{j} + \Delta z\,\mathbf{k},$$

where $\Delta x$, $\Delta y$ and $\Delta z$ are the increments
(changes) in the coordinates of the particle during the time interval
being considered.
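The difference between path length and displacement can be checked
with a few lines of code. This is a minimal sketch under the
assumption that the path is approximated by straight segments between
sampled positions; the function names path_length and displacement
are hypothetical. The sample data mimic a particle that travels from
A to B and then reverses to M.

```python
import math

def path_length(points):
    """Sum of the lengths of consecutive segments (never negative)."""
    return sum(math.dist(p, q) for p, q in zip(points, points[1:]))

def displacement(points):
    """Increment of the radius vector between the first and last positions."""
    first, last = points[0], points[-1]
    return tuple(b - a for a, b in zip(first, last))

if __name__ == "__main__":
    # A = (0, 0, 0), B = (4, 0, 0), M = (1, 0, 0): forward, then back.
    visited = [(0, 0, 0), (4, 0, 0), (1, 0, 0)]
    print(path_length(visited))   # length AB + BM
    print(displacement(visited))  # vector from A to M
```

The path length accumulates both legs of the trip, while the
displacement depends only on the initial and final positions.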

1.2.7 A particle travelling freely in space can accomplish only three
independent motions, i.e. motions none of which can be represented as
a combination of the other two. As a matter of fact, the motion of a
particle along each of the axes of a Cartesian rectangular coordinate
system cannot be obtained by its motion along the other two axes. The
number of independent motions that a mechanical system can accomplish
is called the number of degrees of freedom of the system. Hence, a
free particle has three degrees of freedom.

1.3 Velocity

1.3.1 The concept of velocity is introduced in mechanics to
characterize the rate of motion of bodies. It is the time rate of
change in the position of moving bodies. The average velocity of a
moving particle in the time interval from t to $t + \Delta t$ is the
vector $\mathbf{v}_{av}$, equal to the ratio of the increment
$\Delta\mathbf{r}$ in the radius vector of the particle during this
time interval to the length of the time interval $\Delta t$:

$$\mathbf{v}_{av} = \frac{\Delta\mathbf{r}}{\Delta t}.$$

Vector $\mathbf{v}_{av}$ has the same direction as
$\Delta\mathbf{r}$, i.e. along the chord subtending the corresponding
portion of the particle's path.

1.3.2 The velocity (or instantaneous velocity) of a moving particle
is the vector quantity v, equal to the first time derivative of the
radius vector r of the particle being considered:

$$\mathbf{v} = \frac{d\mathbf{r}}{dt}.$$

The velocity of the particle at the instant of time t is the limiting
value that the average velocity $\mathbf{v}_{av}$ approaches for an
infinitely small value of the time interval $\Delta t$. Thus

$$\mathbf{v} = \lim_{\Delta t \to 0} \frac{\Delta\mathbf{r}}{\Delta t} = \lim_{\Delta t \to 0} \mathbf{v}_{av}.$$

The velocity vector v of the particle is directed along the tangent
to the path in the direction of motion, as is the vector
$d\mathbf{r} = \mathbf{v}\,dt$ of the infinitely small displacement
of the particle during an infinitely short time interval dt.

The path length ds travelled by the particle during the length of
time dt is equal to the magnitude of the displacement vector:
$ds = |d\mathbf{r}|$. Therefore the magnitude of the velocity vector
of a moving particle is equal to the first time derivative of the
path length:

$$v = |\mathbf{v}| = \frac{ds}{dt}.$$

1.3.3 The resolution of vector v along the basis of a Cartesian
rectangular coordinate system is of the following form:

$$\mathbf{v} = v_x\mathbf{i} + v_y\mathbf{j} + v_z\mathbf{k}.$$

The projections of the velocity of a moving particle on the
coordinate axes are equal to the first time derivatives of the
corresponding coordinates of the particle:

$$v_x = \frac{dx}{dt}, \quad v_y = \frac{dy}{dt} \quad\text{and}\quad v_z = \frac{dz}{dt},$$

and the magnitude of the velocity vector, sometimes called the speed,
is

$$v = \sqrt{v_x^2 + v_y^2 + v_z^2} = \sqrt{\left(\frac{dx}{dt}\right)^2 + \left(\frac{dy}{dt}\right)^2 + \left(\frac{dz}{dt}\right)^2}.$$


1.3.4 In rectilinear motion of a particle, the direction of its
velocity vector remains constant. The motion of a particle is said to
be uniform if the magnitude of its velocity does not change with
time: $v = ds/dt = \text{const}$. The path length s travelled by a
uniformly moving particle is a linear function of time: $s = vt$
(provided that $t_0 = 0$, see Sect. 1.2.5).

If the magnitude of the particle's velocity, i.e. its speed,
increases with time ($dv/dt > 0$), the motion is said to be
accelerated; if the speed decreases with time ($dv/dt < 0$), the
motion is said to be decelerated.
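The definitions of velocity, speed and accelerated or decelerated
motion can be tried out numerically. The sketch below is only an
illustration: it assumes the motion is given as a Python function
r(t) returning (x, y, z), replaces the derivatives by central
differences, and uses hypothetical names (velocity, speed, classify)
and step sizes chosen for this example.

```python
import math

def velocity(r, t, h=1e-5):
    """Central-difference estimate of v = dr/dt for r(t) -> (x, y, z)."""
    ahead, behind = r(t + h), r(t - h)
    return tuple((a - b) / (2.0 * h) for a, b in zip(ahead, behind))

def speed(r, t, h=1e-5):
    """Magnitude of the velocity vector, sqrt(vx^2 + vy^2 + vz^2)."""
    return math.sqrt(sum(c * c for c in velocity(r, t, h)))

def classify(r, t, h=1e-3, tol=1e-4):
    """Label the motion at time t by the sign of the estimated dv/dt."""
    dv_dt = (speed(r, t + h) - speed(r, t - h)) / (2.0 * h)
    if dv_dt > tol:
        return "accelerated"
    if dv_dt < -tol:
        return "decelerated"
    return "uniform"

if __name__ == "__main__":
    print(classify(lambda t: (t * t, 0.0, 0.0), 1.0))
    print(classify(lambda t: (3.0 * t, 4.0 * t, 0.0), 2.0))
```

The tolerance tol absorbs rounding noise in the finite differences;
its value here is an assumption and would need adjusting for motions
with very different scales.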

1.3.5 The average velocity of nonuniform motion of a particle over a
given portion of its path is the scalar $v_{av}$, called the
travelling speed, which is equal to the ratio of the length
$\Delta s$ of this portion to the length $\Delta t$ of the time
interval required to travel over this portion:

$$v_{av} = \frac{\Delta s}{\Delta t}.$$

It is equal to the speed of uniform motion in which the time required
to travel the same path length $\Delta s$ is the same as in the
nonuniform motion being considered.

In curvilinear motion of a particle $|\Delta\mathbf{r}| < \Delta s$.
Therefore, in the general case, the average velocity (travelling
speed) $v_{av}$ of the particle is not equal to the magnitude of its
average velocity $\mathbf{v}_{av}$ over the same portion of its path
(1.3.1): $v_{av} \ge |\mathbf{v}_{av}|$, where the equality sign
corresponds to a rectilinear portion of the path.
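The inequality between the travelling speed and the magnitude of the
average velocity can be verified on sampled data. This is a minimal
sketch, assuming the path is approximated by a polyline through
sampled positions; the function names and the semicircular test path
are illustrative choices, not part of the handbook.

```python
import math

def travelling_speed(points, elapsed):
    """Path length along the sampled points divided by the elapsed time."""
    ds = sum(math.dist(p, q) for p, q in zip(points, points[1:]))
    return ds / elapsed

def average_velocity_magnitude(points, elapsed):
    """Length of the chord from first to last point divided by elapsed time."""
    return math.dist(points[0], points[-1]) / elapsed

if __name__ == "__main__":
    n = 200
    # Half of a unit circle, traversed in 2 units of time.
    arc = [(math.cos(math.pi * i / n), math.sin(math.pi * i / n))
           for i in range(n + 1)]
    print(travelling_speed(arc, 2.0))            # close to pi / 2
    print(average_velocity_magnitude(arc, 2.0))  # close to 1
```

For a straight segment traversed in one direction the two quantities
coincide, which is the equality case of the relation above.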

1.3.6 In the case of plane motion of a particle M (1.2.3), it often
proves convenient to use the polar coordinates r and $\varphi$, where
r is the distance from pole O to particle M and $\varphi$ is the
vectorial angle measured from polar axis OA (Fig. 1.3). The velocity
v of particle M can be resolved into two mutually perpendicular
components: the radial velocity $\mathbf{v}_r$ and the transverse
velocity $\mathbf{v}_\varphi$:

$$\mathbf{v} = \mathbf{v}_r + \mathbf{v}_\varphi,$$

in which

$$\mathbf{v}_r = \frac{dr}{dt}\,\frac{\mathbf{r}}{r} \quad\text{and}\quad \mathbf{v}_\varphi = \frac{d\varphi}{dt}\,[\mathbf{k}\,\mathbf{r}],$$

where r is the polar radius vector of particle M, and k is the unit
vector directed perpendicular to the plane of motion of the particle
so that the rotation of radius vector r upon an increase in angle
$\varphi$ is seen, from the head of unit vector k, to be
counterclockwise.

Fig. 1.3

The magnitude v of the velocity vector of particle M, in plane
motion, is

$$v = \sqrt{\left(\frac{dr}{dt}\right)^2 + r^2\left(\frac{d\varphi}{dt}\right)^2}.$$

During the infinitely short length of time dt, polar radius vector r
of the particle in plane motion sweeps over a sector of a circle, the
area of the sector being $dS = r^2\,d\varphi/2$. Hence, the quantity

$$\frac{dS}{dt} = \frac{1}{2}\,r^2\,\frac{d\varphi}{dt}$$

is called the areal velocity.
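The polar-coordinate relations of this subsection reduce to a few
arithmetic steps. The following sketch is illustrative only; the
function name polar_velocity and its argument names (r_dot for dr/dt,
phi_dot for the time derivative of the vectorial angle) are
assumptions made for this example.

```python
import math

def polar_velocity(r, r_dot, phi_dot):
    """Return (radial, transverse, speed, areal velocity) for plane motion.

    radial     = dr/dt
    transverse = r * dphi/dt
    speed      = sqrt(radial^2 + transverse^2)
    areal      = r^2 * (dphi/dt) / 2
    """
    v_radial = r_dot
    v_transverse = r * phi_dot
    speed = math.hypot(v_radial, v_transverse)
    areal = 0.5 * r * r * phi_dot
    return v_radial, v_transverse, speed, areal

if __name__ == "__main__":
    print(polar_velocity(2.0, 3.0, 2.0))
```

With r = 2, dr/dt = 3 and an angular rate of 2, the radial and
transverse components are 3 and 4, so the speed is 5 and the areal
velocity is 4.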
1.4 Acceleration

1.4.1 The concept of acceleration is introduced in mechanics to
characterize the rate of change of the velocity vector of a moving
particle. The average acceleration in the time interval from t to
$t + \Delta t$ is the vector $\mathbf{a}_{av}$, which is equal to the
ratio of the increment $\Delta\mathbf{v}$ of the velocity vector of
the particle, during the time interval, to the length of the time
interval $\Delta t$:

$$\mathbf{a}_{av} = \frac{\Delta\mathbf{v}}{\Delta t}.$$

1.4.2 Acceleration (or instantaneous acceleration) of a moving
particle is the vector quantity a, equal to the first time derivative
of the velocity v of the particle being considered or, otherwise, the
second time derivative of the particle's radius vector r:

$$\mathbf{a} = \frac{d\mathbf{v}}{dt} = \frac{d^2\mathbf{r}}{dt^2}.$$

The acceleration of a moving particle at the instant of time t is the
limiting value that the average acceleration $\mathbf{a}_{av}$
approaches for an infinitely small time interval $\Delta t$. Thus

$$\mathbf{a} = \lim_{\Delta t \to 0} \frac{\Delta\mathbf{v}}{\Delta t} = \lim_{\Delta t \to 0} \mathbf{a}_{av}.$$


1.4.3 The resolution of vector a along the basis of a rectangular
Cartesian coordinate system is of the form:

$$\mathbf{a} = a_x\mathbf{i} + a_y\mathbf{j} + a_z\mathbf{k}.$$

The projections of the acceleration vector of a moving particle on
the coordinate axes are equal to the first time derivatives of the
corresponding velocity projections or, otherwise, the second time
derivatives of the corresponding coordinates of the particle:

$$a_x = \frac{dv_x}{dt} = \frac{d^2x}{dt^2}, \quad a_y = \frac{dv_y}{dt} = \frac{d^2y}{dt^2} \quad\text{and}\quad a_z = \frac{dv_z}{dt} = \frac{d^2z}{dt^2}.$$

The magnitude of the acceleration vector is

$$a = \sqrt{a_x^2 + a_y^2 + a_z^2} = \sqrt{\left(\frac{d^2x}{dt^2}\right)^2 + \left(\frac{d^2y}{dt^2}\right)^2 + \left(\frac{d^2z}{dt^2}\right)^2}.$$
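The acceleration projections can be estimated from the coordinates
alone, since each is a second time derivative. The sketch below is an
illustration under stated assumptions: the motion is supplied as a
Python function r(t) returning (x, y, z), the second derivative is
replaced by a central second difference, and the names acceleration,
magnitude and projectile are hypothetical.

```python
import math

def acceleration(r, t, h=1e-3):
    """Central second difference estimate of a = d^2 r / dt^2."""
    ahead, here, behind = r(t + h), r(t), r(t - h)
    return tuple((a - 2.0 * c + b) / (h * h)
                 for a, c, b in zip(ahead, here, behind))

def magnitude(vector):
    """Square root of the sum of the squared projections."""
    return math.sqrt(sum(c * c for c in vector))

def projectile(t, v0=3.0, g=9.81):
    """Assumed test motion: uniform along x, free fall along z."""
    return (v0 * t, 0.0, -0.5 * g * t * t)

if __name__ == "__main__":
    a = acceleration(projectile, 1.0)
    print(a, magnitude(a))
```

For this test motion the only nonzero projection is the one along z,
so the magnitude of the acceleration equals the assumed value of g up
to rounding error.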


1.4.4 The acceleration vector of a moving particle lies in the
osculating plane (1.2.4) passing through point M of the path being
considered and is directed toward the concavity of the path BC
(Fig. 1.4). In this plane the acceleration vector a can be resolved
into two mutually perpendicular components $\mathbf{a}_\tau$ and
$\mathbf{a}_n$:

$$\mathbf{a} = \mathbf{a}_\tau + \mathbf{a}_n.$$

Fig. 1.4


1.4.5 Component $\mathbf{a}_\tau$ is called the tangential
acceleration of the moving particle. It is directed along the tangent
to the path and equals

$$\mathbf{a}_\tau = \frac{dv}{dt}\,\boldsymbol{\tau} \quad\text{and}\quad a_\tau = \frac{dv}{dt},$$

where $\boldsymbol{\tau} = \mathbf{v}/v$ is the unit vector of the
tangent, from point M of the path in the direction of velocity v of
the particle, and $a_\tau$ is the projection of the tangential
acceleration on the direction of vector v. The tangential
acceleration characterizes the rate of change of the magnitude of the
particle's velocity vector. Vectors $\mathbf{a}_\tau$ and v coincide
in direction, i.e. $a_\tau > 0$, in accelerated motion (1.3.4);
vectors $\mathbf{a}_\tau$ and v are opposite in direction, i.e.
$a_\tau < 0$, in decelerated motion of the particle; and
$a_\tau = 0$ in its uniform motion. If
$a_\tau = \text{const} \ne 0$, the motion is said to be uniformly
variable. In uniformly variable motion the magnitude of the
particle's velocity is linearly time-dependent:

$$v = v_0 + a_\tau t,$$

where $v_0 = v(0)$ is the magnitude of the initial velocity, i.e. the
velocity at the initial instant of time $t = 0$. If
$a_\tau = \text{const} > 0$, the motion of the particle is said to be
uniformly accelerated, whereas if $a_\tau = \text{const} < 0$, it is
said to be uniformly decelerated.

1.4.6 Component $\mathbf{a}_n$ of acceleration a of the moving
particle is called its normal acceleration. It is directed along the
principal normal to the path at point M being considered, toward the
centre of curvature (1.2.4) of the path. Therefore, $\mathbf{a}_n$ is
often called the centripetal acceleration of the particle. The normal
acceleration equals

$$\mathbf{a}_n = \frac{v^2}{R}\,\mathbf{n} \quad\text{and}\quad a_n = \frac{v^2}{R},$$

where n is the unit vector of the principal normal, and R is the
radius of curvature of the path. The normal acceleration
characterizes the rate of change in direction of the particle's
velocity vector.

If the particle travels in a straight line the normal acceleration
$\mathbf{a}_n = 0$ and the acceleration is equal to the tangential
acceleration of the particle: $\mathbf{a} = \mathbf{a}_\tau$.
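Given the velocity and acceleration vectors at one instant, the
tangential and normal parts can be separated numerically. The sketch
below is not the handbook's procedure: it assumes a nonzero velocity
and relies on the fact that the two components are mutually
perpendicular, so that the tangential projection is the scalar
product of a with the unit tangent v/v and the normal magnitude
follows from the Pythagorean relation. The function name
tangential_normal is hypothetical.

```python
import math

def tangential_normal(v, a):
    """Return (a_tau, a_n, R) from velocity v and acceleration a.

    a_tau is the projection of a on the direction of v,
    a_n the magnitude of the remaining (normal) component,
    R = v^2 / a_n the radius of curvature (infinite if a_n is zero).
    """
    speed = math.sqrt(sum(c * c for c in v))
    a_tau = sum(ac * vc for ac, vc in zip(a, v)) / speed
    a_squared = sum(c * c for c in a)
    # Guard against a tiny negative value produced by rounding.
    a_n = math.sqrt(max(a_squared - a_tau * a_tau, 0.0))
    radius = speed * speed / a_n if a_n > 0.0 else math.inf
    return a_tau, a_n, radius

if __name__ == "__main__":
    # Uniform motion in a circle of unit radius at speed 2.
    print(tangential_normal((0.0, 2.0, 0.0), (-4.0, 0.0, 0.0)))
    # Rectilinear accelerated motion.
    print(tangential_normal((3.0, 0.0, 0.0), (2.0, 0.0, 0.0)))
```

In the first case the acceleration is purely normal and the radius of
curvature comes out as 1; in the second it is purely tangential and
the radius is reported as infinite.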

1.5 Translational and Rotary Motion of a Rigid Body*

1.5.1 Translation is the kind of motion of a rigid body in which any
straight line rigidly fixed in the body (for instance, line AB in
Fig. 1.5) remains parallel to its initial position ($A_0B_0$). With
respect to the earth, a lift cabin (elevator car), lathe tool, and
many other items have translational motion.

Fig. 1.5

In the translational motion of a rigid body all of its points are
displaced in an identical way. During the infinitely short time dt
the radius vectors of these points are changed by the same amount
$d\mathbf{r}$. Hence the velocity of all points of the body is the
same at each instant of time and is equal to $d\mathbf{r}/dt$ and,
consequently, their acceleration is also the same. For this reason, a
kinematic discussion of the translational motion of a rigid body
reduces to the investigation of the motion of any one of its points,
or particles. Dynamics usually deals with the motion of the centre of
mass of the body (2.3.3). A rigid body travelling freely in space has
three translational degrees of freedom (1.2.7), corresponding to its
translational motions along the three coordinate axes.

* Only perfectly rigid bodies are dealt with in this section; for
the sake of brevity, they have been called simply "rigid bodies".

1.5.2 The motion of a rigid body in which two of its points, A and B,
remain fixed is called rotation (or rotary motion) of the body about
a fixed axis. The fixed straight line AB is the axis of rotation. In
the rotation of a rigid body about a fixed axis all the points of the
body describe circles whose centres lie on the axis of rotation and
whose planes are perpendicular to this axis. The rotors of turbines,
motors and generators, for example, have rotary motion with respect
to the earth if these machines are rigidly mounted.

A rigid body rotating about a fixed axis has only one degree of
freedom (1.2.7). Its position in space can be completely defined by
specifying the angle of rotation (position angle) $\varphi$ from some
definite (initial) position.

1.5.3 The angular velocity serves to characterize the rate and
direction of rotation of a body about an axis. The angular velocity
is the vector $\boldsymbol{\omega}$, which is equal in magnitude to
the first derivative of the angle of rotation $\varphi$ with respect
to time t and is directed along the fixed axis of rotation in such a
way that from its head the rotation of the body is seen to be
counterclockwise (Fig. 1.6)*:

$$\boldsymbol{\omega} = \frac{d\boldsymbol{\varphi}}{dt} \quad\text{and}\quad \omega = \frac{d\varphi}{dt}.$$

Here $d\boldsymbol{\varphi}$ is the vector of an element of rotation
of the body during the time dt. This vector is directed along the
axis of rotation, also according to the right-hand screw rule
(Fig. 1.6). Axial vectors $d\boldsymbol{\varphi}$ and
$\boldsymbol{\omega}$ have no definite points of application: they
can be laid off from any point on the axis of rotation. In Fig. 1.6
they are laid off from a certain point O of the fixed axis of
rotation. This point is also taken as the origin of the reference
frame.

Fig. 1.6

Rotation of a body is said to be uniform if the magnitude of its
angular velocity remains constant with time:
$\omega = \text{const}$. In this case the angle of rotation of the
body is linearly dependent on time: $\varphi = \omega t$.

* The direction of vector $\boldsymbol{\omega}$ can also be
determined by the right-hand screw rule. It coincides with the
direction of translational motion of a right-hand screw rotating
together with the body. Vectors like $\boldsymbol{\omega}$, whose
direction is related to the direction of rotation and is reversed in
transforming from a right-handed to a left-handed system of
coordinates, are called pseudovectors, or axial vectors (in
distinction to ordinary, polar vectors, which are not reversed in the
above-mentioned transformation of coordinates). For example, the
vector product of two polar vectors is a pseudovector, and that of a
pseudovector and a polar vector is a polar vector.

1.5.4 An arbitrary point M of a rigid body rotating about fixed axis
OZ with the angular velocity $\boldsymbol{\omega}$ describes a circle
of radius $\rho$ with its centre at point O' (Fig. 1.7). As distinct
from the angular velocity of the body, the velocity v of point M is
often called the linear velocity. It is perpendicular both to the
axis of rotation (i.e. to vector $\boldsymbol{\omega}$) and to radius
vector $\boldsymbol{\rho}$, from the centre O' of the circle to
point M, and is equal to their vector product:

$$\mathbf{v} = [\boldsymbol{\omega}\,\boldsymbol{\rho}] = [\boldsymbol{\omega}\,\mathbf{r}] \quad\text{and}\quad v = \omega\rho.$$

Here $\mathbf{r} = \overrightarrow{OO'} + \boldsymbol{\rho}$ is the
radius vector of point M, from point O of the axis of rotation.
Point O is taken as the origin of coordinates.
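The linear velocity of a point of a rotating body can be computed
directly from the vector product. The following is a minimal sketch
with hypothetical helper names (cross, linear_velocity) and made-up
numbers; it assumes the axis of rotation is the z axis passing
through the origin.

```python
def cross(a, b):
    """Vector product of two 3-component vectors."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def linear_velocity(omega, r):
    """Velocity of the point with radius vector r: the vector product of omega and r."""
    return cross(omega, r)

if __name__ == "__main__":
    omega = (0.0, 0.0, 2.0)   # rotation about the z axis
    r = (3.0, 0.0, 5.0)       # point at distance 3 from the axis
    print(linear_velocity(omega, r))
```

The component of r along the axis does not affect the result: the
speed equals the angular velocity times the distance from the axis,
here 2 times 3.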

1.5.5 The period of revolution is the time interval T required for a
body, rotating uniformly at the angular velocity $\omega$, to
complete one revolution about its axis of rotation (i.e. to rotate
through the angle $\varphi = 2\pi$): $T = 2\pi/\omega$.

The rotational speed $n = 1/T = \omega/2\pi$ is the number of
revolutions made by a body in unit time in uniform rotation at the
angular velocity $\omega$.
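The relations between angular velocity, period and rotational speed
are simple enough to check by direct computation. In the sketch below
the function names and the conversion from revolutions per minute are
illustrative assumptions; angular velocity is taken in radians per
second.

```python
import math

def period(omega):
    """Period of revolution T = 2*pi / omega for uniform rotation."""
    return 2.0 * math.pi / omega

def rotational_speed(omega):
    """Revolutions per unit time, n = omega / (2*pi) = 1 / T."""
    return omega / (2.0 * math.pi)

def angular_velocity_from_rpm(rpm):
    """Angular velocity in rad/s for a speed given in revolutions per minute."""
    return 2.0 * math.pi * rpm / 60.0

if __name__ == "__main__":
    omega = angular_velocity_from_rpm(3000.0)
    print(omega, period(omega), rotational_speed(omega))
```

A rotor turning at 3000 revolutions per minute makes 50 revolutions
per second, so its period is 0.02 s.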

1.5.6  The  rotation  of  a  rigid  body  in  which  one  of  its  points 
remains  fixed  is  said  to  be  rotation  about  a  fixed  point.  This 
point  is  usually  taken  as  the  origin  of  the  fixed  reference  frame. 
In  such  rotation,  all  the  points  of  the  body  travel  along  the 
surfaces  of  concentric  spheres  whose  centre  is  at  the  fixed  point. 
At  each  instant  of  time,  this  motion  of  the  rigid  body  can  be 
regarded  as  rotation  about  a  certain  axis  that  passes  through 
the  fixed  point  and  is  called  the  instantaneous  axis  of  rotation. 
In  the  general  case,  the  position  of  the  instantaneous  axis  of 
rotation  varies  with  respect  to  both  a  fixed  reference  frame  and 
one  rigidly  fixed  in  the  rotating  body. 

The velocity v of arbitrary point M of such a body is

$$\mathbf{v} = [\boldsymbol{\omega}\,\mathbf{r}] \quad\text{and}\quad v = \omega\rho.$$

Here $\boldsymbol{\omega} = d\boldsymbol{\varphi}/dt$ is the angular
velocity of the body and is directed along the instantaneous axis of
rotation, as is the vector $d\boldsymbol{\varphi}$ of an element of
rotation of the body during the infinitely short time dt; r is the
radius vector from fixed point O, about which the body rotates, to
point M; and $\rho$ is the distance from point M to the instantaneous
axis of rotation. Such a body can have three independent motions: it
can rotate about each of three mutually perpendicular axes passing
through fixed point O. It consequently has three degrees of freedom
(1.2.7).

1.5.7 The angular acceleration vector $\boldsymbol{\varepsilon}$ is introduced to indicate
the rate of change of the angular velocity vector with time in
nonuniform rotation of a body about a fixed axis or its rotation
about a fixed point. Vector $\boldsymbol{\varepsilon}$ is equal to the first derivative of
the angular velocity $\boldsymbol{\omega}$ of the body with respect to the time t:

$$\boldsymbol{\varepsilon} = \frac{d\boldsymbol{\omega}}{dt}.$$

If the body rotates about a fixed axis, vector $\boldsymbol{\varepsilon}$ is directed along
this axis. It coincides in direction with $\boldsymbol{\omega}$ in accelerated rotation
($d\omega/dt > 0$) and is opposite to $\boldsymbol{\omega}$ in direction in decelerated
rotation ($d\omega/dt < 0$). The projection of the angular acceleration
on the fixed axis OZ of rotation is equal to

$$\varepsilon_z = \frac{d\omega_z}{dt},$$

where $\omega_z$ is the projection of vector $\boldsymbol{\omega}$ on the same axis.

1.5.8 As distinct from the angular acceleration, the acceleration
a of arbitrary point M of a rigid body rotating about fixed
point O, or about a fixed axis passing through point O, is often
called the linear acceleration. It equals

$$\mathbf{a} = \frac{d\mathbf{v}}{dt} = \frac{d}{dt}[\boldsymbol{\omega}\,\mathbf{r}] = \mathbf{a}_r + \mathbf{a}_{ax},$$

where $\mathbf{a}_r = [\boldsymbol{\varepsilon}\,\mathbf{r}]$ is the rotary acceleration of the point, and
$\mathbf{a}_{ax} = [\boldsymbol{\omega}\,[\boldsymbol{\omega}\,\mathbf{r}]]$ is its axipetal acceleration, which is directed
toward the instantaneous axis of rotation.

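If the body rotates about the fixed axis OZ (Fig. 1.7), the rotary
acceleration of point M coincides with its tangential acceleration
$\mathbf{a}_\tau$ (1.4.5), and its axipetal acceleration coincides with its
normal acceleration $\mathbf{a}_n$ (1.4.6):

$$\mathbf{a}_\tau = [\boldsymbol{\varepsilon}\,\mathbf{r}] = [\boldsymbol{\varepsilon}\,\boldsymbol{\rho}] \quad\text{and}\quad \mathbf{a}_n = -\omega^2\boldsymbol{\rho}.$$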
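
The decomposition of the linear acceleration can be checked numerically. The sketch below, written for assumed sample vectors (rotation about the fixed axis OZ, with point O on the axis), evaluates the vector products and confirms that the axipetal term reduces to the normal acceleration. The helper `cross` and all numerical values are illustrative assumptions.

```python
def cross(a, b):
    """Vector product [a b] of two 3-component tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

# Assumed sample data for rotation about the fixed axis OZ.
omega = (0.0, 0.0, 2.0)   # angular velocity, rad/s
eps = (0.0, 0.0, 0.5)     # angular acceleration, rad/s^2
r = (3.0, 4.0, 5.0)       # radius vector of point M from point O, m
rho = (r[0], r[1], 0.0)   # component of r perpendicular to OZ

v = cross(omega, r)                    # linear velocity [omega r]
a_rot = cross(eps, r)                  # rotary acceleration [eps r]
a_ax = cross(omega, cross(omega, r))   # axipetal acceleration [omega [omega r]]

omega_sq = sum(c * c for c in omega)
a_n = tuple(-omega_sq * c for c in rho)  # normal acceleration -omega^2 * rho

print("v     =", v)
print("a_rot =", a_rot, " [eps rho] =", cross(eps, rho))
print("a_ax  =", a_ax, " a_n =", a_n)
```

Because the component of r along the axis drops out of every vector product with a vector directed along OZ, r may be replaced by the perpendicular vector rho in both terms.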

1.5.9 Any complex motion of a rigid body can be broken down
into two simple motions: translational motion at the velocity
$\mathbf{v}_A$ of a certain arbitrarily chosen point A of the body and rotation
about an instantaneous axis passing through this point. The
angular velocity $\boldsymbol{\omega}$ of rotation is independent of the choice of
point A. The velocity of arbitrary point M of the body is

$$\mathbf{v} = \mathbf{v}_A + [\boldsymbol{\omega}\,(\mathbf{r} - \mathbf{r}_A)],$$

where r and $\mathbf{r}_A$ are the radius vectors of points M and A.

In the dynamics of a rigid body it usually proves convenient to
deal with its complex motion as the combination of two simultaneous
motions: translational motion at the velocity of the
body’s centre of mass (2.3.3) and rotation about the centre of
mass.

The  simplest  case  of  complex  motion  of  a  body  is  plane,  or 
plane-parallel,  motion,  in  which  all  the  points  of  the  body  trav¬ 
el  in  parallel  planes.  This  is  the  motion,  for  instance,  of  a 
homogeneous  circular  cylinder  rolling  down  an  inclined  plane. 
In plane motion the direction of the instantaneous axis of rotation
of the body about point A remains constant and vectors
$\boldsymbol{\omega}$ and $\mathbf{v}_A$ are perpendicular to each other.
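
The statement in 1.5.9 that the angular velocity does not depend on the choice of point A can be illustrated with a cylinder in plane motion. The sketch below assumes, for illustration only, a cylinder of radius `R` rolling without slipping along the X axis; the rolling condition, the helper functions and the numbers are assumptions not taken from the handbook.

```python
def cross(a, b):
    """Vector product [a b] of two 3-component tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def add(a, b):
    return tuple(x + y for x, y in zip(a, b))

def sub(a, b):
    return tuple(x - y for x, y in zip(a, b))

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def point_velocity(v_A, omega, r, r_A):
    """Velocity of point M: v = v_A + [omega (r - r_A)]."""
    return add(v_A, cross(omega, sub(r, r_A)))

R = 0.5                          # cylinder radius, m (assumed)
v0 = 3.0                         # speed of the cylinder axis, m/s (assumed)
omega = (0.0, 0.0, -v0 / R)      # rolling without slipping (assumed condition)

centre, v_centre = (0.0, R, 0.0), (v0, 0.0, 0.0)
contact = (0.0, 0.0, 0.0)
top = (0.0, 2.0 * R, 0.0)

# Take the centre as point A, then repeat with the contact point as point A.
v_contact = point_velocity(v_centre, omega, contact, centre)
v_top_from_centre = point_velocity(v_centre, omega, top, centre)
v_top_from_contact = point_velocity(v_contact, omega, top, contact)

print(v_contact, v_top_from_centre, v_top_from_contact, dot(omega, v_centre))
```

Both choices of point A give the same velocity for the top point, and the angular velocity is perpendicular to the velocity of point A, as stated for plane motion.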


CHAPTER  2  NEWTON'S  LAWS 

2.1  Newton's  First  Law.  Inertial  Frames  of  Reference 

2.1.1  As  the  first  law  of  dynamics  Newton  took  a  law  that  had 
already  been  established  by  Galileo:  a  particle  persists  in  its 
state  of  rest  or  uniform  motion  in  a  straight  line  until  the  action 
of other bodies compels it to change that state.

Newton’s  first  law  indicates  that  no  external  action  is  required 
to  maintain  a  state  of  rest  or  of  uniform  motion  in  a  straight 
line.  This  manifests  a  special  dynamic  property  of  bodies, 
called  their  inertia.  Newton’s  first  law  is  consequently  called 
the  law  of  inertia,  and  the  motion  of  a  body  not  subject  to  the 
action  of  other  bodies  is  said  to  be  inertial  motion. 

2.1.2  Mechanical  motion  is  relative:  its  nature  may  differ  for 
different  frames  of  reference  (1.2.1)  that  are  moving  in  relation 
to  each  other.  An  astronaut,  for  instance,  on  board  a  man-made 
satellite  orbiting  the  earth,  may  be  at  rest  in  a  reference  frame 
fixed  in  the  satellite.  At  the  same  time,  he  is  travelling  together 
with the satellite with respect to the earth along an elliptic orbit,
i.e. not with uniform motion and not in a straight line. It is
thus obvious that Newton’s first law does not hold in every frame
of reference. A ball, for example, lying on the smooth floor
of  a  stateroom  in  a  ship  travelling  with  uniform  motion  at  sea 
in  a  straight  line,  may  suddenly  start  moving  along  the  floor 
without being subject to the action of any other bodies. This
will occur when the velocity of the ship, i.e. its speed and/or
direction,  begins  to  change. 

A  frame  of  reference  with  respect  to  which  a  particle,  not  sub¬ 
ject  to  external  action,  remains  at  rest  or  moves  uniformly  in  a 
straight line is called an inertial frame of reference. The content
of  Newton’s  first  law  reduces,  in  essence,  to  two  statements: 
in  the  first  place,  that  all  material  bodies  possess  the  property 
of  inertia,  and  secondly,  that  inertial  reference  frames  exist. 

2.1.3  Any  two  inertial  reference  frames  can  move  only  with 
translational  motion  with  respect  to  each  other,  and  only  uni¬ 
formly  in  a  straight  line.  It  has  been  experimentally  established 
that  the  heliocentric  frame  of  reference  is  practically  inertial.  The 
origin  of  its  coordinate  system  is  located  at  the  centre  of  mass 
(2.3.3)  of  the  solar  system  (approximately  at  the  centre  of  the 
sun)  and  the  three  axes  extend  in  the  directions  of  three  dis¬ 
tant  stars  selected,  for  example,  so  that  these  coordinate  axes 
are  mutually  perpendicular. 

A  laboratory  frame  of  reference,  whose  coordinate  axes  are  rigidly 
fixed  in  the  earth,  is  noninertial  chiefly  owing  to  the  earth’s 
diurnal  (daily)  rotation.  But  the  earth  rotates  so  slowly  that 
the maximum normal acceleration (1.4.6) of points on its surface
due to diurnal rotation does not exceed 0.034 m/s². Therefore,
in the great majority of practical problems, the laboratory
reference  frame  can  be  regarded  as  being  approximately  iner¬ 
tial. 
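
The quoted upper bound on the normal acceleration can be reproduced with a rough estimate, a_n = ω²R at the equator. The sketch below uses assumed round values for the sidereal day and the earth's equatorial radius; neither figure is given in the handbook.

```python
import math

SIDEREAL_DAY = 86164.0        # s, assumed period of the earth's diurnal rotation
EQUATORIAL_RADIUS = 6.378e6   # m, assumed equatorial radius of the earth
G_STANDARD = 9.81             # m/s^2, assumed free-fall acceleration for comparison

omega = 2.0 * math.pi / SIDEREAL_DAY       # angular velocity, rad/s
a_n_max = omega ** 2 * EQUATORIAL_RADIUS   # largest normal acceleration, at the equator

print("omega   =", omega, "rad/s")
print("a_n_max =", a_n_max, "m/s^2")
print("a_n_max / g =", a_n_max / G_STANDARD)
```

With these assumed inputs the result is about 0.034 m/s², a fraction of a percent of the free-fall acceleration, which is why the laboratory frame is usually treated as approximately inertial.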

2.1.4  Inertial  reference  frames  are  of  prime  importance  in  all 
branches  of  physics,  not  only  mechanics.  This  is  due  to  the 
fact  that  according  to  Einstein’s  principle  of  relativity  (5.1.2) 
the  mathematical  expression  of  any  physical  law  should  have 
the  same  form  in  all  inertial  reference  frames.  Hence,  in  the 
following we shall make use of only inertial frames of reference,
without stipulating this each time. The laws of motion of a particle
with respect to a noninertial reference frame are dealt with in
Chapter 7.


2.2  Force 

2.2.1  Force  is  a  vector  quantity  which  is  a  measure  of  the  mechan¬ 
ical  action  exerted  on  the  body  being  considered  by  other 
bodies. Mechanical interaction can be accomplished either between
bodies in direct contact (for instance, in friction, and in
the pressure of bodies against one another), or between remote
bodies.  A  special  form  of  matter,  connecting  particles  of  matter 
into  unified  systems  and  transmitting  the  action  of  certain  par¬ 
ticles  on  others  at  finite  velocities,  is  called  a  physical  field,  or 
simply  field.  Interaction  between  remote  bodies  is  accomplished 
by  the  gravitational  or  electromagnetic  fields  they  set  up  (for 
example,  the  attraction  of  the  planets  to  the  sun,  and  the  inter¬ 
action  of  charged  bodies  or  current-carrying  conductors).  Me¬ 
chanical  action  exerted  on  a  given  body  by  other  bodies  can  be 
manifested  in  two  ways.  In  the  first  place,  it  can  lead  to  a  change 
in the state of mechanical motion of the body being considered,
and secondly, to its deformation. Both of these effects can
serve  as  the  basis  for  measuring  forces.  For  example,  the  mea¬ 
surement  of  forces  by  means  of  a  spring  dynamometer  is  based 
on  Hooke’s  law  (40.3.4)  for  axial  tension. 

Using  the  concept  of  force,  we  usually  speak  in  mechanics  of 
the  motion  and  deformation  of  a  body  by  the  action  of  the  forces 
applied  to  it.  Here,  of  course,  each  force  always  corresponds 
to  a  certain  body  exerting  this  force  on  the  body  being  consid¬ 
ered. 

Force  F  is  completely  specified  by  giving  its  magnitude,  direc¬ 
tion  in  space  and  point  of  application.  The  straight  line  along 
which  a  force  is  exerted  is  called  its  line  of  action. 

A  field  acting  on  a  particle  with  the  force  F  is  said  to  be  steady, 
or  stationary,  if  it  does  not  change  in  the  course  of  time  t,  i.e. 
if  force  F  is  explicitly  independent  of  time  at  any  point  in  the 
field: $\partial\mathbf{F}/\partial t = 0$. For a field to be steady it is necessary that
the  bodies  which  set  it  up  are  at  rest  with  respect  to  the  ref¬ 
erence  frame  used  in  considering  the  field. 

2.2.2 The simultaneous action of several forces, $\mathbf{F}_1, \mathbf{F}_2, \ldots, \mathbf{F}_n$
(Fig. 2.1a) on a particle M is equivalent to the action of a single
force, called the resultant force and equal to the vector sum of
the component forces. Thus

$$\mathbf{F} = \sum_{i=1}^{n} \mathbf{F}_i.$$

It is the closing line of the polygon of forces $\mathbf{F}_1, \mathbf{F}_2, \ldots, \mathbf{F}_n$
(Fig. 2.1b).

If  a  body  is  perfectly  rigid,  the  action  of  a  force  on  it  does  not 
change  when  the  point  of  application  is  displaced  along  the 
line  of  action,  but  within  the  limits  of  the  body.  In  other  words, 
forces applied to perfectly rigid bodies can be regarded as nonlocalized
vectors.
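
The resultant force of 2.2.2 is a plain component-wise sum, and the polygon of forces is obtained by laying the force vectors end to end. A minimal sketch with three assumed sample forces (the values and the function name `resultant` are illustrative):

```python
def resultant(forces):
    """Vector sum of the component forces, taken component by component."""
    return tuple(sum(components) for components in zip(*forces))

# Assumed sample forces acting on particle M, in newtons.
forces = [(3.0, 0.0, 0.0), (0.0, 4.0, 0.0), (-1.0, -1.0, 2.0)]
F = resultant(forces)

# Vertices of the polygon of forces: each force starts where the previous one ends.
vertices = [(0.0, 0.0, 0.0)]
for f in forces:
    last = vertices[-1]
    vertices.append(tuple(p + c for p, c in zip(last, f)))

print("resultant:", F)
print("closing line runs from", vertices[0], "to", vertices[-1])
```

The last vertex of the polygon coincides with the resultant, which is the sense in which the resultant is the closing line.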

2.2.3 A body is said to be free if no limitations are imposed on
its position and motion in space. An airplane, for example,
flying in the air is a free body, as is a submarine travelling in
the depths of the sea. Most bodies that we deal with are not
free; certain limitations are imposed on their possible positions
and motion. These limitations are called constraints in mechanics.
A ball, for example, hung on a nonextensible thread, cannot
move away from its point of suspension by more than the
length of the thread; a streetcar (tram) must travel only along
its tracks. Constraints are realized due to the action on the
body being considered exerted by other bodies attached to or
contacting this body (for instance, the thread on the ball tied
to it, the track on the streetcar, etc.).

[Fig. 2.1]

In  investigating  the  behaviour  of  constrained  bodies  or  systems 
of  bodies  in  mechanics,  the  principle  of  releasability  is  applied. 
It  states  that  a  restrained  body  (or  system  of  bodies)  can  be 
treated  as  a  free  body  (or  system)  if  the  bodies  imposing  the 
constraints  are  replaced  by  the  corresponding  forces.  These 
forces  are  called  reaction  forces  of  the  constraints,  or  constraint 
reactions.  All  other  forces  exerted  on  the  body  are  called  active 
forces.  Thus,  the  motion  of  the  ball  hung  by  the  thread  can  be 
treated  as  the  motion  of  a  free  ball  subject,  in  addition  to  all 
the active forces applied to it (for instance, gravity), to the
reaction  of  the  thread  as  well. 

In contrast to the active forces, which should be given in each
specific  problem,  the  constraint  reactions  are  not  known  before¬ 
hand.  They  are  to  be  determined  in  solving  the  problem.  Their 
values  should  be  such  that  under  the  combined  effects  of  the 
active  forces  and  constraint  reactions,  the  motion  of  the  “re¬ 
leased”  body  completely  agrees  with  the  limitations  imposed  by 
the  constraints  on  the  constrained  body  being  considered.  There 
are  no  other  distinctions  between  constraint  reactions  and  active 
forces. 

2.2.4  Bodies  not  included  in  the  mechanical  system  being  in¬ 
vestigated  are  called  external  bodies.  Forces  exerted  on  the 
system by external bodies are called external forces. Correspondingly,
forces of interaction between the parts or particles
of  the  system  being  considered  are  called  internal  forces. 

A  mechanical  system  is  said  to  be  closed,  or  isolated,  if  it  does 
not  interact  with  external  bodies.  No  external  forces  act  on 
any  of  the  bodies  of  a  closed  system. 


2.3  Mass.  Momentum 

2.3.1  In  classical  (Newtonian)  mechanics,  the  mass  of  a  particle 
(material  point)  is  a  positive  scalar  quantity  which  is  a  measure 
of  the  inertia  of  this  particle.  Under  the  action  of  a  force  the 
velocity  of  a  particle  changes  gradually,  not  instantaneously, 
i.e.  it  acquires  an  acceleration  finite  in  magnitude  and  inversely 
proportional to the mass of the particle. To compare the masses,
$m_1$ and $m_2$, of two particles, it is sufficient to measure the
magnitudes $a_1$ and $a_2$ of the accelerations acquired by these
particles when the same force is applied to them: $m_2/m_1 = a_1/a_2$.
The mass of a body is ordinarily determined by weighing
on beam scales.

Classical  (Newtonian)  mechanics  contends  that: 

(a)  the  mass  of  a  particle  is  independent  of  its  state  of  motion, 
being  its  unalterable  characteristic; 

(b)  mass  is  an  additive  quantity,  i.e.  the  mass  of  a  system  (for 
instance,  a  body)  is  equal  to  the  sum  of  the  masses  of  all  the 
particles  making  up  the  system; 

(c) the mass of an isolated (closed) system (2.2.4) does not
change with any processes occurring within the system (law of
conservation  of  mass). 

These  propositions  of  Newtonian  mechanics  were  reconsidered 
and  revised  in  relativistic  mechanics  (5.6.1,  5.6.2,  5.7.3  and 
5.7.6). 

2.3.2 The density $\rho$ of a body at a given point M is the ratio
of the mass dm of an infinitely small element of the body, which
includes point M, to the quantity dV, which is the volume of
this element. Thus

$$\rho = \frac{dm}{dV}.$$


The  size  of  the  element  being  considered  should  be  so  small 
that  any  variation  of  density  within  its  limits  can  be  neglected. 
On  the  other  hand,  this  size  should  be  a  great  many  times  larger 
than  intermolecular  distances. 

A  body  is  said  to  be  homogeneous  if  the  density  is  constant  at 
all  of  its  points.  The  mass  of  a  homogeneous  body  is  the  product 
of its density by its volume: $m = \rho V$.

The mass of a nonhomogeneous body is

$$m = \int_{(V)} \rho\, dV,$$

where $\rho$ is a function of the coordinates, and integrating is
carried out over the whole volume of the body. The average density
$\langle\rho\rangle$ of a nonhomogeneous body is the ratio of its mass to
its volume: $\langle\rho\rangle = m/V$.
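
The volume integral for the mass of a nonhomogeneous body can be evaluated numerically once a density distribution is specified. The sketch below assumes, purely for illustration, a ball whose density depends only on the distance r from its centre and falls linearly to zero at the surface; the density law, the numbers and the function names are assumptions.

```python
import math

def ball_mass(density, radius, steps=10000):
    """Integrate density(r) over a ball by summing thin spherical shells (midpoint rule)."""
    dr = radius / steps
    mass = 0.0
    for k in range(steps):
        r_mid = (k + 0.5) * dr
        mass += density(r_mid) * 4.0 * math.pi * r_mid ** 2 * dr
    return mass

RHO_0 = 8000.0   # density at the centre, kg/m^3 (assumed)
R = 0.1          # radius of the ball, m (assumed)

def density(r):
    """Assumed density law: rho = rho_0 * (1 - r/R)."""
    return RHO_0 * (1.0 - r / R)

m = ball_mass(density, R)
V = 4.0 / 3.0 * math.pi * R ** 3
rho_avg = m / V                          # average density <rho> = m/V
m_exact = math.pi * RHO_0 * R ** 3 / 3.0  # closed form for this density law

print("m =", m, "kg;  exact =", m_exact, "kg;  <rho> =", rho_avg, "kg/m^3")
```

For this particular density law the integral can be done in closed form, m = πρ₀R³/3, so the average density is one quarter of the central density; the numerical sum should agree closely.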

2.3.3 The centre of inertia, or centre of mass, of a system of particles
is point C, whose radius vector $\mathbf{r}_C$ equals

$$\mathbf{r}_C = \frac{1}{m}\sum_{i=1}^{n} m_i \mathbf{r}_i,$$

where $m_i$ and $\mathbf{r}_i$ are the mass and radius vector of the ith particle,
n is the total number of particles in the system, and
$m = \sum_{i=1}^{n} m_i$ is the mass of the whole system.

The velocity of the centre of mass is

$$\mathbf{v}_C = \frac{d\mathbf{r}_C}{dt} = \frac{1}{m}\sum_{i=1}^{n} m_i \mathbf{v}_i.$$


2.3.4 The vector quantity $\mathbf{p}_i$, equal to the product of the mass
$m_i$ of a particle by its velocity $\mathbf{v}_i$, is called the momentum of
this particle. The momentum of a system of particles is the vector
p, which is the geometric sum of the momenta of all the particles
making up the system:

$$\mathbf{p} = \sum_{i=1}^{n} \mathbf{p}_i = \sum_{i=1}^{n} m_i \mathbf{v}_i.$$

The momentum of a system is equal to the product of the mass
of the whole system by the velocity of its centre of mass:
$\mathbf{p} = m\mathbf{v}_C$.
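
The definitions of 2.3.3 and 2.3.4 translate directly into weighted sums. The sketch below computes the centre of mass, its velocity and the momentum of an assumed three-particle system; the masses, positions, velocities and the helper `weighted_sum` are illustrative assumptions.

```python
def weighted_sum(weights, vectors):
    """Component-wise sum of weights[i] * vectors[i] for 3-component tuples."""
    return tuple(sum(w * vec[k] for w, vec in zip(weights, vectors)) for k in range(3))

# Assumed sample system of three particles (SI units).
masses = [1.0, 2.0, 3.0]
positions = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 2.0, 0.0)]
velocities = [(1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (-1.0, 0.0, 2.0)]

m = sum(masses)                                              # mass of the whole system
r_C = tuple(c / m for c in weighted_sum(masses, positions))  # centre of mass
p = weighted_sum(masses, velocities)                         # momentum of the system
v_C = tuple(c / m for c in p)                                # velocity of the centre of mass

print("r_C =", r_C)
print("v_C =", v_C)
print("p   =", p, " m*v_C =", tuple(m * c for c in v_C))
```

The last line compares the sum of the individual momenta with the product of the total mass by the velocity of the centre of mass; the two should coincide to rounding error.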


2.4  Newton's  Second  Law 

2.4.1  The  basic  law  of  the  dynamics  of  a  particle  is  Newton’s 
second  law,  which  discusses  how  the  mechanical  motion  of  a 
particle  is  changed  by  the  effect  of  forces  applied  to  it.  New¬ 
ton's  second  law  states:  the  rate  of  change  in  the  momentum  p 
of a particle is equal to the force F acting on it. Thus

$$\frac{d\mathbf{p}}{dt} = \mathbf{F} \quad\text{or}\quad \frac{d}{dt}(m\mathbf{v}) = \mathbf{F},$$

where  m  and  v  are  the  mass  and  velocity  of  the  particle. 

If  several  forces  act  simultaneously  on  a  particle,  the  force  F 
in  Newton’s  second  law  is  understood  to  be  the  geometric  sum 
of  all  acting  forces,  both  active  forces  and  constraint  reactions 
(2.2.3),  i.e.  the  resultant  force  (2.2.2). 

2.4.2 The vector quantity F dt is called an element of impulse
of force F during the infinitely short time dt of its action. The
impulse of force F acting in the finite time interval from $t = t_1$
to $t = t_2$ is equal to the definite integral $\int_{t_1}^{t_2} \mathbf{F}\,dt$, where F, in
the general case, depends upon the time t.

According  to  Newton’s  second  law,  the  change  in  the  momentum 
of  a  particle  is  equal  to  the  impulse  of  the  force  acting  on  it: 


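$$d\mathbf{p} = \mathbf{F}\,dt \quad\text{and}\quad \Delta\mathbf{p} = \mathbf{p}_2 - \mathbf{p}_1 = \int_{t_1}^{t_2} \mathbf{F}\,dt,$$

where $\mathbf{p}_2 = \mathbf{p}(t_2)$ and $\mathbf{p}_1 = \mathbf{p}(t_1)$ are the values of the momentum
of a particle at the end ($t = t_2$) and beginning ($t = t_1$) of
the time interval being considered.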
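
The impulse integral of 2.4.2 can be approximated by a sum of elements F dt. The sketch below assumes a time-dependent force F(t) = (2t, 0, 3) N acting on a particle of mass 0.5 kg from t = 0 to t = 2 s; the force law, the mass and the initial velocity are illustrative assumptions.

```python
def force(t):
    """Assumed force law, in newtons: F(t) = (2t, 0, 3)."""
    return (2.0 * t, 0.0, 3.0)

def impulse(force_law, t1, t2, steps=1000):
    """Approximate the integral of F dt from t1 to t2 by the midpoint rule."""
    dt = (t2 - t1) / steps
    total = [0.0, 0.0, 0.0]
    for k in range(steps):
        f = force_law(t1 + (k + 0.5) * dt)
        for j in range(3):
            total[j] += f[j] * dt
    return tuple(total)

m = 0.5                  # mass of the particle, kg (assumed)
v1 = (1.0, 0.0, 0.0)     # velocity at t = t1, m/s (assumed)

p1 = tuple(m * c for c in v1)
J = impulse(force, 0.0, 2.0)               # impulse of the force
p2 = tuple(a + b for a, b in zip(p1, J))   # p2 = p1 + impulse
v2 = tuple(c / m for c in p2)

print("impulse =", J)
print("p2 =", p2, " v2 =", v2)
```

For this force law the integral is (4, 0, 6) N·s exactly, so the final momentum should be close to (4.5, 0, 6) kg·m/s.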

2.4.3 Since in Newtonian mechanics the mass m of a particle is
independent of its state of motion, dm/dt = 0. Hence, the
mathematical expression of Newton’s second law can also be
presented in the form

$$\mathbf{a} = \frac{\mathbf{F}}{m},$$

where $\mathbf{a} = d\mathbf{v}/dt = d^2\mathbf{r}/dt^2$ is the acceleration of the particle
and  r  is  its  radius  vector.  The  corresponding  statement  of 
Newton's  second  law  reads:  the  acceleration  of  a  particle  coin¬ 
cides  in  direction  with  the  force  acting  on  it  and  is  equal  to  the 
ratio  of  this  force  to  the  mass  of  the  particle. 

The tangential and normal accelerations of a particle (1.4.4,
1.4.5 and 1.4.6) are determined by the corresponding components
of force F:

$$a_\tau = \frac{dv}{dt} = \frac{F_\tau}{m} \quad\text{and}\quad a_n = \frac{v^2}{R} = \frac{F_n}{m},$$

where v is the magnitude of the particle’s velocity vector, and
R is the radius of curvature of its path. The force $\mathbf{F}_n$, imparting
normal acceleration to a particle, is directed toward the centre
of curvature (1.2.4) of the particle’s path and is therefore called
the centripetal force.

2.4.4 If a particle is subject to the simultaneous action of several
forces, $\mathbf{F}_1, \mathbf{F}_2, \ldots, \mathbf{F}_n$, its acceleration is

$$\mathbf{a} = \frac{1}{m}\sum_{i=1}^{n}\mathbf{F}_i = \sum_{i=1}^{n}\mathbf{a}_i,$$

where $\mathbf{a}_i = \mathbf{F}_i/m$. Consequently, each of the forces acting simultaneously
on a particle imparts to it the acceleration it
would have if there were no other forces (principle of superposition,
or independence, of forces).

The equation

$$m\frac{d^2\mathbf{r}}{dt^2} = \sum_{i=1}^{n}\mathbf{F}_i$$

is called the differential equation of motion of a particle.

In  projections  on  the  axes  of  a  rectangular  Cartesian  coordinate 
system,  this  equation  is  of  the  form 


$$m\frac{d^2x}{dt^2} = F_x, \quad m\frac{d^2y}{dt^2} = F_y \quad\text{and}\quad m\frac{d^2z}{dt^2} = F_z,$$


where  x,  y  and  z  are  the  coordinates  of  the  moving  particle. 


2.5  Newton's  Third  Law.  Motion  of  the  Centre  of 
Mass 

2.5.1  The  mechanical  action  exerted  by  bodies  on  one  another 
is of the nature of their interaction. This is what Newton’s third
law  deals  with:  two  particles  exert  forces  on  each  other  which 
are  equal  in  magnitude  and  opposite  in  direction  along  a  line 
connecting  these  particles. 

If $\mathbf{F}_{ik}$ is the force exerted on the ith particle by the kth one, and
$\mathbf{F}_{ki}$ is the force exerted on the kth particle by the ith one, then,
according to Newton’s third law,

$$\mathbf{F}_{ki} = -\mathbf{F}_{ik}.$$

Forces $\mathbf{F}_{ik}$ and $\mathbf{F}_{ki}$ are applied to different particles and can be
mutually counterbalanced only when particles i and k belong
to the same perfectly rigid body.

2.5.2  Newton’s  third  law  is  an  essential  supplement  to  the 
first  and  second  laws.  It  permits  us  to  go  over  from  the  dynam¬ 
ics  of  a  separate  particle  to  that  of  an  arbitrary  mechanical 
system  (system  of  particles).  It  follows  from  Newton’s  third 
law  that  in  any  mechanical  system  the  geometric  sum  of  all 
the  internal  forces  (2.2.4)  equals  zero: 

$$\sum_{i=1}^{n}\sum_{k=1}^{n}\mathbf{F}_{ik} = 0,$$

where n is the number of particles in the system, and $\mathbf{F}_{ii} = 0$.
The vector $\mathbf{F}^{\text{ext}}$, equal to the vector sum of all the external
forces (2.2.4) acting on the system, is called the principal vector
of external forces:

$$\mathbf{F}^{\text{ext}} = \sum_{i=1}^{n}\mathbf{F}_i^{\text{ext}},$$

where $\mathbf{F}_i^{\text{ext}}$ is the resultant external force applied to the ith
particle.

2.5.3 It follows from Newton’s second and third laws that the
first derivative with respect to the time t of the momentum p
of a mechanical system (2.3.4) is equal to the principal vector
of all external forces applied to the system. Thus

$$\frac{d\mathbf{p}}{dt} = \mathbf{F}^{\text{ext}}.$$

This  equation  expresses  the  law  of  the  change  in  the  momentum 
of  a  system. 

Since $\mathbf{p} = m\mathbf{v}_C$, where m is the mass of a system and $\mathbf{v}_C$ is
the velocity of its centre of inertia, or mass, the law of motion
of the centre of mass of a mechanical system is of the form

$$\frac{d}{dt}(m\mathbf{v}_C) = \mathbf{F}^{\text{ext}} \quad\text{or}\quad m\mathbf{a}_C = \mathbf{F}^{\text{ext}},$$

where $\mathbf{a}_C = d\mathbf{v}_C/dt$ is the acceleration of the centre of mass.
Thus, the centre of mass (centre of inertia) of a mechanical
system travels like a particle whose mass is equal to the mass
of the whole system and which is subject to a force equal to
the principal vector of the external forces applied to the system.

If the system being considered is a rigid body that travels with
translational motion (1.5.1), the velocities $\mathbf{v}_i$ of all the points
of the body and $\mathbf{v}_C$ of its centre of mass are the same and are
equal to velocity v of the body. Correspondingly, the acceleration
of the body $\mathbf{a} = \mathbf{a}_C$ and the basic equation in the dynamics
of translational motion of a rigid body is of the form

$$m\mathbf{a} = \mathbf{F}^{\text{ext}}.$$
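
The law of the change in the momentum of a system can be illustrated with a system in which strong internal forces act alongside an external one. The sketch below assumes two particles joined by a spring (the internal force) and falling under gravity (the external force), moving along a vertical line; the spring model, the integration scheme and all numbers are illustrative assumptions.

```python
G = 9.81                 # free-fall acceleration, m/s^2 (assumed)
m1, m2 = 1.0, 3.0        # masses of the two particles, kg (assumed)
k, L0 = 50.0, 1.0        # spring stiffness (N/m) and unstretched length (m), assumed
y1, y2 = 0.0, 1.5        # initial heights, m (spring initially stretched)
v1, v2 = 2.0, -1.0       # initial vertical velocities, m/s
dt, steps = 1e-4, 10000  # time step and number of steps

p_start = m1 * v1 + m2 * v2
for _ in range(steps):
    # Internal force on particle 1 exerted by particle 2; particle 2 receives the opposite force.
    f_int = k * ((y2 - y1) - L0)
    a1 = (f_int - m1 * G) / m1
    a2 = (-f_int - m2 * G) / m2
    v1 += a1 * dt
    v2 += a2 * dt
    y1 += v1 * dt
    y2 += v2 * dt
p_end = m1 * v1 + m2 * v2

t = dt * steps
p_expected = p_start - (m1 + m2) * G * t   # change produced by the external force alone

print("p_end =", p_end, " expected =", p_expected)
```

The internal spring forces cancel in pairs, so the total momentum changes only by the impulse of gravity, and the centre of mass accelerates as a single particle of mass m1 + m2 would.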


2.6  Motion  of  a  Body  of  Variable  Mass 

2.6.1  In  Newtonian  mechanics  the  mass  of  a  body  can  be 
changed  only  if  particles  of  matter  are  detached  from  or  added 
to  it.  An  example  of  such  a  body  is  a  rocket.  In  its  flight  the  mass 
of  the  rocket  is  gradually  reduced  because  the  gaseous  products  of 
combustion  in  the  rocket  engine  are  ejected  through  the  noz¬ 
zles. 

The equation of translational motion of a body of variable mass
(Meshchersky equation) is

$$m\frac{d\mathbf{v}}{dt} = \mathbf{F}^{\text{ext}} + (\mathbf{v}_1 - \mathbf{v})\frac{dm}{dt},$$

where m and v are the mass and velocity of the body at the instant
of time being considered, $\mathbf{F}^{\text{ext}}$ is the principal vector of
the external forces (2.5.2) acting on the body, and $\mathbf{v}_1$ is the
velocity of the detached particles after separation from the
rocket (if dm/dt < 0) or of the added particles before being joined
(if dm/dt > 0).

2.6.2  The  second  member  on  the  right-hand  side  of  the  Me¬ 
shchersky  equation  represents  the  additional  force  acting  on  a 
body  of  variable  mass.  It  is  called  the  reactive  force: 


$$\mathbf{F}_r = (\mathbf{v}_1 - \mathbf{v})\frac{dm}{dt} = \mathbf{u}\frac{dm}{dt},$$

where $\mathbf{u} = \mathbf{v}_1 - \mathbf{v}$ is the relative velocity of the particles being
separated from or joined to the body, i.e. their velocity with
respect to a reference frame in translational motion together
with the body.

The reactive force characterizes the mechanical action exerted
on the body by the particles separated from or joined to it (for
instance, the effect of the ejected exhaust gases on the rocket).

2.6.3 The equation of motion of a rocket not subject to the
action of external forces is

$$m\frac{d\mathbf{v}}{dt} = \mathbf{u}\frac{dm}{dt}.$$

If  the  initial  velocity  of  the  rocket  is  zero  the  rocket  flies  in  a 
straight  line  in  the  direction  opposite  to  that  of  the  relative 
velocity  u  of  the  exhaust  gases  ejected  from  the  engine  nozzle. 
Here 

$$m\frac{dv}{dt} = -u\frac{dm}{dt},$$

and at u = const the relation between the velocity of the rocket
and its mass is expressed by Tsiolkovsky's formula

$$v = u\ln\frac{m_0}{m},$$

where $m_0$ is the initial, or launching, mass of the rocket.
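
Tsiolkovsky's formula is the integral of dv = −u dm/m at constant u, and this can be verified by stepping the equation of motion numerically. The sketch below assumes a nozzle velocity of 3000 m/s and a rocket whose mass falls from 1000 kg to 250 kg; these numbers and the function names are illustrative assumptions.

```python
import math

def tsiolkovsky(u, m0, m):
    """Closed-form velocity v = u * ln(m0/m) for a rocket starting from rest."""
    return u * math.log(m0 / m)

def integrate_rocket(u, m0, m_final, steps=100000):
    """Sum the increments dv = -u * dm / m (midpoint rule) as the mass drops from m0 to m_final."""
    dm = (m_final - m0) / steps   # negative: mass is being ejected
    v, m = 0.0, m0
    for _ in range(steps):
        m_mid = m + 0.5 * dm
        v += -u * dm / m_mid
        m += dm
    return v

U = 3000.0       # relative velocity of the exhaust gases, m/s (assumed)
M0 = 1000.0      # launching mass, kg (assumed)
M_END = 250.0    # mass at the instant considered, kg (assumed)

v_exact = tsiolkovsky(U, M0, M_END)
v_numeric = integrate_rocket(U, M0, M_END)
print("formula:", v_exact, "m/s;  step-by-step:", v_numeric, "m/s")
```

The two values should agree closely, since the formula is exactly what the step-by-step sum converges to as the mass increments shrink.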

2.6.4  The  maximum  velocity  that  a  rocket  can  reach  in  the 
absence  of  external  forces  is  called  its  characteristic  velocity. 
This  velocity  is  acquired  at  the  moment  the  rocket  engine 
stops  because  it  has  used  up  all  the  fuel  and  oxidizer  that  were 
on  board, 


$$v_{\max} = u\ln\frac{m_0}{m_0 - m_f},$$

where $m_f$ is the initial mass of the fuel and oxidizer.

The  effects  of  the  earth’s  gravity  and  air  resistance  lead  to  an 
appreciable  reduction  in  the  maximum  velocity  actually  ac¬ 
quired  by  the  rocket,  during  the  operation  of  its  engine,  as  com¬ 
pared  to  the  characteristic  velocity. 

2.6.5  The  characteristic  velocity  of  a  multistage  (combination) 
rocket  is 


$$v_{\max} = \sum_{i=1}^{n} u_i \ln\frac{m_{0i}}{m_{0i} - m_{fi}},$$

where n is the total number of stages of the rocket, $m_{fi}$ is the
mass of the fuel and oxidizer intended for the operation of the
ith stage engine, $u_i$ is the exhaust gas velocity (nozzle velocity)
of the ith stage engine, and $m_{0i}$ is the launching mass of the
multistage rocket, including all the stages from the ith to the
nth. The velocity increase of a multistage rocket, compared to
that  of  a  single-stage  one  having  the  same  launching  mass  and 
the  same  fuel  and  oxidizer  capacity,  is  due  to  the  reduction  in 
the  mass  of  the  rocket  as  the  first,  second  and  subsequent  stages 
are  consecutively  jettisoned  when  their  fuel  has  been  ex¬ 
hausted. 
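
The advantage of staging can be made concrete by evaluating the characteristic velocity for a two-stage rocket and for a single-stage rocket with the same launching mass and the same total fuel. The payload, fuel and structural masses below are invented for illustration, as is the function name; only the formula itself comes from the text.

```python
import math

def characteristic_velocity(stages):
    """Sum of u_i * ln(m0_i / (m0_i - mf_i)) over stages given as (u_i, m0_i, mf_i)."""
    return sum(u * math.log(m0 / (m0 - mf)) for u, m0, mf in stages)

U = 3000.0                    # nozzle velocity of every engine, m/s (assumed)
payload = 100.0               # kg (assumed)
fuel = [6000.0, 2000.0]       # fuel and oxidizer of stages 1 and 2, kg (assumed)
structure = [600.0, 200.0]    # dry structural mass of stages 1 and 2, kg (assumed)

m02 = payload + fuel[1] + structure[1]   # mass when stage 2 ignites (stage 1 jettisoned)
m01 = m02 + fuel[0] + structure[0]       # launching mass of the whole rocket

two_stage = characteristic_velocity([(U, m01, fuel[0]), (U, m02, fuel[1])])
single_stage = characteristic_velocity([(U, m01, sum(fuel))])

print("two-stage:   ", two_stage, "m/s")
print("single-stage:", single_stage, "m/s")
```

The single-stage rocket must carry the empty structure of the first stage to the end of the burn, which is why its characteristic velocity comes out lower for the same launching mass and fuel load.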


2.7 Law of Conservation of Momentum


2.7.1  The  law  of  conservation  of  momentum  is:  the  momentum  p 
of  a  closed,  or  isolated,  system  does  not  change  in  the  course 
of  time,  i.e. 


$$\frac{d\mathbf{p}}{dt} = 0 \quad\text{and}\quad \mathbf{p} = \text{const}.$$


In  contrast  to  Newton’s  laws,  the  law,  or  principle,  of  conser¬ 
vation  of  momentum  is  valid  not  only  within  the  framework  of 
classical  mechanics.  It  belongs  to  the  most  fundamental  of 
physical  laws  because  it  is  concerned  with  a  definite  property 
of  the  symmetry  of  space:  its  homogeneity.  The  homogeneity 
of  space  is  manifested  in  the  fact  that  the  physical  properties 
of  a  closed  system  and  the  laws  of  its  motion  are  independent 
of  the  choice  of  position  of  the  origin  of  the  inertial  reference 
frame,  i.e.  these  properties  and  laws  are  unchanged  by  a  parallel 
translation  of  the  closed  system  as  a  whole  in  space.  According 
to  modern  concepts,  fields  can  also  have  momentum,  not  only 
particles  and  bodies.  Light,  for  example,  exerts  a  pressure  on 
the  surface  of  a  body  that  reflects  or  absorbs  it  because  the  ele¬ 
ctromagnetic  field  of  the  light  wave  has  momentum. 

2.7.2  In  application  to  systems  described  by  classical  (Newto¬ 
nian)  mechanics,  the  principle  of  conservation  of  momentum 
can  be  regarded  as  a  consequence  of  Newton’s  laws.  For  a  closed 
mechanical  system  the  principal  vector  of  external  forces 
$\mathbf{F}^{\text{ext}} = 0$ and the principle of conservation of momentum follows
from (2.5.3):

$$\mathbf{p} = \sum_{i=1}^{n} m_i\mathbf{v}_i = \text{const},$$

where $m_i$ and $\mathbf{v}_i$ are the mass and velocity of the ith particle of
the system, which consists of n particles.

Consequently,  there  is  also  no  change  in  the  projections  of  the 
momentum  of  a  closed  system  onto  the  axes  of  Cartesian  coordi¬ 
nates  of  an  inertial  reference  frame: 
$$p_x = \sum_{i=1}^{n} m_i v_{ix} = \text{const},$$

$$p_y = \sum_{i=1}^{n} m_i v_{iy} = \text{const},$$

$$p_z = \sum_{i=1}^{n} m_i v_{iz} = \text{const}.$$

The momentum of the system is $\mathbf{p} = m\mathbf{v}_C$, where m is the mass
of the system and $\mathbf{v}_C$ is the velocity of its centre of mass (2.3.4).
It follows, therefore, from the principle of conservation of momentum
that in any processes occurring within a closed system
the velocity of its centre of mass does not change: $\mathbf{v}_C = \text{const}$.

2.7.3 If a system is not closed, but the external forces acting
on it are such that their principal vector is identically zero
($\mathbf{F}^{\text{ext}} \equiv 0$), then, according to Newton’s laws (2.5.3), the momentum
of the system does not change with time: $\mathbf{p} = \text{const}$.

As a rule, $\mathbf{F}^{\text{ext}} \neq 0$ and $\mathbf{p} \neq \text{const}$. But if the projection of the
principal vector of external forces onto any fixed axis is identically
zero, the projection onto the same axis of the momentum
vector of the system does not change with time. Thus, $p_x = \text{const}$
under the condition that $F_x^{\text{ext}} \equiv 0$. If, for instance, no other
external  forces,  except  gravity,  are  acting  on  a  system,  the 
horizontal  component  of  the  momentum  of  the  system,  perpen¬ 
dicular  to  the  direction  of  the  gravitational  force,  remains  con¬ 
stant. 

2.7.4 In some processes (e.g. collisions or the firing of shells),
the  momenta  of  the  parts  of  the  system  undergo  great  changes  in 
relatively  short  time  intervals.  This  is  due  to  internal  forces  of 
interaction,  of  short  duration,  but  of  quite  considerable  magni¬ 
tude,  developed  within  the  system.  In  comparison  with  these 
forces,  all  external  forces  acting  continuously  on  the  system 
(for  instance,  gravity)  are  small.  In  such  processes,  the  action 
of  the  system  of  external  forces  can  usually  be  ignored,  i.e.  it 
can  be  assumed  that  the  momentum  of  the  system  as  a  whole  is 
not  changed  in  the  process  being  considered. 


2.8  Galilean  Transformations.  Mechanical  Principle 
of  Relativity 

2.8.1 A Galilean transformation is a transformation of coordinates and
time that is applied in Newtonian mechanics in going over from one
inertial frame of reference $K(x, y, z, t)$ to another
$K'(x', y', z', t')$ which travels with translational motion with respect
to $K$ at the constant velocity $\mathbf{V}$. The Galilean transformation is
based on axioms concerning the absolute nature of time intervals and of
lengths. The first axiom contends that the passage of time (the time
interval between any two events) is the same in all inertial frames of
reference. According to the second axiom, the dimensions of a body are
independent of its velocity of motion with respect to the frame of
reference.

If the like axes of Cartesian coordinates of inertial reference frames
$K$ and $K'$ are drawn parallel to one another pairwise and if at the
initial instant of time ($t = t' = 0$) the origins $O$ and $O'$ of
coordinates coincide with each other (Fig. 2.2), the Galilean
transformations are of the form

$$x' = x - V_x t, \quad y' = y - V_y t, \quad z' = z - V_z t \quad \text{and} \quad t' = t$$

or

$$\mathbf{r}' = \mathbf{r} - \mathbf{V}t \quad \text{and} \quad t' = t,$$

where $x, y, z$ and $x', y', z'$ are the coordinates of point $M$ in the
frames of reference $K$ (at the instant of time $t$) and $K'$ (at the
instant of time $t' = t$), $\mathbf{r}$ and $\mathbf{r}'$ are the radius vectors
of point $M$ in the same frames of reference, and $V_x$, $V_y$ and $V_z$
are the projections of the velocity $\mathbf{V}$ of reference frame $K'$
onto the coordinate axes of reference frame $K$.

The coordinate axes are usually arranged so that frame of reference
$K'$ travels along the positive direction of axis $OX$ (Fig. 2.3). Under
this condition the Galilean transformations take their simplest form:

$$x' = x - Vt, \quad y' = y, \quad z' = z \quad \text{and} \quad t' = t.$$

2.8.2 A consequence of the Galilean transformations is the following
law for transforming the velocity of an arbitrary point $M$ (Fig. 2.2)
in going over from one inertial frame of reference $K$ (in which the
velocity of the point is $\mathbf{v} = d\mathbf{r}/dt$) to another $K'$ (in
which the velocity of the same point is $\mathbf{v}' = d\mathbf{r}'/dt$):

$$\mathbf{v}' = \mathbf{v} - \mathbf{V}.$$

The projections of the velocity on like coordinate axes are
transformed accordingly:

$$v'_{x'} = v_x - V_x, \quad v'_{y'} = v_y - V_y \quad \text{and} \quad v'_{z'} = v_z - V_z.$$

In particular, when reference frame $K'$ travels along the positive
direction of axis $OX$ (Fig. 2.3)

$$v'_{x'} = v_x - V, \quad v'_{y'} = v_y \quad \text{and} \quad v'_{z'} = v_z.$$

The accelerations of point $M$ in frames of reference $K$
($\mathbf{a} = d\mathbf{v}/dt$) and $K'$ ($\mathbf{a}' = d\mathbf{v}'/dt$) are the same:
$\mathbf{a}' = \mathbf{a}$.

Hence, the acceleration of a particle is independent of the choice of
the inertial frame of reference or, in other words, it is invariant
with respect to Galilean transformations.
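
A short numerical sketch of 2.8.1 and 2.8.2 follows. The motion of point M, the frame velocity V and the helper names are all hypothetical, chosen only for illustration. The code transforms a uniformly accelerated trajectory from K to K' and estimates the acceleration in both frames by a central second difference.

```python
def galilean_position(r, t, V):
    """r' = r - V t: coordinates of the same point in frame K'."""
    return tuple(r_k - V_k * t for r_k, V_k in zip(r, V))


def galilean_velocity(v, V):
    """v' = v - V."""
    return tuple(v_k - V_k for v_k, V_k in zip(v, V))


def second_difference(path, t, h=1e-2):
    """Central second difference of path(t); approximates the acceleration."""
    r_minus, r_0, r_plus = path(t - h), path(t), path(t + h)
    return tuple((p - 2.0 * c + m) / h**2 for p, c, m in zip(r_plus, r_0, r_minus))


# Hypothetical motion of point M in frame K: uniformly accelerated motion.
r0 = (1.0, 2.0, 0.0)         # m
v0 = (3.0, 0.0, 1.0)         # m/s
a = (0.0, -9.81, 0.5)        # m/s^2
V = (10.0, -2.0, 0.0)        # velocity of frame K' with respect to K, m/s


def path_in_K(t):
    return tuple(r0[k] + v0[k] * t + 0.5 * a[k] * t * t for k in range(3))


def path_in_K_prime(t):
    return galilean_position(path_in_K(t), t, V)


t = 2.0
v_K = tuple(v0[k] + a[k] * t for k in range(3))
v_K_prime = galilean_velocity(v_K, V)
a_K = second_difference(path_in_K, t)
a_K_prime = second_difference(path_in_K_prime, t)

print("v in K:", v_K, " v' in K':", v_K_prime)
print("a in K:", a_K, " a' in K':", a_K_prime)
```

The velocities differ by V, while the two acceleration estimates agree to within rounding error, in line with a' = a.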

2.8.3 The forces of interaction between particles depend only upon
their relative position and their velocity with respect to one another.
The relative position of any two particles, 2 and 1, is represented by
a vector equal to the difference between the radius vectors of the two
particles, i.e. to the vector $\mathbf{r}_{21} = \mathbf{r}_2 - \mathbf{r}_1$ in
reference frame $K$ and to the vector
$\mathbf{r}'_{21} = \mathbf{r}'_2 - \mathbf{r}'_1$ in reference frame $K'$. It follows
from the Galilean transformations that $\mathbf{r}'_{21} = \mathbf{r}_{21}$.
Therefore, the distance between particles 1 and 2 is the same in
reference frames $K$ and $K'$:

$$r'_{21} = r_{21} \quad \text{or} \quad (x'_2 - x'_1)^2 + (y'_2 - y'_1)^2 + (z'_2 - z'_1)^2 = (x_2 - x_1)^2 + (y_2 - y_1)^2 + (z_2 - z_1)^2.$$

The velocity of particle 2 with respect to particle 1 is equal to the
difference of the velocities of these two particles:
$\mathbf{v}_2 - \mathbf{v}_1$ (in reference frame $K$) and
$\mathbf{v}'_2 - \mathbf{v}'_1$ (in reference frame $K'$). It follows from the
Galilean transformations that
$\mathbf{v}'_2 - \mathbf{v}'_1 = \mathbf{v}_2 - \mathbf{v}_1$. Thus, the relative
position and the velocity of relative motion of any two particles are
independent of the choice of inertial reference frame; they are
invariant under a Galilean transformation. Accordingly, the forces
acting on a particle are also invariant under a Galilean
transformation: $\mathbf{F}' = \mathbf{F}$.

2.8.4 Equations expressing Newton’s laws (2.4.3 and 2.5.1) are
invariant with respect to Galilean transformations, i.e. their form is
unchanged in transforming the coordinates and time from one inertial
frame of reference ($K$) to another ($K'$):

$$m\mathbf{a} = \mathbf{F} \quad \text{and} \quad \mathbf{F}_{ki} = -\mathbf{F}_{ik} \quad (\text{in reference frame } K),$$

$$m'\mathbf{a}' = \mathbf{F}' \quad \text{and} \quad \mathbf{F}'_{ki} = -\mathbf{F}'_{ik} \quad (\text{in reference frame } K'),$$

where $m' = m$ is the mass of the particle being considered and is the
same in all frames of reference.

Thus, the mechanical principle of relativity (Galileo's principle of
relativity) is valid in classical mechanics. It contends that the laws
of mechanics are the same in all inertial frames of reference. This
implies that under the same conditions all mechanical processes proceed
in the same way in different frames of reference. Consequently, it is
impossible to establish, by means of any mechanical experiments
conducted within a closed system of bodies, whether the system is at
rest or is travelling at uniform velocity in a straight line (with
respect to any inertial frame of reference whatsoever).

The  mechanical  principle  of  relativity  points  to  the  fact  that 
all  inertial  frames  of  reference  are  equally  valid  in  mechanics. 
It  is  impossible  to  single  out  any  special,  or  “main”,  inertial 
frame  of  reference  with  respect  to  which  the  motion  of  bodies 
could  be  regarded  as  “absolute  motion”. 

2.8.5  The  principle  of  relativity  was  generalized  to  include  all 
physical  phenomena  by  Albert  Einstein  in  his  special  theory  of 
relativity  (5.1.2).  It  was  found  that  coordinates  and  time  in 
various  inertial  frames  of  reference  are  related  by  the  Lorentz 
transformations  (5.3.2)  rather  than  the  Galilean  transforma¬ 
tions.  But  at  low  velocities  of  relative  motion  of  the  frames  of 
reference  (in  comparison  to  the  velocity  of  light  in  free  space), 
the  Lorentz  transformations  become  the  Galilean  transforma¬ 
tions. 
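
The low-velocity limit can be made explicit. The Lorentz transformations themselves belong to Sect. 5.3.2 and are not reproduced in this chapter, so the standard form for a frame K' travelling along the positive direction of axis OX is quoted here as an assumption, with c denoting the velocity of light in free space:

```latex
x' = \frac{x - Vt}{\sqrt{1 - V^2/c^2}}, \qquad
y' = y, \qquad z' = z, \qquad
t' = \frac{t - Vx/c^2}{\sqrt{1 - V^2/c^2}} .

% For V \ll c the square root tends to 1 and the term Vx/c^2 becomes
% negligible compared with t, so that
x' \approx x - Vt, \qquad y' = y, \qquad z' = z, \qquad t' \approx t ,
% which is the simplest form of the Galilean transformations (2.8.1).
```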


CHAPTER  3  WORK  AND  MECHANICAL  ENERGY 


3.1  Energy,  Work  and  Power 

3.1.1  Energy  is  a  scalar  physical  quantity,  which  is  a  common 
measure  of  the  various  forms  of  motion  of  matter  dealt  with  in 
physics.  The  energy  of  a  system  quantitatively  characterizes 
this  system  with  respect  to  the  conversions  of  motion  that  are 
possible  within  it.  These  conversions  take  place  due  to  the  in¬ 
teraction  between  parts  of  the  system,  both  with  one  another 
and  with  external  bodies  (the  environment).  The  concept  of 
various  forms  of  energy  has  been  introduced  in  physics  to  ana¬ 
lyze  the  qualitatively  different  forms  of  motion  and  the  interac¬ 
tions  corresponding  to  them.  These  include:  mechanical  (3.4.1), 
internal  (9.1.2),  electromagnetic  (31.2.1),  nuclear  (42.2.2)  and 
various  other  forms  of  energy. 

3.1.2 The mechanical motion of a body is changed by forces exerted on
the body by other bodies. Used in mechanics for a quantitative
description of such a process of energy exchange between interacting
bodies is the concept of the work done by the forces applied to the
body being considered. An element of work done by force $\mathbf{F}$ in
the infinitely small displacement $d\mathbf{r}$ is the scalar quantity

$$\delta W = \mathbf{F} \cdot d\mathbf{r} = \mathbf{F} \cdot \mathbf{v}\, dt,$$

where $\mathbf{r}$ and $\mathbf{v} = d\mathbf{r}/dt$ are the radius vector and
velocity of the point of the body at which the force is applied, and
$dt$ is the infinitely short time interval during which force
$\mathbf{F}$ performs the work $\delta W$ (the meaning of the notation
$\delta W$ is given in 3.1.8). In rectangular Cartesian coordinates

$$\delta W = F_x\, dx + F_y\, dy + F_z\, dz = (F_x v_x + F_y v_y + F_z v_z)\, dt,$$

where $x$, $y$ and $z$ are the coordinates of the point of application of
the force, and $F_x, F_y, F_z$ and $v_x, v_y, v_z$ are the projections of
vectors $\mathbf{F}$ and $\mathbf{v}$ on the coordinate axes.

3.1.3 The expression for an element of work can also be presented in
the form

$$\delta W = F\, ds \cos\alpha = F_\tau\, ds,$$

where $ds = |d\mathbf{r}|$ is an element of the length of the path of the
point of force application during the infinitely short time interval
$dt$, $\alpha$ is the angle between vectors $\mathbf{F}$ and $d\mathbf{r}$, and
$F_\tau = F \cos\alpha$ is the projection of the force in the direction of
the displacement $d\mathbf{r}$. A force normal to the path of its point of
application performs no work.

Force $\mathbf{F}$ is called the motive force if $F_\tau > 0$, so that
$\delta W > 0$. If $F_\tau < 0$ (so that $\delta W < 0$), force $\mathbf{F}$ is
called the retarding force (force of resistance).

3.1.4 If a mechanical system is subject to the simultaneous action of
forces $\mathbf{F}_1, \mathbf{F}_2, \ldots, \mathbf{F}_n$, the work $\delta W$ done
by them during the infinitely short time $dt$ is equal to the algebraic
sum of the work done during the same time $dt$ by each of the forces
separately. Thus

$$\delta W = \sum_{i=1}^{n} \delta W_i = \sum_{i=1}^{n} \mathbf{F}_i \cdot d\mathbf{r}_i = \sum_{i=1}^{n} \mathbf{F}_i \cdot \mathbf{v}_i\, dt,$$

where $\mathbf{r}_i$ and $\mathbf{v}_i$ are the radius vector and velocity of
the point of application of force $\mathbf{F}_i$.

For example, for a particle $\mathbf{r}_i = \mathbf{r}$ is the radius vector
of the particle and $\mathbf{v}_i = \mathbf{v}$ is its velocity. Accordingly,
$\delta W = \mathbf{F} \cdot d\mathbf{r} = \mathbf{F} \cdot \mathbf{v}\, dt$, where
$\mathbf{F} = \sum_{i=1}^{n} \mathbf{F}_i$ is the resultant force (2.2.2). It
follows from Newton’s second law (2.4.1) that for a particle

$$\delta W = \mathbf{v} \cdot d\mathbf{p},$$

where $\mathbf{p} = m\mathbf{v}$ is the momentum of the particle, and $m$ is
its mass.

In the translational motion of a perfectly rigid body
$d\mathbf{r}_i = d\mathbf{r}_c$ and $\mathbf{v}_i = \mathbf{v}_c$, where $\mathbf{r}_c$
and $\mathbf{v}_c$ are the radius vector and velocity of the centre of
mass (2.3.3) of the body.

The work done by internal forces in any motion of a perfectly rigid
body equals zero. Hence, in the translational motion of such a body
$\delta W = \mathbf{F}^{\text{ext}} \cdot d\mathbf{r}_c = \mathbf{F}^{\text{ext}} \cdot \mathbf{v}_c\, dt$,
where $\mathbf{F}^{\text{ext}}$ is the principal vector of external forces
(2.5.2). It follows from the law of motion of the centre of mass
(2.5.3) that

$$\delta W = \mathbf{v}_c \cdot d\mathbf{p},$$

where $\mathbf{p} = m\mathbf{v}_c$ is the momentum of a rigid body of mass $m$
travelling at the velocity $\mathbf{v} = \mathbf{v}_c$.

3.1.5 The work $W$ done by force $\mathbf{F}$ along a finite portion of the
path $L$ of its point of application is equal to the algebraic sum of
the work done over all the infinitely small portions of this path,
i.e. it is expressed by the line integral

$$W = \int_{(L)} \mathbf{F} \cdot d\mathbf{r} = \int_{0}^{s} F_\tau\, ds,$$

where $s$ is the length of arc measured off along the path from the
beginning of the portion of the path being considered, and $F_\tau$ is
the projection of the force onto the direction of displacement
$d\mathbf{r}$ of the point of force application. To calculate this
integral it is necessary to know the dependence of $F_\tau$ on $s$ along
the given path $L$. If this dependence is represented graphically
(Fig. 3.1), the required work $W$ is proportional to the hatched area
in Fig. 3.1.
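
The area interpretation suggests a simple numerical estimate of the work. The following is a minimal sketch, assuming the dependence of the tangential projection of the force on s is available as a function; the function name, the trapezoidal rule and the sample force laws are illustrative choices, not part of the handbook.

```python
def work_along_path(f_tau, s_end, n=1000):
    """Trapezoidal estimate of W = integral from 0 to s_end of f_tau(s) ds."""
    h = s_end / n
    total = 0.5 * (f_tau(0.0) + f_tau(s_end))
    for j in range(1, n):
        total += f_tau(j * h)
    return total * h


# Hypothetical force laws: a constant projection and one growing linearly with s.
w_constant = work_along_path(lambda s: 5.0, 2.0)       # F_tau = 5 N over 2 m
w_linear = work_along_path(lambda s: 3.0 * s, 2.0)     # F_tau = (3 N/m) * s over 2 m

print("work of the constant force:", w_constant, "J")
print("work of the linearly growing force:", w_linear, "J")
```

For these two force laws the area under the curve is a rectangle (10 J) and a triangle (6 J), so the estimates can be compared with the exact values directly.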

3.1.6 Potential-field (conservative) forces are ones whose work depends
only on the initial and final positions of their points of
application; it depends neither on the shape of the path of these
points nor on their laws of motion along this path.

For example, the forces of interaction between the parts (i.e.
particles) of a system are said to be potential (conservative) forces
if they depend only upon the configuration of the system, i.e. on the
relative position of all its particles. The work done by these forces
in moving the system from one arbitrary position to another is
independent of the kind of displacement, but is completely determined
by the initial and final configurations of the system. Examples are the
forces of electrostatic and gravitational interaction.

A steady (stationary) field (2.2.1) is said to be potential, or
conservative, if force $\mathbf{F}$, which it exerts on a particle located
in the field, is a potential, or conservative force. This implies that
force $\mathbf{F}$ depends only upon the position of the particle in the
field, and that the amounts of work done by force $\mathbf{F}$ in
displacing the particle from one arbitrary position (1) to another (2)
(Fig. 3.2) along any two paths, for instance, 1a2 (work $W_{1a2}$) and
1b2 (work $W_{1b2}$), are the same:

$$W_{1a2} = W_{1b2} = \int_{1}^{2} \mathbf{F} \cdot d\mathbf{r}.$$

Accordingly, the work done by a conservative force in displacing its
point of application along any closed path $L$ (for example, 1a2b1) is
equal to zero:

$$\oint_{(L)} \mathbf{F} \cdot d\mathbf{r} = 0.$$




In the general case, the external bodies that set up the field being
considered can be in motion with respect to an inertial frame of
reference, so that their field is not stationary, i.e. force
$\mathbf{F}$ depends explicitly on time: $\partial\mathbf{F}/\partial t \neq 0$. A
nonstationary field is a conservative one if the work done by force
$\mathbf{F}$ upon instantaneous displacement of its point of application
along any closed path $L$ equals zero:

$$\oint_{(L)} \mathbf{F} \cdot d\mathbf{r} \equiv 0.$$

Here $\mathbf{F}$ depends, not only on the coordinates of the point of
application, but on the time as well. To calculate this integral,
however, time is assumed to be a fixed parameter.

3.1.7 Among the nonconservative forces are dissipative and gyroscopic
forces. Dissipative forces are ones whose total work done in displacing
a closed system in any way is always negative. Such, for instance, are
the forces of sliding friction and the forces retarding the motion of
bodies in liquids and gases. In contrast to conservative forces,
dissipative forces depend, not only on the relative positions of the
interacting bodies, but on their relative velocities as well.

Gyroscopic forces are ones that depend upon the velocity of the
particle on which they act. They are directed perpendicular to this
velocity. An example is the Lorentz force (24.1.1) exerted by a
magnetic field on a charged particle travelling in this field. The
work done by gyroscopic forces is always equal to zero, regardless of
how the particle is displaced.
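
Why a gyroscopic force does no work can be seen from its power, which is the scalar product of the force and the velocity. The sketch below assumes the standard expression for the magnetic part of the Lorentz force, F = q v × B (treated in 24.1.1, not in this section); the charge, velocity and field values are made up for illustration.

```python
def cross(a, b):
    """Vector product a x b of two 3-component vectors."""
    return (
        a[1] * b[2] - a[2] * b[1],
        a[2] * b[0] - a[0] * b[2],
        a[0] * b[1] - a[1] * b[0],
    )


def dot(a, b):
    """Scalar product of two 3-component vectors."""
    return sum(x * y for x, y in zip(a, b))


def magnetic_force(q, v, B):
    """Assumed form of the magnetic Lorentz force: F = q (v x B)."""
    return tuple(q * c for c in cross(v, B))


# Illustrative values in arbitrary consistent units.
q = 2.0
v = (3.0, 4.0, 0.0)
B = (0.0, 0.0, 5.0)

F = magnetic_force(q, v, B)
power = dot(F, v)      # F . v, the work done per unit time

print("F =", F)
print("F . v =", power)
```

Since the force is perpendicular to the velocity, the scalar product vanishes, and so does the element of work done over any time interval dt.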

A  mechanical  system  (system  of  particles)  is  said  to  be  conser¬ 
vative  if  all  the  internal  forces  in  it  are  conservative  ones  and 
all  the  external  forces  are  both  conservative  and  stationary. 
Systems  that  do  not  satisfy  these  conditions  are  said  to  be  non¬ 
conservative. 

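3.1.8 An element of work done by force $\mathbf{F}$, exerted on a particle
by a stationary conservative field, can be represented in the form of
an exact differential of a scalar function $\Phi(x, y, z)$ of the
coordinates. This is called the force function of the field:

$$\mathbf{F} \cdot d\mathbf{r} = d\Phi \quad \text{or} \quad F_x\, dx + F_y\, dy + F_z\, dz = \frac{\partial \Phi}{\partial x}\, dx + \frac{\partial \Phi}{\partial y}\, dy + \frac{\partial \Phi}{\partial z}\, dz.$$

Hence

$$F_x = \frac{\partial \Phi}{\partial x}, \quad F_y = \frac{\partial \Phi}{\partial y}, \quad F_z = \frac{\partial \Phi}{\partial z} \quad \text{and} \quad \mathbf{F} = \operatorname{grad} \Phi.$$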


These last relations are also valid for a nonstationary conservative
field, whose force function depends, not only on the coordinates, but
on time as well: $\Phi = \Phi(x, y, z, t)$. But here

$$\mathbf{F} \cdot d\mathbf{r} = d\Phi - \frac{\partial \Phi}{\partial t}\, dt.$$


An element of work done by a nonconservative force cannot be
represented in the form of an exact differential of any function
whatsoever of the coordinates. This is precisely why an element of work
done by an arbitrary force is denoted by $\delta W$.

3.1.9 The concept of power is used in mechanics to characterize the
amount of work done in unit time. Power (instantaneous power) is the
scalar physical quantity $P$. It is equal to the ratio of the element of
work $\delta W$ to the infinitely short time interval $dt$ during which it
is performed:

$$P = \frac{\delta W}{dt}.$$

If $\mathbf{F}$ is the force doing the work $\delta W$, the power is the
scalar product of force $\mathbf{F}$ by the velocity $\mathbf{v}$ of its point
of application:

$$P = \mathbf{F} \cdot \mathbf{v} = F_\tau v.$$

In the general case, power may vary with time.

The average power in the time interval from $t$ to $t + \Delta t$ is the
physical quantity $\langle P \rangle$, which is equal to the ratio of the
work $W$ done during this time interval to the length of this interval
$\Delta t$:

$$\langle P \rangle = \frac{W}{\Delta t}.$$


3.2  Kinetic  Energy 

3.2.1 The kinetic energy of a body is the energy of its mechanical
motion. A change in the kinetic energy $E_k$ of a particle due to the
action of force $\mathbf{F}$ is equal to the work done by this force:

$$dE_k = \delta W = \mathbf{v} \cdot d\mathbf{p},$$

where $\mathbf{p} = m\mathbf{v}$ is the momentum of the particle, and $m$ and
$\mathbf{v}$ are its mass and velocity. In Newtonian mechanics
$m = \text{const}$ and the expression for the kinetic energy of a particle
is of the form

$$E_k = \frac{m\mathbf{v}^2}{2} = \frac{mv^2}{2}.$$

Kinetic energy in relativistic mechanics is discussed in Sect. 5.7.1.
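
The step from the differential relation to the closed expression is short. It is written out below as a sketch, under the usual convention (an assumption here) that the kinetic energy of a particle at rest is taken as zero:

```latex
dE_k = \mathbf{v} \cdot d\mathbf{p}
     = \mathbf{v} \cdot d(m\mathbf{v})
     = m\,\mathbf{v} \cdot d\mathbf{v}
     = d\!\left(\frac{m\mathbf{v}^2}{2}\right)
     \qquad (m = \text{const}),

% integrating from the state of rest (v = 0, E_k = 0) to velocity v:
E_k = \int_0^{v} m v\, dv = \frac{mv^2}{2}.
```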

3.2.2 The kinetic energy of a mechanical system is equal to the sum of
the kinetic energies of all the parts of the system. For a system
consisting of $n$ particles, for instance,

$$E_k = \sum_{i=1}^{n} \frac{m_i v_i^2}{2},$$

where $m_i$ and $\mathbf{v}_i$ are the mass and velocity of the $i$th
particle in the system.

The kinetic energy of a body is

$$E_k = \int_{(V)} \frac{\rho v^2}{2}\, dV = \frac{1}{2} \int_{(V)} \rho v^2\, dV,$$

where $v$ is the velocity of points of the infinitely small element of
volume $dV$ of the body, whose density is $\rho$ and mass is
$dm = \rho\, dV$. Integration is carried out throughout the whole volume
$V$ of the body. If a perfectly rigid body of mass $m$ travels with
translational motion at the velocity $\mathbf{v}$, its kinetic energy is
$E_k = mv^2/2$. The kinetic energy of rotating bodies is discussed in
Sects. 4.3.3 and 4.3.5.

3.2.3 The change in the kinetic energy of a mechanical system is equal
to the algebraic sum of the amounts of work done by all the external
and internal forces acting on the system (2.2.4). Thus

$$dE_k = \delta W^{\text{ext}} + \delta W^{\text{int}}.$$

For example, for a system consisting of $n$ particles

$$dE_k = \sum_{i=1}^{n} \mathbf{F}_i^{\text{ext}} \cdot d\mathbf{r}_i + \sum_{i=1}^{n} \sum_{k=1}^{n} \mathbf{F}_{ik} \cdot d\mathbf{r}_i,$$

where $\mathbf{r}_i$ is the radius vector of the $i$th particle,
$\mathbf{F}_i^{\text{ext}}$ is the resultant of the external forces acting on
this particle, and $\mathbf{F}_{ik}$ is the internal force exerted on the
$i$th particle by the $k$th particle ($\mathbf{F}_{ii} = 0$).

If the system is not deformed, then the work done by the internal
forces

$$\delta W^{\text{int}} = 0 \quad \text{and} \quad dE_k = \delta W^{\text{ext}}.$$

For example, the change in the kinetic energy of a perfectly rigid body
in translational motion is

$$dE_k = \mathbf{F}^{\text{ext}} \cdot d\mathbf{r},$$

where $\mathbf{F}^{\text{ext}}$ is the principal vector of external forces
(2.5.2), and $d\mathbf{r}$ is the vector of an element of displacement of
the body.

3.2.4 The kinetic energy of a mechanical system depends upon the choice
of the frame of reference. If in the inertial reference frame $K$ the
kinetic energy of the system equals $E_k$, and in the reference frame
$K'$, travelling with translational motion at the velocity $\mathbf{V}$
with respect to $K$, it is equal to $E'_k$, then

$$E_k = E'_k + \frac{mV^2}{2} + \mathbf{p}' \cdot \mathbf{V},$$

where $m$ is the mass of the system, $\mathbf{p}' = m\mathbf{v}'_c$ is the
momentum of the system in its motion with respect to reference frame
$K'$, and $\mathbf{v}'_c$ is the velocity of the system’s centre of mass
with respect to $K'$. This equation is valid both for the case when
$\mathbf{V} = \text{const}$, i.e. when $K'$ is an inertial frame of reference,
as well as for $d\mathbf{V}/dt \neq 0$.

In particular, if the frame of reference $K'$ travels with
translational motion with respect to reference frame $K$ at the velocity
$\mathbf{v}_c$ of the centre of mass of the system, i.e.
$\mathbf{V} = \mathbf{v}_c$, then $\mathbf{v}'_c = 0$ and

$$E_k = \frac{mv_c^2}{2} + E'_k.$$


This equation is an expression of Koenig's theorem: the kinetic energy
of a mechanical system is equal to the sum of the kinetic energy that a
particle would have if its mass were that of the whole system and if it
travelled at the velocity of the system’s centre of mass, plus the
kinetic energy of the same system in its motion with respect to a
translational reference frame (i.e. one travelling with translational
motion) having its origin at the centre of mass.

It follows from Koenig’s theorem that the kinetic energy of a perfectly
rigid body is equal to the kinetic energy of translational motion of
this body at the velocity of its centre of mass plus the kinetic energy
of rotation of the body about its centre of mass.
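
Both the general relation of 3.2.4 and Koenig's theorem are easy to verify numerically for a small system of particles. In the sketch below the masses, velocities and the frame velocity V are arbitrary illustrative values, and the helper names are hypothetical.

```python
def dot(a, b):
    """Scalar product of two 3-component vectors."""
    return sum(x * y for x, y in zip(a, b))


def kinetic_energy(masses, velocities):
    """E_k = sum_i m_i v_i^2 / 2."""
    return sum(0.5 * m * dot(v, v) for m, v in zip(masses, velocities))


def velocities_in_moving_frame(velocities, V):
    """v'_i = v_i - V for a frame K' travelling at velocity V with respect to K."""
    return [tuple(v_k - V_k for v_k, V_k in zip(v, V)) for v in velocities]


def momentum(masses, velocities):
    """p = sum_i m_i v_i."""
    return tuple(sum(m * v[k] for m, v in zip(masses, velocities)) for k in range(3))


# Illustrative system of three particles (values are assumptions).
masses = [1.0, 2.0, 3.0]
velocities = [(1.0, 0.0, 0.0), (0.0, 2.0, 0.0), (-1.0, 1.0, 1.0)]
m_total = sum(masses)
E_k = kinetic_energy(masses, velocities)

# General relation: E_k = E'_k + m V^2 / 2 + p' . V for an arbitrary V.
V = (2.0, -1.0, 0.5)
v_prime = velocities_in_moving_frame(velocities, V)
E_general = (
    kinetic_energy(masses, v_prime)
    + 0.5 * m_total * dot(V, V)
    + dot(momentum(masses, v_prime), V)
)

# Koenig's theorem: choose V = v_c, so that p' = 0.
v_c = tuple(p_k / m_total for p_k in momentum(masses, velocities))
v_rel = velocities_in_moving_frame(velocities, v_c)
E_koenig = 0.5 * m_total * dot(v_c, v_c) + kinetic_energy(masses, v_rel)

print("E_k computed directly:       ", E_k)
print("E_k from general relation:   ", E_general)
print("E_k from Koenig's theorem:   ", E_koenig)
```

All three numbers coincide to within rounding error; in the centre-of-mass frame the cross term drops out because the momentum of the system in that frame is zero.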


3.3  Potential  Energy 

3.3.1 Potential energy is the part of the energy of a mechanical system
that depends only on its configuration, i.e. on the relative
arrangement of all the particles of the system and their positions in
the external potential (conservative) field (3.1.6). The decrease in
the potential energy when the system is moved from arbitrary position 1
to another arbitrary position 2 is measured by the work $W_{12}$ done in
this displacement by the conservative forces (internal and external)
acting on the system. Thus

$$E_p(1) - E_p(2) = W_{12},$$

where $E_p(1)$ and $E_p(2)$ are the values of the potential energy of the
system in its initial and final positions. Accordingly, the work done
by conservative forces upon an infinitely small change in the
configuration of the system is $\delta W = -dE_p$.
Note. It is assumed here that the external conservative forces are
stationary (steady), i.e. they can change with time only as the
consequence of a change of the position of the system being considered
with respect to the frame of reference. Otherwise

$$dE_p = -\delta W + \frac{\partial E_p}{\partial t}\, dt.$$


In the simplest case, when the system is a particle located in a
conservative field, the relation between force $\mathbf{F}$, acting on the
particle, and the potential energy $E_p$ of this particle in the field
is of the form

$$F_x = -\frac{\partial E_p}{\partial x}, \quad F_y = -\frac{\partial E_p}{\partial y}, \quad F_z = -\frac{\partial E_p}{\partial z} \quad \text{and} \quad \mathbf{F} = -\operatorname{grad} E_p.$$

The potential energy $E_p$ of a particle is related to the force
function (3.1.8) of the corresponding conservative field by the
equation

$$dE_p = -d\Phi \quad \text{or} \quad E_p(x, y, z, t) = -\Phi(x, y, z, t) + C,$$

where $C$ is the integration constant.

3.3.2 The equations in 3.3.1 determine the dependence of the potential
energy of a system on its configuration only to within an arbitrary
additive constant, which has no effect on the change in energy. To
obtain a unique dependence of the potential energy of the system on its
configuration, a so-called zero configuration is chosen, in which the
potential energy of the system is conventionally assumed to equal
zero. Thus, the potential energy of a system in an arbitrary state is
equal to the work done by all the conservative forces acting on the
system in transferring it from the state being considered to the state
corresponding to the zero configuration.

3.3.3 Example 1. Potential energy of a particle in a uniform
(homogeneous) force field. Assume that force $\mathbf{F}$, exerted on the
particle by the field, is directed along axis $OZ$, i.e.
$\mathbf{F} = F_z \mathbf{k}$, where $\mathbf{k}$ is the unit vector of axis $OZ$,
and the projection $F_z$ of force $\mathbf{F}$ onto axis $OZ$ is independent
of the coordinates of the particle. Then

$$dE_p = -\mathbf{F} \cdot d\mathbf{r} = -F_z\, dz \quad \text{and} \quad E_p(z) = -F_z z + E_p(0),$$

where $E_p(0)$ is the value of the potential energy of the particle at
the level $z = 0$.

In  particular,  the  potential  energy  of  a  particle  of  mass  m  lo¬ 
cated  in  a  uniform  gravity  field  at  the  earth’s  surface  (axis  OZ 
is  directed  vertically  upward,  Fz  =  —mg,  where  g  is  the  free 
fall  acceleration)  is  equal  to 

Ep  (z)  =  mgz  +  Ep  (0). 

3.3.4 Example 2. Potential energy of a particle in a central force
field. In a conservative field of central forces the particle is
subject to the action of forces $\mathbf{F}$ which are directed everywhere
along straight lines passing through a fixed point, called the centre
of force, and depend only on the distance $r$ to this point:

$$\mathbf{F} = F_r(r)\, \frac{\mathbf{r}}{r}.$$

Here $\mathbf{r}$ is the radius vector from the centre of force to the
point being considered in the field, and $F_r(r)$ is the projection of
force $\mathbf{F}$ onto the direction of vector $\mathbf{r}$; it depends only
on the distance $r$. If the particle is attracted to the centre of
force, $F_r(r) = -|\mathbf{F}| < 0$, but if it is repelled from the centre
of force, then $F_r(r) = |\mathbf{F}| > 0$. An element of work done by
force $\mathbf{F}$ is

$$\delta W = \mathbf{F} \cdot d\mathbf{r} = F_r(r)\, dr.$$

The potential energy of the particle is

$$E_p(r) = \int_{r}^{\infty} F_r(r)\, dr + E_p(\infty).$$

Usually the energy of a particle, located at an infinite distance from
the centre of force, is taken as the reference point for potential
energy, i.e. it is assumed that $E_p(\infty) = 0$. Thus

$$E_p(r) = \int_{r}^{\infty} F_r(r)\, dr.$$

Examples of a central force field, in which the force is inversely
proportional to the square of the distance to the centre of force
($F_r(r) \propto r^{-2}$), are the gravitational fields of a particle or a
homogeneous sphere, the electrostatic field of a point charge, as well
as that of a hollow sphere and a solid sphere uniformly charged along
the surface and throughout the volume, respectively.
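
As a worked instance of the formula of 3.3.4, consider the gravitational attraction of a particle of mass m to a fixed particle of mass M. Newton's law of gravitation and the symbols G and M are brought in here as assumptions; they are not defined in this section. The force is attractive, so its projection on the radius vector is negative:

```latex
F_r(r) = -\,G\,\frac{mM}{r^2},

E_p(r) = \int_r^{\infty} F_r(r')\, dr'
       = -\,G m M \int_r^{\infty} \frac{dr'}{r'^2}
       = -\,G\,\frac{mM}{r},
\qquad E_p(\infty) = 0 .
```

The potential energy is negative at every finite distance and rises towards zero as the particle recedes to infinity, as expected for an attractive field.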

3.3.5 Example 3. Potential energy of a system of two particles between
which central forces act, i.e. forces depending upon the distance
between the particles and directed along the straight line joining the
particles. Shown in Fig. 3.3 are mutually repulsive forces
$\mathbf{F}_{12}$ and $\mathbf{F}_{21} = -\mathbf{F}_{12}$:

$$\mathbf{F}_{21} = F_\rho(\rho)\, \frac{\boldsymbol{\rho}}{\rho},$$

where $\boldsymbol{\rho} = \mathbf{r}_2 - \mathbf{r}_1$ is the radius vector from
particle 1 to particle 2, and $F_\rho(\rho)$ is the projection of force
$\mathbf{F}_{21}$ on the direction of vector $\boldsymbol{\rho}$ and depends only
on the distance $\rho$ between the particles.

[Fig. 3.3]

An infinitely small change in the potential energy of the system is

$$dE_p = -(\mathbf{F}_{12} \cdot d\mathbf{r}_1 + \mathbf{F}_{21} \cdot d\mathbf{r}_2) = -\mathbf{F}_{21} \cdot d\boldsymbol{\rho} = -F_\rho(\rho)\, d\rho.$$

If we assume that $E_p \to 0$ as $\rho \to \infty$, then

$$E_p(\rho) = \int_{\rho}^{\infty} F_\rho(\rho)\, d\rho.$$

This energy is often called the mutual potential energy of two
particles.

[Fig. 3.4]

3.3.6 Example 4. Potential energy of an elastic body (for instance, a
spring) in axial tension or compression. When an elastic body is
deformed, conservative internal forces (elastic forces) are developed
in the body that oppose deformation. According to Hooke’s law, the
force $\mathbf{F}_{\text{el}}$ exerted by the deformed body A (Fig. 3.4) on body
B, causing the deformation, is proportional to the amount of
deformation (stretch or compression):

$$\mathbf{F}_{\text{el}} = -kx\mathbf{i}.$$

Here $x\mathbf{i}$ is the displacement vector of body B that characterizes
the deformation of body A (in the undeformed state $x = 0$; in
compression $x > 0$ and in tension $x < 0$), and $k > 0$ is a factor
characterizing the elastic properties of body A.

The potential energy of an elastically deformed body (in the absence of
deformation, i.e. at $x = 0$, this energy is assumed equal to zero) is

$$E_p = \frac{kx^2}{2}.$$


3.4 Law of Conservation of Mechanical Energy

3.4.1 The mechanical energy, or total mechanical energy, is the energy
of mechanical motion and interaction. The mechanical energy $E$ of a
system of particles is equal to the sum of their kinetic energy $E_k$
and the potential energy $E_p$ of interaction of these particles with
one another and with external bodies:

$$E = E_k + E_p.$$

An elementary increment of the mechanical energy of a system during the
infinitely short time $dt$ is equal to

$$dE = \delta W_{\text{nonc}} + \frac{\partial E_p}{\partial t}\, dt,$$

where $\delta W_{\text{nonc}}$ is the algebraic sum of the elements of work
done in the time interval $dt$ by all the internal and external
nonconservative forces. The term $(\partial E_p/\partial t)\, dt$ represents the
change during the time interval $dt$ in the potential energy of the
system and, consequently, its total mechanical energy due to
nonstationary external conservative forces (3.3.1).

3.4.2 If the system is conservative (3.1.7), $\delta W_{\text{nonc}} \equiv 0$
and $\partial E_p/\partial t = 0$, because, in this case, there are no
nonconservative forces and the external conservative forces are
stationary. Consequently, the mechanical energy of such a system
$E = \text{const}$, i.e. the following law, called the law of conservation
of mechanical energy, is valid: in the motion of a conservative system,
its mechanical energy remains unchanged.

In particular, this law holds for closed conservative systems: the
mechanical energy of a closed system does not change with time if all
the internal forces acting in this system are conservative ones.
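
A minimal numerical illustration of the conservation law is the free fall of a particle in the uniform gravity field of 3.3.3. The sketch assumes no air resistance, takes g = 9.81 m/s^2 and E_p(0) = 0, and uses made-up initial conditions; the function names are hypothetical.

```python
G_ACCEL = 9.81   # m/s^2, free fall acceleration (assumed value)


def state_in_free_fall(z0, v0, t, g=G_ACCEL):
    """Height z and vertical velocity v after time t (axis OZ points upward)."""
    z = z0 + v0 * t - 0.5 * g * t * t
    v = v0 - g * t
    return z, v


def mechanical_energy(m, z, v, g=G_ACCEL):
    """E = E_k + E_p with E_p(z) = m g z, i.e. E_p(0) = 0."""
    return 0.5 * m * v * v + m * g * z


# Illustrative initial conditions.
m, z0, v0 = 2.0, 100.0, 5.0          # kg, m, m/s
times = (0.0, 0.5, 1.0, 2.0, 3.0)    # s

energies = [mechanical_energy(m, *state_in_free_fall(z0, v0, t)) for t in times]

for t, E in zip(times, energies):
    print(f"t = {t:3.1f} s   E = {E:.6f} J")
```

Kinetic energy grows and potential energy falls as the particle descends, but their sum stays at its initial value throughout the motion.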

The  law  of  conservation  of  mechanical  energy  is  associated  with 
the  uniformity  of  time.  This  property  of  time  is  manifested  in 
the  fact  that  the  laws  of  motion  of  a  closed  system  (or  a  system 
located  in  a  stationary  external  field)  are  independent  of  the 
choice  of  the  zero  time  reference.  For  example,  in  the  free  fall 
of  a  body  in  a  stationary  conservative  gravity  field  at  the  earth’s 
surface,  the  velocity  of  the  body  and  the  length  of  path  travelled 
depend only on the length of time in free fall and initial
velocity of the body, but not on the specific instant of time the
body began to fall.

3.4.3 The mechanical energy of a closed nonconservative system
is changed owing to the work done by all the nonconservative
internal forces:

$$dE = \delta W_{nonc}.$$

Gyroscopic forces (3.1.7) do no work and make no contribution
to $\delta W_{nonc}$, i.e. the existence of such forces in a system does
not lead to changes in its mechanical energy.

The  action  of  dissipative  forces  (3.1.7),  for  instance,  the  force 
of  friction,  leads  to  a  gradual  reduction  in  the  mechanical  energy 
of  a  closed  system.  This  process  is  called  the  dissipation  of 
energy. Consequently, a system whose mechanical energy
continually decreases with time is called a dissipative system. In
the  dissipation  of  energy,  the  mechanical  energy  of  the  system 
is  converted  into  other  forms  of  energy  (for  example,  into  the 
energy  of  chaotic  motion  of  molecules).  The  conversion  of 
mechanical  energy  is  accomplished  in  full  agreement  with 
a  universal  law  of  nature:  the  law  of  conservation  of  energy 
(9.2.7). 

According  to  this  law,  energy  can  be  converted  from  one  form 
to  another  and  redistributed  within  the  system,  but  its  total 
amount  in  a  closed  system  must  remain  constant.  It  follows 
from  the  law  of  conservation  and  conversion  of  energy  that  the 
changes  in  energy  of  an  open  system,  occurring  in  interaction 
of  the  system  with  the  external  medium  (external  bodies  and 
fields),  must  be  numerically  equal  and  opposite  in  sign  to  the 
changes  in  the  energy  of  the  external  medium.  In  other  words, 
the  change  in  the  energy  of  a  system  upon  its  interaction  with 
the  external  medium  should  be  equal  to  the  energy  the  system 
receives  or  yields  up  in  the  process  being  considered. 

3.4.4 Forces of resistance and friction act in all real mechanical
systems. Hence, these systems are nonconservative. In certain
cases, however, they can be assumed approximately conservative,
and the law of conservation of mechanical energy can be
applied to them. Such an approach is feasible if, in the process
being considered, the work $W_{nonc}$ done by all the nonconservative
forces acting on the system is negligibly small in comparison
to the mechanical energy $E$ of the system, i.e. $|W_{nonc}/E| \ll 1$,
so that $|\Delta E/E| \ll 1$, where $\Delta E = W_{nonc}$ is the change
in the mechanical energy of the system.

3.4.5 The state of mechanical equilibrium of a system is one
that can be disturbed only as a result of external force action
applied to the system. In this equilibrium state all the particles
of the system are at rest so that the kinetic energy of the system
equals zero. The state of mechanical equilibrium is said to be
stable if a small external action applied to the system leads to
only a small change in its state. In this case, forces are
developed in the system that tend to restore the equilibrium state.
A state of mechanical equilibrium is said to be unstable if it
can be disturbed by even an infinitely small external action
on the system, which does not return again to the equilibrium
state. In this case, forces are developed in the system that tend
to further disturb the equilibrium state.

The law of conservation of mechanical energy enables the
conditions of equilibrium of conservative systems to be formulated:
in states of stable equilibrium the potential energy curve of
the system has minima, whereas in states of unstable
equilibrium, it has maxima.

Fig. 3.5

3.4.6 On the basis of the law of conservation of mechanical
energy, the region of possible configurations of a conservative
system (3.3.1) can be found. The kinetic energy of the system
$E_k \ge 0$. Hence, at a given value $E$ of the mechanical energy
of the system, the latter can only be in states that conform to
the condition $E_p \le E$. Figure 3.5 illustrates the simplest case
in which a particle has one-dimensional motion along axis $OX$
in an external stationary conservative field. The potential
energy of the particle is a function of only a single coordinate $x$,
i.e. $E_p = E_p(x)$. The curve of this relation, shown in Fig. 3.5,
is called a potential energy curve. For a fixed value of the
mechanical energy $E$ of the particle, as shown in Fig. 3.5, the particle
can remain, in its travel, in one of the following three regions:
$x < x_1$ (region I), $x_2 < x < x_3$ (region III) and $x > x_4$
(region V). These regions are separated from one another by the
regions II and IV, which are the so-called potential barriers aeb
and cgd, within whose limits the particle cannot exist. At the
boundaries of the potential barriers (at points a, b, c and d),
the particle reverses its direction of motion; in region I the
particle can travel without limit to the left of boundary a of
the barrier, and in region V, without limit to the right of
boundary d of the barrier. In region III, the particle oscillates
between points b and c; it is in a so-called potential well bfc.
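
The condition that the potential energy cannot exceed the mechanical energy can be illustrated numerically. The sketch below is only an illustration under stated assumptions: the function `potential` is a hypothetical curve with two barriers and a well between them (it is not the curve of Fig. 3.5), and the helper `allowed_intervals`, the energy value and the grid are likewise chosen for the example.

```python
import math

def potential(x):
    # Hypothetical potential energy curve: two barriers near x = -2 and x = +2
    # with a potential well between them (an assumption, not Fig. 3.5).
    return math.exp(-(x + 2.0) ** 2) + math.exp(-(x - 2.0) ** 2)

def allowed_intervals(ep, energy, x_min, x_max, n=4000):
    """Scan a uniform grid and return the intervals where ep(x) <= energy."""
    step = (x_max - x_min) / n
    intervals = []
    start = None
    for i in range(n + 1):
        x = x_min + i * step
        inside = ep(x) <= energy
        if inside and start is None:
            start = x                      # entering an allowed region
        elif not inside and start is not None:
            intervals.append((start, x - step))  # leaving it at a barrier
            start = None
    if start is not None:
        intervals.append((start, x_max))   # region extends to the grid edge
    return intervals

# For this energy the grid splits into three allowed regions
# separated by two potential barriers.
regions = allowed_intervals(potential, energy=0.5, x_min=-6.0, x_max=6.0)
for lo, hi in regions:
    print(f"allowed: {lo:.3f} <= x <= {hi:.3f}")
```

The end points of the middle interval approximate the turning points of the particle in the potential well; refining the grid sharpens the estimate.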


3.5  Perfectly  Elastic  and  Inelastic  Collisions 

3.5.1 An impact in which a considerable change in the
velocities of the colliding bodies occurs in an extremely short time
interval is called a collision. Examples of collisions are a
hammer striking a billet being forged that lies on the anvil, and
a hammer striking the head of a nail being driven.

The  line  of  impact  is  the  common  normal  to  the  surfaces  of  the 
colliding  bodies  at  their  point  of  contact.  If  the  centres  of  mass 
(2.3.3) of the colliding bodies lie on the line of impact at the
instant of contact, the collision is said to be central. An example is
the  collision  of  two  billiard  balls.  The  collision,  or  impact,  is 
said  to  be  head-on,  or  straight-line,  if  the  velocities  of  the  centres 
of  mass  of  the  colliding  bodies  are  parallel,  before  the  collision, 
to  the  line  of  impact.  Otherwise,  it  is  called  an  oblique  impact, 
or  glancing  collision. 

3.5.2 In a collision the bodies are deformed and instantaneous,
but quite substantial, forces are developed. They are called
impact forces. For a system of colliding bodies these forces are
internal ones*, i.e. they do not change the total momentum of
the system. External forces, constantly exerted on the system
(for instance, the gravity forces of the bodies) are usually very
small compared to the impact forces. Therefore, even though
the impulses of impact forces (2.4.2) in the duration $\tau$ of the
collision are commensurable with the momenta (2.3.4) of the
colliding bodies, the resultant impulse of all the constantly
acting external forces during the same time interval $\tau$ is small
compared to the momenta of the bodies. Accordingly, the work
done on the system by the external forces during the time
interval $\tau$ is small compared to the mechanical energy of the system.
Hence, a system of bodies can be assumed, in the collision
process, to be an approximately closed system (2.2.4). This means
that in calculating the results of the collision, use can be made
of the laws of the conservation of momentum (2.7.1), of angular
momentum (4.4.1) and of energy (9.2.7). If the bodies deform
in the collision like perfectly elastic ones, the impact forces
are conservative and the law of conservation of mechanical
energy (3.4.2) is valid in the system.

* It is assumed that the colliding bodies are either free (2.2.3)
or the constraints imposed on them are such that no impact
constraint reactions are developed.

3.5.3  The  collision  of  two  bodies  is  said  to  be  perfectly  inelastic 
if, after impact, they travel together as a single body.
Sufficiently close to a perfectly inelastic collision are, for example,
the  impact  of  the  pile-driving  hammer  with  the  end  of  the  pile 
being  driven,  and  the  firing  of  a  bullet  into  a  wagon  filled  with 
sand,  when  the  bullet  remains  embedded  in  the  sand.  Various 
processes  occur  in  the  colliding  bodies  in  an  inelastic  collision 
(plastic  deformation  of  the  bodies,  friction,  etc.).  As  a  result, 
the  kinetic  energy  of  the  system  is  partly  transformed  into  its 
internal  energy  (9.1.2). 

If two bodies of masses $m_1$ and $m_2$, travelling with translational
motion at the velocities $\mathbf{v}_1$ and $\mathbf{v}_2$, undergo a perfectly inelastic
head-on central collision, they continue their translational motion
together at the velocity

$$\mathbf{u} = \frac{m_1\mathbf{v}_1 + m_2\mathbf{v}_2}{m_1 + m_2}.$$

Note. In the case of a perfectly inelastic collision that is neither
head-on nor central, the preceding equation can be used to find
the velocity of the centre of mass of the bodies joined
together in the collision. As the result of such a collision,
however, the system may begin to rotate about its centre of mass in
accordance with the law of conservation of angular
momentum (4.4.1).

3.5.4 The change in the kinetic energy of a system of two
colliding bodies in a perfectly inelastic head-on central collision is

$$\Delta E_k = \frac{(m_1 + m_2)u^2}{2} - \frac{m_1v_1^2}{2} - \frac{m_2v_2^2}{2} = -\frac{m_1m_2}{2(m_1 + m_2)}(\mathbf{v}_1 - \mathbf{v}_2)^2 < 0.$$


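In particular, if the second body is at rest before the collision
(for instance, a pile being driven by a pile driver, or a forging
lying on the anvil), the relative reduction in the kinetic energy
of the system in a perfectly inelastic head-on central collision is

$$-\frac{\Delta E_k}{E_{k1}} = \frac{m_2}{m_1 + m_2},$$

where $E_{k1} = m_1v_1^2/2$ is the kinetic energy of the system before
the collision.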

In engineering, a perfectly inelastic head-on central collision
(impact) is made use of to change the shape of bodies (smith
and die forging, riveting, etc.) or to move a body in a medium
with high resistance to motion (driving nails, piles, etc.). It
proves expedient in the first case to have the ratio $-\Delta E_k/E_{k1}$
as near as possible to unity, i.e. it is necessary that $m_2 \gg m_1$
(the mass of the forged item and the anvil should be many times
greater than that of the hammer). In the second case, on the
contrary, it is necessary that the loss in kinetic energy in the
collision (impact) be as low as possible, i.e. that $m_1 \gg m_2$
(the mass of the hammer should be many times greater than
that of the nail being driven).
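
The formulas of 3.5.3 and 3.5.4 can be checked with a short calculation. The sketch below assumes one-dimensional projections of the velocities; the function name `inelastic_collision` and the masses and speed (a 2 kg hammer striking a 200 kg anvil with forging at rest) are hypothetical values chosen for illustration only.

```python
def inelastic_collision(m1, v1, m2, v2):
    """Perfectly inelastic head-on central collision (projections on one axis).

    Returns the common velocity after impact and the change in kinetic energy.
    """
    u = (m1 * v1 + m2 * v2) / (m1 + m2)            # conservation of momentum
    ek_before = 0.5 * m1 * v1 ** 2 + 0.5 * m2 * v2 ** 2
    ek_after = 0.5 * (m1 + m2) * u ** 2
    return u, ek_after - ek_before

# Hypothetical data: hammer (body 1) strikes anvil plus forging (body 2) at rest.
m1, v1, m2, v2 = 2.0, 5.0, 200.0, 0.0
u, d_ek = inelastic_collision(m1, v1, m2, v2)

# Closed-form expression for the change in kinetic energy (3.5.4).
closed_form = -m1 * m2 / (2.0 * (m1 + m2)) * (v1 - v2) ** 2

# Relative reduction in kinetic energy when the second body is at rest.
relative_loss = -d_ek / (0.5 * m1 * v1 ** 2)

print(u, d_ek, closed_form, relative_loss, m2 / (m1 + m2))
```

With the second body much heavier than the first, the relative loss comes out close to unity, which is the forging case; swapping the two masses gives the nail-driving case with a small loss.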

3.5.5.  A  collision  is  said  to  be  perfectly  elastic  if  it  does  not  lead 
to  any  change  in  the  mechanical  energy  of  the  system,  i.e.  if 
the  bodies  are  perfectly  elastic. 

Example 1. A perfectly elastic head-on central collision of two
bodies (for instance, balls) of masses $m_1$ and $m_2$, which travel
before the collision with translational motion at the velocities
$\mathbf{v}_1$ and $\mathbf{v}_2$ along axis $OX$, passing through their centres of mass
(Fig. 3.6a). The velocities of the bodies after the collision,
$\mathbf{u}_1$ and $\mathbf{u}_2$ (Fig. 3.6b), can be found from the laws of
conservation of momentum and mechanical energy:

$$m_1\mathbf{u}_1 + m_2\mathbf{u}_2 = m_1\mathbf{v}_1 + m_2\mathbf{v}_2, \qquad m_1u_1^2 + m_2u_2^2 = m_1v_1^2 + m_2v_2^2.$$

Velocities $\mathbf{u}_1$ and $\mathbf{u}_2$ are directed along axis $OX$ and their
projections on this axis are

$$u_{1x} = \frac{(m_1 - m_2)v_{1x} + 2m_2v_{2x}}{m_1 + m_2} \quad\text{and}\quad u_{2x} = \frac{2m_1v_{1x} + (m_2 - m_1)v_{2x}}{m_1 + m_2}.$$

In particular, if the masses of the bodies are the same, the
bodies exchange velocities in a collision: $u_{1x} = v_{2x}$ and
$u_{2x} = v_{1x}$.

If the mass of the second body is many times greater than that
of the first body, then $u_{1x} \approx 2v_{2x} - v_{1x}$ and $u_{2x} \approx v_{2x}$.

Fig. 3.6 (a) Before collision; (b) After collision
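
The expressions for the projections of the velocities after a perfectly elastic head-on central collision can be verified against the two conservation laws from which they follow. The sketch below is an illustration only; the function name `elastic_head_on` and the numerical values are assumptions.

```python
def elastic_head_on(m1, v1x, m2, v2x):
    """Projections on axis OX of the velocities after a perfectly elastic
    head-on central collision of two bodies."""
    u1x = ((m1 - m2) * v1x + 2.0 * m2 * v2x) / (m1 + m2)
    u2x = (2.0 * m1 * v1x + (m2 - m1) * v2x) / (m1 + m2)
    return u1x, u2x

# Hypothetical data for the general case.
m1, v1x, m2, v2x = 1.0, 4.0, 3.0, -2.0
u1x, u2x = elastic_head_on(m1, v1x, m2, v2x)

# Momentum and (twice the) kinetic energy before and after the collision.
p_before, p_after = m1 * v1x + m2 * v2x, m1 * u1x + m2 * u2x
e_before = m1 * v1x ** 2 + m2 * v2x ** 2
e_after = m1 * u1x ** 2 + m2 * u2x ** 2

# Equal masses: the bodies exchange velocities.
same = elastic_head_on(2.0, 4.0, 2.0, -1.0)
print(u1x, u2x, same)
```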


3.5.6 Example 2. Perfectly elastic oblique central collision.
If the bodies are smooth the impulse of friction forces in the
collision can be neglected. In this case the tangential
components of the velocities of the bodies, i.e. the components
perpendicular to the line of impact, are not changed: $u_{1\tau} = v_{1\tau}$
and $u_{2\tau} = v_{2\tau}$. The normal components, directed along the line of
impact, change in the same way as in a head-on collision:

$$u_{1n} = \frac{(m_1 - m_2)v_{1n} + 2m_2v_{2n}}{m_1 + m_2} \quad\text{and}\quad u_{2n} = \frac{2m_1v_{1n} + (m_2 - m_1)v_{2n}}{m_1 + m_2}.$$

In particular, in the perfectly elastic oblique collision of a
smooth ball with a fixed flat wall ($m_2 \gg m_1$ and $\mathbf{u}_2 = \mathbf{v}_2 = 0$):

$$u_{1\tau} = v_{1\tau} \quad\text{and}\quad u_{1n} = -v_{1n},$$

i.e. the ball rebounds from the wall according to the law of
mirror reflection: the angle of reflection is equal to the angle of
incidence. The magnitude of the velocity is conserved: $u_1 = v_1$.
The vector $\Delta\mathbf{p}_1$ of the change in momentum of the ball in the
collision is directed perpendicular to the wall:

$$\Delta\mathbf{p}_1 = m_1(\mathbf{u}_1 - \mathbf{v}_1) = -2m_1\mathbf{v}_{1n}.$$

The impulse of impact force acting on the wall is equal to $2m_1\mathbf{v}_{1n}$.
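
The mirror-reflection rule for a smooth ball striking a fixed wall can be sketched in two dimensions. The example assumes that the wall lies along the x axis with unit normal `n`; the function name `reflect_from_wall`, the mass and the velocity components are hypothetical.

```python
import math

def reflect_from_wall(v, n):
    """Velocity of a smooth ball after a perfectly elastic impact with a
    fixed flat wall. v is the velocity before impact, n the unit normal
    to the wall; the normal component is reversed, the tangential kept."""
    vn = v[0] * n[0] + v[1] * n[1]          # projection on the line of impact
    return (v[0] - 2.0 * vn * n[0], v[1] - 2.0 * vn * n[1])

m1 = 0.2                 # hypothetical mass of the ball, kg
v = (3.0, -4.0)          # hypothetical velocity before impact, m/s
n = (0.0, 1.0)           # unit normal to a wall lying along the x axis
u = reflect_from_wall(v, n)

speed_before = math.hypot(v[0], v[1])
speed_after = math.hypot(u[0], u[1])

# Change in momentum of the ball: directed along the normal to the wall.
dp = (m1 * (u[0] - v[0]), m1 * (u[1] - v[1]))
print(u, speed_before, speed_after, dp)
```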


CHAPTER  4  DYNAMICS  OF  ROTARY  MOTION 
4.1  Moment  of  Force  and  Angular  Momentum 

4.1.1. The moment of force, or force moment, is a concept
introduced in mechanics to characterize the external mechanical
action exerted on a body that changes its rotary motion.
Distinction is made between the moment of force about a fixed
point and about a fixed axis.

The moment of the force $\mathbf{F}$ about a fixed point $O$ (the pole) is the
vector quantity $\mathbf{M}$, equal to the vector product of the radius
vector $\mathbf{r}$, from point $O$ to point $A$ of force application
(Fig. 4.1), by the force vector $\mathbf{F}$:

$$\mathbf{M} = [\mathbf{r}\mathbf{F}].$$

The magnitude of the moment of force is $M = Fr\sin\alpha = Fl$,
where $\alpha$ is the angle between vectors $\mathbf{r}$ and $\mathbf{F}$, and
$l = r\sin\alpha$ is the length of the perpendicular $OB$ (Fig. 4.1)
dropped from point $O$ to the line of action of the force. The
quantity $l$ is called the arm of the force with respect to point $O$.
If the point of application of force $\mathbf{F}$ is displaced along its
line of action, the moment $\mathbf{M}$ of this force about the same fixed
point $O$ is not changed. If the line of action of the force passes
through point $O$, the moment of the force about point $O$ equals zero.
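
The definition of the moment of force about a point, the arm of the force, and the invariance of the moment under a shift of the point of application along the line of action can be illustrated as follows. The helper names `cross` and `norm` and the numerical vectors are assumptions made for this sketch.

```python
import math

def cross(a, b):
    """Vector product [ab] of two three-component vectors."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def norm(a):
    """Magnitude of a vector."""
    return math.sqrt(sum(c * c for c in a))

r = (2.0, 1.0, 0.0)      # hypothetical radius vector from pole O to point A, m
F = (0.0, 3.0, 0.0)      # hypothetical force, N

M = cross(r, F)          # moment of force about O
arm = norm(M) / norm(F)  # arm l of the force, from M = F * l

# Displace the point of application along the line of action of F.
r_shifted = tuple(ri + 1.5 * Fi for ri, Fi in zip(r, F))
M_shifted = cross(r_shifted, F)

print(M, arm, M_shifted)
```

Here the line of action is parallel to the y axis at a distance of 2 m from the pole, so the computed arm equals that distance.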

4.1.2 The principal, or resultant, moment of a system of forces
about fixed point $O$ (pole) is the vector $\mathbf{M}$, equal to the geometric
sum of the moments about point $O$ of all $n$ forces of the system:

$$\mathbf{M} = \sum_{i=1}^{n} [\mathbf{r}_i\mathbf{F}_i],$$

where $\mathbf{r}_i$ is the radius vector from point $O$ to the point of
application of force $\mathbf{F}_i$.

It follows from Newton's third law (2.5.1) that the moments
of the internal forces of interaction between the particles of
a system about pole $O$ are compensated for (cancelled out)
pairwise: $[\mathbf{r}_i\mathbf{F}_{ik}] = -[\mathbf{r}_k\mathbf{F}_{ki}]$. Consequently, in calculating the
resultant moment of forces, only the external forces exerted on
the mechanical system being considered need be taken into
account.

4.1.3 The moment of force $\mathbf{F}$ about a fixed axis $a$ is the scalar
quantity $M_a$, equal to the projection on this axis of vector $\mathbf{M}$,
which is the moment of force $\mathbf{F}$ about arbitrary point $O$ of axis $a$.

The value of moment $M_a$ is independent of the choice of the
position of point $O$ along axis $a$.

Note. Sometimes the moment of a force about fixed axis $a$
is understood to mean the vector quantity $\mathbf{M}_a = M_a\mathbf{i}_a$, where
$\mathbf{i}_a$ is the unit vector of axis $a$. The vector $\mathbf{M}_a$ is the
component of vector $\mathbf{M}$ of the moment of force about pole $O$, this
component being directed along axis $a$.

If the line of action of the force intersects or is parallel to axis $a$,
the moment of force about this axis is equal to zero.

Let $A$ be the point of application of force $\mathbf{F}$, and $O_1$ be the foot
of the perpendicular dropped from point $A$ to the axis $OZ$ being
considered (Fig. 4.2). It proves convenient to resolve force $\mathbf{F}$
into three mutually perpendicular components: axial
component $\mathbf{F}_z$, parallel to axis $OZ$; radial component $\mathbf{F}_\rho$, directed
along vector $\boldsymbol{\rho} = \overrightarrow{O_1A}$, and tangential component $\mathbf{F}_\tau$, directed
perpendicular to the axis and perpendicular to vector $\boldsymbol{\rho}$. The
moment of force $\mathbf{F}$ about axis $OZ$ is

$$M_z = [\boldsymbol{\rho}\mathbf{F}_\tau]_z \quad\text{and}\quad \mathbf{M}_z = [\boldsymbol{\rho}\mathbf{F}_\tau].$$

Since vectors $\boldsymbol{\rho}$ and $\mathbf{F}_\tau$ are perpendicular to each other

$$|M_z| = |\mathbf{M}_z| = \rho\,|\mathbf{F}_\tau|.$$

The principal, or resultant, moment of a system of forces about
a fixed axis $a$ is equal to the algebraic sum of the moments of all
the forces of the system about this axis.

4.1.4 The angular momentum, or moment of momentum, of a
particle about fixed point $O$ (pole) is the vector $\mathbf{L}$. It is equal
to the vector product of radius vector $\mathbf{r}$, from point $O$ to the
location of the particle, by vector $\mathbf{p}$ of its momentum:

$$\mathbf{L} = [\mathbf{r}\mathbf{p}] = [\mathbf{r}\,m\mathbf{v}],$$

where $m$ and $\mathbf{v}$ are the mass and velocity of the particle.

The angular momentum of a system about fixed point $O$ is denoted
by vector $\mathbf{L}$ and equals the vector sum of the angular momenta
of all the particles of the system about the same point $O$:

$$\mathbf{L} = \sum_{i=1}^{n} [\mathbf{r}_i\mathbf{p}_i] = \sum_{i=1}^{n} [\mathbf{r}_i\,m_i\mathbf{v}_i],$$

where $m_i$, $\mathbf{r}_i$ and $\mathbf{v}_i$ are the mass, radius vector and velocity
of the $i$th particle, and $n$ is the total number of particles.

The angular momentum of a system about fixed axis $a$ is the scalar
quantity $L_a$, equal to the projection on axis $a$ of vector $\mathbf{L}$, the
angular momentum of the system about some point $O$ lying
on this axis:

$$L_a = \sum_{i=1}^{n} [\mathbf{r}_i\,m_i\mathbf{v}_i]_a.$$

The numerical value $L_a$ of the angular momentum is
independent of the choice of the position of point $O$ along axis $a$.

Note. Sometimes the angular momentum of a system about
a fixed axis $a$ is understood to mean the vector quantity
$\mathbf{L}_a = L_a\mathbf{i}_a$, where $\mathbf{i}_a$ is the unit vector of axis $a$.

4.1.5 The angular momentum of a body rotating about a fixed
point $O$ at the angular velocity $\boldsymbol{\omega}$ is equal to:

$$\mathbf{L} = \int_{(m)} [\mathbf{r}\mathbf{v}]\,dm = \int_{(m)} [\mathbf{r}[\boldsymbol{\omega}\mathbf{r}]]\,dm,$$

where $\mathbf{r}$ is the radius vector from point $O$ to an infinitely small
element of the body of mass $dm$, and $\mathbf{v} = [\boldsymbol{\omega}\mathbf{r}]$ is the velocity
of the element of the body. Since $[\mathbf{r}[\boldsymbol{\omega}\mathbf{r}]] = r^2\boldsymbol{\omega} - (\boldsymbol{\omega}\mathbf{r})\,\mathbf{r}$,
vectors $\mathbf{L}$ and $\boldsymbol{\omega}$ do not coincide in direction in the general
case:

$$\mathbf{L} = \boldsymbol{\omega}\int_{(m)} r^2\,dm - \int_{(m)} (\boldsymbol{\omega}\mathbf{r})\,\mathbf{r}\,dm.$$

The angular momentum of a body hinged at point $O$ and its
angular velocity coincide in direction if the body rotates about
one of its principal axes of inertia (4.2.4) passing through
point $O$:

$$\mathbf{L} = I\boldsymbol{\omega},$$

where $I$ is the moment of inertia (4.2.1) of the body about this
principal axis.

4.1.6 The resultant moments $\mathbf{M}$ and $\mathbf{M}^*$ of a system of forces
about two different fixed points $O$ and $O^*$ are related by the
equation

$$\mathbf{M} = \mathbf{M}^* + [\mathbf{r}^*\mathbf{F}],$$

where $\mathbf{r}^*$ is the radius vector from origin $O$ to point $O^*$, and $\mathbf{F}$
is the principal vector of the system of forces being considered.
If  F  =  0  the  resultant  moment  of  the  system  of  forces  is  the 
same  about  any  fixed  point:  M*  =  M.  This,  precisely,  is  the 
property  possessed  by  a  force  couple,  or  simply  couple,  i.e. 
a  system  of  two  forces,  equal  in  magnitude  and  directed  along 
two  parallel  straight  lines  in  opposite  directions.  The  shortest 
(perpendicular)  distance  d  between  the  lines  of  action  of  the 
forces  is  called  the  arm  of  the  couple.  The  moment,  or  torque, 
of  a  force  couple  is  directed  perpendicular  to  the  plane  of  the 
forces,  and  its  magnitude  is  equal  to  M  =  Fd,  where  F  is  the 
magnitude of each of the forces of the couple.

The resultant moment $\mathbf{M}_C$ of all the forces acting on a
mechanical system about the centre of mass (2.3.3) of this system is
related to the resultant moment $\mathbf{M}$ of the same system of forces
about fixed point $O$ by the equation

$$\mathbf{M} = \mathbf{M}_C + [\mathbf{r}_C\mathbf{F}],$$

where $\mathbf{r}_C$ is a radius vector from origin $O$ to point $C$, and $\mathbf{F}$ is
the principal vector of the system of forces.

4.1.7 The value of the angular momentum of a mechanical
system about its centre of mass $C$ is the same for the absolute
motion of the particles at the velocities $\mathbf{v}_i$ (i.e. with respect
to a fixed inertial frame of reference) and for the relative motion
at the velocities $\mathbf{v}'_i = \mathbf{v}_i - \mathbf{v}_C$ (i.e. with respect to a frame of
reference in translational motion with the origin of coordinates
at point $C$):

$$\sum_{i=1}^{n} [\mathbf{r}'_i\,m_i\mathbf{v}_i] = \sum_{i=1}^{n} [\mathbf{r}'_i\,m_i\mathbf{v}'_i] = \mathbf{L}_C,$$

where $\mathbf{r}'_i = \mathbf{r}_i - \mathbf{r}_C$ is the radius vector of the $i$th particle in
a frame of reference travelling together with the centre of
mass.

The relation between the values of the angular momentum $\mathbf{L}$
of a mechanical system about fixed point $O$ and $\mathbf{L}_C$ about the
system's centre of mass is of the form

$$\mathbf{L} = \mathbf{L}_C + [\mathbf{r}_C\mathbf{p}],$$

where $\mathbf{p} = \sum_{i=1}^{n} m_i\mathbf{v}_i$ is the momentum of the system in its
absolute motion.
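
Both statements of 4.1.7 can be checked on a small system of particles. The sketch below is an illustration only: the three particles (mass, radius vector, velocity) are hypothetical, and the helper functions `cross`, `add`, `sub`, `scale` and `vsum` are introduced here for the example.

```python
def cross(a, b):
    """Vector product [ab]."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def add(a, b):
    return tuple(x + y for x, y in zip(a, b))

def sub(a, b):
    return tuple(x - y for x, y in zip(a, b))

def scale(k, a):
    return tuple(k * x for x in a)

def vsum(vectors):
    total = (0.0, 0.0, 0.0)
    for v in vectors:
        total = add(total, v)
    return total

# Hypothetical system: (mass, radius vector from O, velocity).
particles = [
    (1.0, (1.0, 0.0, 0.0), (0.0, 2.0, 0.0)),
    (2.0, (0.0, 1.0, 1.0), (1.0, 0.0, -1.0)),
    (3.0, (-1.0, 2.0, 0.5), (0.5, 0.5, 0.0)),
]

m_total = sum(m for m, _, _ in particles)
p = vsum(scale(m, v) for m, _, v in particles)                  # momentum
r_c = scale(1.0 / m_total, vsum(scale(m, r) for m, r, _ in particles))
v_c = scale(1.0 / m_total, p)                                   # velocity of C

# Angular momentum about the fixed point O.
L = vsum(cross(r, scale(m, v)) for m, r, v in particles)
# Angular momentum about C using absolute and relative velocities.
L_c_abs = vsum(cross(sub(r, r_c), scale(m, v)) for m, r, v in particles)
L_c_rel = vsum(cross(sub(r, r_c), scale(m, sub(v, v_c)))
               for m, r, v in particles)
# Right-hand side of L = L_C + [r_C p].
L_from_relation = add(L_c_abs, cross(r_c, p))

print(L, L_c_abs, L_c_rel, L_from_relation)
```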

4.2  Moment  of  Inertia 

4.2.1 The moment of inertia of a mechanical system about fixed
axis $a$ is the physical quantity denoted by $I_a$ and equal to the
sum of the products of the masses of all $n$ particles of the system
by the squares of their normal distances from the axis:

$$I_a = \sum_{i=1}^{n} m_i\rho_i^2,$$

where $m_i$ and $\rho_i$ are the mass of the $i$th particle and its normal
distance from the axis.

The moment of inertia of a body about fixed axis $a$ is

$$I_a = \int_{(m)} \rho^2\,dm = \int_{(V)} \rho^2 D\,dV,$$

where $dm = D\,dV$ is the mass of an infinitely small element of
volume $dV$ of the body, $D$ is the density and $\rho$ is the normal
distance from the element $dV$ to axis $a$.

If the body is homogeneous, i.e. its density is the same
throughout, then

$$I_a = D\int_{(V)} \rho^2\,dV.$$

The moment of inertia $I_a$ is a measure of inertness of a body in
its rotary motion about fixed axis $a$ (4.3.4), in the same way
that its mass is a measure of inertness of the body in its
translational motion.

4.2.2  The  moment  of  inertia  of  a  given  body  about  some  axis 
depends,  not  only  on  the  mass,  shape  and  size  of  the  body,  but 
also on the position of the body with respect to this axis.
According to Steiner's theorem of parallel axes, the moment of
inertia $I$ of a body about an arbitrary axis is equal to the sum
of its moment of inertia $I_C$ about a parallel axis through the
centre of mass (or of inertia) of the body plus the product of
the mass $m$ of the body by the square of the normal distance $d$
between the two axes:

$$I = I_C + md^2.$$
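
Steiner's theorem can be checked against the two entries for a thin rod in Table 4.1: shifting the axis from the centre of the rod to one end, a distance of half the length, should turn $ml^2/12$ into $ml^2/3$. The function name `parallel_axis` and the mass and length below are hypothetical.

```python
def parallel_axis(i_c, m, d):
    """Moment of inertia about an axis parallel to a central axis and at a
    normal distance d from it (Steiner's theorem)."""
    return i_c + m * d ** 2

m = 1.2      # hypothetical mass of the rod, kg
l = 0.8      # hypothetical length of the rod, m

i_centre = m * l ** 2 / 12.0                  # axis through the centre (Table 4.1)
i_end = parallel_axis(i_centre, m, l / 2.0)   # axis through one end

print(i_centre, i_end, m * l ** 2 / 3.0)
```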

4.2.3 The moments of inertia of certain homogeneous regularly
shaped bodies about certain axes are listed in Table 4.1.

TABLE 4.1

Body                                                         | Position of axis a                                   | Moment of inertia I_a
-------------------------------------------------------------+------------------------------------------------------+----------------------
Hollow thin-walled circular cylinder of radius R and mass m  | Axis of the cylinder                                 | $mR^2$
Solid circular cylinder (or disk) of radius R and mass m     | Axis of the cylinder                                 | $mR^2/2$
Solid sphere of radius R and mass m                          | Axis through the centre of the sphere                | $2mR^2/5$
Hollow thin-walled sphere of radius R and mass m             | Axis through the centre of the sphere                | $2mR^2/3$
Thin straight rod of length l and mass m                     | Axis perpendicular to the rod and through its centre | $ml^2/12$
The same rod                                                 | Axis perpendicular to the rod and through one end    | $ml^2/3$

4.2.4 The products of inertia of a body about the axes of a
rectangular Cartesian coordinate system are the quantities:

$$I_{xy} = \int_{(m)} xy\,dm = \int_{(V)} xyD\,dV,$$

$$I_{xz} = \int_{(m)} xz\,dm = \int_{(V)} xzD\,dV,$$

$$I_{yz} = \int_{(m)} yz\,dm = \int_{(V)} yzD\,dV,$$

where $x$, $y$ and $z$ are the coordinates of an infinitely small
element of the body of volume $dV$, density $D$ and mass $dm$.

Axis $OX$ is called a principal axis of inertia of the body if
the products of inertia $I_{xy}$ and $I_{xz}$ are simultaneously equal
to zero. Three principal axes of inertia can be passed through
each point of the body. The axes are mutually perpendicular.
The moments of inertia of a body about the three principal axes
of inertia, passed through an arbitrary point $O$ of the body, are
called the principal moments of inertia of the body.

The  principal  axes  of  inertia,  passing  through  the  centre  of  mass 
of  the  body,  are  called  the  principal  central  axes  of  inertia  of  the 
body, and the moments of inertia about these axes are called
the  principal  central  moments  of  inertia  of  the  body.  An  axis  of 
symmetry  of  a  homogeneous  body  is  always  one  of  its  principal 
central  axes  of  inertia. 


4.3  The  Fundamental  Law  in  the  Dynamics 
of  Rotary  Motion 

4.3.1 It follows from Newton's laws that the first derivative
with respect to time $t$ of the angular momentum $\mathbf{L}$ of a
mechanical system about any fixed point $O$ is equal to the resultant
moment $\mathbf{M}^{ext}$ of all the external forces, applied to the system,
about the same point $O$:

$$\frac{d\mathbf{L}}{dt} = \mathbf{M}^{ext}.$$

This equation expresses the law of the variation of angular
momentum of the system. It is valid, in particular, for a rigid body,
hinged at point $O$ and rotating about it. In this case, the
equation expresses the fundamental law of dynamics of a rigid body
rotating about a fixed point.

In terms of the projections on the axes of a fixed rectangular
Cartesian coordinate system with its origin at point $O$, the law
for the variation of the angular momentum is written in the
form

$$\frac{dL_x}{dt} = M_x^{ext}, \quad \frac{dL_y}{dt} = M_y^{ext} \quad\text{and}\quad \frac{dL_z}{dt} = M_z^{ext}.$$

Here $L_x$, $L_y$, $L_z$, $M_x^{ext}$, $M_y^{ext}$ and $M_z^{ext}$ are the angular
momenta of the system and the resultant moments of the external
forces about the corresponding coordinate axes.

4.3.2 Example. The regular precession of a gyroscope due to its
gravitational force. A gyroscope (symmetrical gyroscope) is a rigid
body spinning (rotating rapidly) about its axis of symmetry
that can change its position in space. A gyroscope has three
degrees of freedom (1.5.6) if it is mounted at fixed point $O$,
which lies on its axis and is called the centre of suspension of the
gyroscope. If the centre of suspension coincides with the centre
of gravity $C$ (7.3.2) of the gyroscope, the gyroscope is said to be
balanced; the force of gravity does not affect its state of
rotation. Otherwise, it is said to be a heavy gyroscope, or top
(Fig. 4.3). Owing to the action of the moment of gravitational
force about point $O$

$$\mathbf{M}^{ext} = [\mathbf{r}_C\,m\mathbf{g}]$$

a heavy gyroscope rotates about this point so that its axis $OZ'$
rotates uniformly about vertical axis $OZ$ describing a conical
surface (shown by dashed lines in Fig. 4.3). Such motion of the
gyroscope is called regular precession. If the angular velocity of
precession $\Omega \ll \omega$ (where $\omega$ is the spin angular velocity or,
simply, spin velocity of the gyroscope about its axis of symmetry
$OZ'$), it can be assumed with some approximation that
the angular momentum $\mathbf{L}$ of the gyroscope about point $O$ is
directed along axis $OZ'$ of the gyroscope and equals

$$\mathbf{L} = I\boldsymbol{\omega},$$

where $I$ is the moment of inertia of the gyroscope about axis $OZ'$.
Hence

$$[\mathbf{r}_C\,m\mathbf{g}] = \left[\frac{r_C}{I\omega_{z'}}\mathbf{L}\;m\mathbf{g}\right] = [\boldsymbol{\Omega}\mathbf{L}],$$

where $\boldsymbol{\Omega} = -mr_C\mathbf{g}/(I\omega_{z'})$ is the angular velocity of precession,
and $\omega_{z'} = \omega$ in the case shown in Fig. 4.3. The greater the spin
angular velocity of the gyroscope the slower its precession.
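
The inverse dependence of the precession rate on the spin velocity can be illustrated numerically. The sketch below evaluates only the magnitude $\Omega = mr_Cg/(I\omega)$ and is valid under the same approximation $\Omega \ll \omega$; the function name `precession_rate` and all the numerical data (a disk-shaped rotor with $I = mR^2/2$ from Table 4.1) are hypothetical.

```python
def precession_rate(m, r_c, inertia, omega, g=9.81):
    """Magnitude of the angular velocity of regular precession of a heavy
    gyroscope, assuming it is much smaller than the spin velocity omega."""
    return m * r_c * g / (inertia * omega)

m = 0.5                      # hypothetical mass of the rotor, kg
radius = 0.05                # hypothetical radius of the disk-shaped rotor, m
r_c = 0.04                   # hypothetical distance from O to the centre of gravity, m
inertia = m * radius ** 2 / 2.0   # solid disk about its axis (Table 4.1)
omega = 300.0                # hypothetical spin angular velocity, rad/s

slow = precession_rate(m, r_c, inertia, omega)
fast_spin = precession_rate(m, r_c, inertia, 2.0 * omega)

print(slow, fast_spin, slow / omega)
```

Doubling the spin velocity halves the precession rate, and the ratio of precession rate to spin velocity shows whether the approximation is reasonable for the chosen data.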

4.3.3  The  kinetic  energy  of  a  rigid  body  rotating  about  a  fixed 
point  at  the  angular  velocity  to  is 

„  /w2 


4.3]  THE  LAW  OF  DYNAMICS  OF  ROTARY  MOTION 


79 


where  I  is  the  moment  of  inertia  of  the  body  about  the  instan¬ 
taneous  axis  of  rotation  (1.5.6). 

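An element of work done in the infinitely short time interval $dt$
by force $\mathbf{F}$ acting on the body is

$$\delta W = \mathbf{M}\cdot\boldsymbol{\omega}\,dt = \mathbf{M}\cdot d\boldsymbol{\varphi} = M_\omega\,d\varphi,$$

where $\mathbf{M} = [\mathbf{r}\,\mathbf{F}]$ is the moment of force
$\mathbf{F}$ about point O ($\mathbf{r}$ is the radius vector from
point O to the point of application of force $\mathbf{F}$),
$d\varphi = \omega\,dt$ and $d\boldsymbol{\varphi} = \boldsymbol{\omega}\,dt$
are the angle of rotation and the vector of an element of rotation
of the body in the time interval $dt$, and $M_\omega$ is the moment
of force $\mathbf{F}$ about the instantaneous axis of rotation
($M_\omega$ is equal to the projection of vector $\mathbf{M}$ on the
direction of vector $\boldsymbol{\omega}$).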

The  change  in  the  kinetic  energy  of  a  rotating  rigid  body  during 
the  time  interval  dt  is  equal  to  the  work  done  by  the  external 
forces: 

$$dE_k = M_\omega^{\mathrm{ext}}\,d\varphi,$$

where $M_\omega^{\mathrm{ext}}$ is the resultant moment (4.1.3) of
external forces about an instantaneous axis of rotation of the body.

4.3.4 If a rigid body rotates about fixed axis OZ at the angular
velocity $\boldsymbol{\omega}$, its angular momentum about this axis is

$$L_z = I_z\omega_z \quad \text{and} \quad \mathbf{L}_z = I_z\boldsymbol{\omega}.$$

Here $I_z$ is the moment of inertia of the body about axis OZ
and does not change with time ($I_z = \text{const}$), and
$|\omega_z| = \omega > 0$ ($\omega_z = \omega$ if vector
$\boldsymbol{\omega}$ and the unit vector of axis OZ coincide in
direction, otherwise $\omega_z = -\omega$).

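The fundamental law in the dynamics of a rigid body rotating
about fixed axis OZ is

$$I_z\frac{d\boldsymbol{\omega}}{dt} = \mathbf{M}_z^{\mathrm{ext}} \quad \text{or} \quad \boldsymbol{\varepsilon} = \frac{1}{I_z}\mathbf{M}_z^{\mathrm{ext}},$$

where $\boldsymbol{\varepsilon} = d\boldsymbol{\omega}/dt$ is the
angular acceleration of the body.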

It  is  evident  from  the  preceding  equation  that  the  moment  of 
inertia  of  a  rigid  body  about  some  fixed  axis  is  a  measure  of 
inertness  of  the  body  in  its  rotation  about  the  given  axis:  the 
greater  the  moment  of  inertia  of  the  body,  the  less  the  angular 
acceleration  it  acquires  under  the  action  of  the  same  moment  of 
external forces.

4.3.5 The kinetic energy of a rigid body rotating about fixed
axis OZ at the angular velocity $\omega$ is

$$E_k = \frac{I_z\omega^2}{2}.$$

An element of work done during the infinitely short time $dt$
by force $\mathbf{F}$ applied to the body is

$$\delta W = M_z\omega\,dt = M_z\,d\varphi,$$

where $M_z$ is the moment of force $\mathbf{F}$ about the axis OZ of
rotation (the unit vector of axis OZ coincides in direction with
vector $\boldsymbol{\omega}$).

The  change  in  the  kinetic  energy  of  a  rigid  body  during  the 
time  interval  dt  is  equal  to  the  work  done  by  the  external  forces 
during  this  interval: 

$$dE_k = M_z^{\mathrm{ext}}\,d\varphi,$$

where $M_z^{\mathrm{ext}}$ is the resultant moment of external forces
about the axis of rotation of the body.
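
The relations of 4.3.4 and 4.3.5 can be checked numerically. The
sketch below, written in Python, integrates the law of rotation about
a fixed axis for a constant moment of external forces and compares the
accumulated work with the final kinetic energy. The flywheel
parameters and the function name `spin_up` are hypothetical and serve
only as an illustration.

```python
def spin_up(inertia, torque, duration, steps=100_000):
    """Integrate I_z * d(omega)/dt = M_z for a body starting at rest.

    Returns the final angular velocity, the total angle of rotation
    and the accumulated work, i.e. the sum of M_z * d(phi).
    """
    dt = duration / steps
    omega = 0.0
    phi = 0.0
    work = 0.0
    for _ in range(steps):
        eps = torque / inertia                    # angular acceleration
        d_phi = omega * dt + 0.5 * eps * dt * dt  # angle turned in dt
        work += torque * d_phi                    # element of work
        phi += d_phi
        omega += eps * dt
    return omega, phi, work


# Hypothetical flywheel: homogeneous disc, m = 2 kg, R = 0.1 m.
inertia = 0.5 * 2.0 * 0.1 ** 2   # I_z = m R^2 / 2
omega, phi, work = spin_up(inertia, torque=0.05, duration=4.0)
kinetic = 0.5 * inertia * omega ** 2
print(omega, phi, work, kinetic)
```

Within rounding error the work done by the moment equals the kinetic
energy acquired by the flywheel.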

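4.3.6 The motion of a free rigid body satisfies the following two
differential equations:

$$\frac{d}{dt}(m\mathbf{v}_C) = \mathbf{F}^{\mathrm{ext}}, \qquad \frac{d\mathbf{L}_C}{dt} = \mathbf{M}_C^{\mathrm{ext}}.$$

Here $m$ is the mass of the body, $\mathbf{v}_C$ is the velocity of
its centre of mass C, $\mathbf{F}^{\mathrm{ext}}$ is the principal
vector of external forces applied to the body (2.5.2),
$\mathbf{M}_C^{\mathrm{ext}}$ is the resultant moment of external
forces about point C (4.1.6), and $\mathbf{L}_C$ is the angular
momentum about the same point C (4.1.7).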

The  first  equation  describes  translational  motion  of  the  free 
body  at  the  velocity  of  its  centre  of  mass  (2.5.3).  The  second 
equation  follows  from  the  law  of  the  variation  of  angular 
momentum  (4.3.1)  and  describes  the  rotation  of  a  rigid  body 
about  its  centre  of  mass  (1.5.9). 

4.3.7 The kinetic energy of a free rigid body can be determined
on the basis of Koenig’s theorem (3.2.4):

$$E_k = \frac{mv_C^2}{2} + \frac{I_C\omega^2}{2},$$

where $I_C$ is the moment of inertia of the body about an
instantaneous axis of rotation passing through its centre of mass C,
and $\omega$ is the angular velocity of the body. In the general
case, the instantaneous axis is displaced in the body and the moment
of inertia $I_C$ varies with time. The value of $I_C$ is constant if
the body is in plane motion (1.5.9).

Example. The kinetic energy of a homogeneous circular cylinder
rolling down an inclined plane without slipping. The cylinder
has plane motion because all of its points travel in vertical
planes that are parallel to one another. The cylinder travels
with translational motion at the velocity $\mathbf{v}_C$ directed
along the inclined plane, and rotates about its axis
($I_C = mR^2/2$, where $m$ and $R$ are the mass and radius of the
cylinder) at the angular velocity $\omega$. From the condition that
the cylinder rolls without slipping it follows that the instantaneous
velocities of the points of contact between the cylinder and inclined
plane equal zero, i.e. $\omega = v_C/R$. Hence, the kinetic energy of
the rolling cylinder is

$$E_k = \frac{mv_C^2}{2} + \frac{I_C\omega^2}{2} = \frac{3}{4}mv_C^2.$$
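
The result of the example can be confirmed with a short Python
sketch that adds the translational and rotational terms of Koenig's
theorem separately. The numerical values of the mass, radius and
velocity are arbitrary assumptions.

```python
def rolling_cylinder_energy(m, radius, v_c):
    """Kinetic energy of a homogeneous cylinder rolling without slipping."""
    i_c = 0.5 * m * radius ** 2          # I_C = m R^2 / 2
    omega = v_c / radius                 # condition of rolling without slipping
    translational = 0.5 * m * v_c ** 2   # m v_C^2 / 2
    rotational = 0.5 * i_c * omega ** 2  # I_C omega^2 / 2
    return translational + rotational


# Hypothetical cylinder: m = 2 kg, R = 0.1 m, v_C = 3 m/s.
energy = rolling_cylinder_energy(m=2.0, radius=0.1, v_c=3.0)
closed_form = 0.75 * 2.0 * 3.0 ** 2      # (3/4) m v_C^2
print(energy, closed_form)
```

The radius cancels out: the energy depends only on the mass and on
the velocity of the centre of mass.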


4.4  Law  of  Conservation  of  Angular  Momentum 

4.4.1  The  law  of  the  conservation  of  angular  momentum  can  be 
formulated  as  follows:  the  angular  momentum  of  a  closed  system 
(2.2.4)  about  any  fixed  point  remains  constant  with  time,  i.e. 

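$$\frac{d\mathbf{L}}{dt} = 0 \quad \text{and} \quad \mathbf{L} = \text{const}.$$

Accordingly, the angular momentum of a closed system about
its centre of mass (4.1.7) remains constant with time:

$$\frac{d\mathbf{L}_C}{dt} = 0 \quad \text{and} \quad \mathbf{L}_C = \text{const}.$$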

Like  the  laws  of  conservation  of  momentum  and  energy,  the 
law  of  conservation  of  angular  momentum  is  of  significance  far 
beyond  the  scope  of  classical  mechanics.  It  belongs  to  the  most 
fundamental  of  physical  laws  because  it  is  associated  with 
a definite symmetry property of space: its isotropy. The isotropy
of space is manifested in the fact that the physical properties
and  laws  of  motion  of  a  closed  system  are  independent  of  the 
direction  chosen  for  the  coordinate  axes  in  an  inertial  frame 
of  reference.  This  means  that  these  properties  and  laws  do  not 
change  when  the  closed  system  is  turned  as  a  whole  through 
any  angle  in  space. 

According  to  modern  concepts,  not  only  particles  and  bodies 
can  have  an  angular  momentum,  but  fields  as  well.  Elementary 
particles  and  systems  built  from  them  (for  instance,  atomic 
nuclei)  can  have  an  angular  momentum  that  is  not  related 
to  their  motion  in  space  and  is  called  their  spin  (Table  43.1). 

4.4.2  With  respect  to  systems  that  can  be  described  by  classical 
(Newtonian)  mechanics,  the  law  of  conservation  of  angular 
momentum  can  be  regarded  as  a  consequence  of  Newton’s  laws. 
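For a closed mechanical system, the resultant moment of external
forces about any fixed point (as well as about the centre of
mass of the system) is identically zero:
$\mathbf{M}^{\mathrm{ext}} = 0$ (consequently,
$\mathbf{M}_C^{\mathrm{ext}} = 0$, see 4.1.6, where
$\mathbf{F} = \mathbf{F}^{\mathrm{ext}} = 0$) and the law of
conservation of angular momentum follows from 4.3.1:

$$\mathbf{L} = \sum_{i=1}^{n} [\mathbf{r}_i\, m_i\mathbf{v}_i] = \text{const},$$

where $m_i$, $\mathbf{r}_i$ and $\mathbf{v}_i$ are the mass, radius
vector and velocity of the $i$th particle of the system, which
consists of $n$ such particles. Accordingly (see 4.1.7 and 2.5.3),

$$\mathbf{L}_C = \sum_{i=1}^{n} [\mathbf{r}'_i\, m_i\mathbf{v}'_i] = \sum_{i=1}^{n} [\mathbf{r}'_i\, m_i\mathbf{v}_i] = \text{const},$$

where $\mathbf{r}'_i = \mathbf{r}_i - \mathbf{r}_C$,
$\mathbf{v}'_i = \mathbf{v}_i - \mathbf{v}_C$, and $\mathbf{r}_C$ and
$\mathbf{v}_C$ are the radius vector and velocity of the system’s
centre of mass.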
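
The equality of the two sums for $\mathbf{L}_C$ can be verified
numerically. The Python sketch below builds a small system of
particles with random masses, positions and velocities (all values,
the seed and the helper names are arbitrary assumptions) and computes
the angular momentum about the centre of mass in both ways.

```python
import random


def cross(a, b):
    """Vector product [a b] of two 3-component vectors."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def mass_weighted_mean(masses, vectors):
    """Mass-weighted mean of a list of vectors (gives r_C or v_C)."""
    total = sum(masses)
    return tuple(sum(m * v[k] for m, v in zip(masses, vectors)) / total
                 for k in range(3))


def angular_momentum_about_c(masses, positions, velocities, relative):
    """Sum of [r'_i m_i v'_i] if relative is True, else of [r'_i m_i v_i]."""
    r_c = mass_weighted_mean(masses, positions)
    v_c = mass_weighted_mean(masses, velocities)
    total = [0.0, 0.0, 0.0]
    for m, r, v in zip(masses, positions, velocities):
        r_rel = tuple(r[k] - r_c[k] for k in range(3))
        v_used = tuple(v[k] - v_c[k] for k in range(3)) if relative else v
        term = cross(r_rel, tuple(m * c for c in v_used))
        for k in range(3):
            total[k] += term[k]
    return tuple(total)


random.seed(1)  # arbitrary seed so that the run is repeatable
n = 5
masses = [random.uniform(1.0, 3.0) for _ in range(n)]
positions = [tuple(random.uniform(-1.0, 1.0) for _ in range(3))
             for _ in range(n)]
velocities = [tuple(random.uniform(-1.0, 1.0) for _ in range(3))
              for _ in range(n)]

l_rel = angular_momentum_about_c(masses, positions, velocities, True)
l_abs = angular_momentum_about_c(masses, positions, velocities, False)
print(l_rel)
print(l_abs)
```

The two results agree because the sum of $m_i\mathbf{r}'_i$ over all
particles is zero by the definition of the centre of mass.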

4.4.3 If a system is not closed, but the external forces acting
on it are such that their resultant moment about fixed point O
is identically zero ($\mathbf{M}^{\mathrm{ext}} = 0$), then, according
to Newton’s laws (4.3.1), the angular momentum of the system about
the same point O remains constant with time: $\mathbf{L} = \text{const}$.
This condition is satisfied in practice, for example, by a balanced
gyroscope (4.3.2) with three degrees of freedom, if the moment of
the friction force in its suspension is sufficiently small.
If the stand of the gyroscope, holding its centre of suspension at
rest, is turned in any way, the axis of the gyroscope retains its
orientation with respect to a fixed inertial frame of reference.*
Usually $\mathbf{M}^{\mathrm{ext}} \neq 0$ and $\mathbf{L} \neq \text{const}$.
But if the resultant moment of external forces about any fixed axis
passing through point O is identically zero, the angular momentum of
the system about this axis remains constant with time. For instance,
if $M_z^{\mathrm{ext}} = 0$, then $L_z = \text{const}$.

If the system rotates about fixed axis OZ and the resultant
moment of external forces about this axis $M_z^{\mathrm{ext}} = 0$,
the angular momentum of the system about the axis of rotation remains
constant with time:

$$I_z\omega = \text{const},$$

where $\omega$ and $I_z$ are the angular velocity and moment of
inertia of the system.

If the action of internal forces, as well as external forces that
comply with the condition $M_z^{\mathrm{ext}} = 0$, leads to
deformation of the system, changing its moment of inertia $I_z$, the
angular velocity $\omega$ correspondingly increases or decreases.
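
A minimal Python sketch of this relation follows. It assumes a system
whose moment of inertia is reduced by internal forces alone (for
instance, a rotating person drawing in the arms); the numerical values
and the function name are hypothetical.

```python
def final_angular_velocity(i_initial, omega_initial, i_final):
    """Angular velocity after I_z changes, from I_z * omega = const."""
    return i_initial * omega_initial / i_final


# Hypothetical values: I_z falls from 4 to 1 kg*m^2 at omega = 2 rad/s.
omega_2 = final_angular_velocity(4.0, 2.0, 1.0)
energy_1 = 0.5 * 4.0 * 2.0 ** 2      # kinetic energy before, I_z omega^2 / 2
energy_2 = 0.5 * 1.0 * omega_2 ** 2  # kinetic energy after
print(omega_2, energy_1, energy_2)
```

The angular momentum is unchanged, but the kinetic energy increases;
the difference is equal to the work done by the internal forces that
deform the system.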

4.4.4 Free axes of a body are ones about which a free rigid body
(2.2.3) can rotate at constant angular velocity $\boldsymbol{\omega}$
in the absence of all external action. Such rotation of a body is
said to be free, or inertial, rotation. The free axes of a body
coincide with its principal central axes of inertia (4.2.4). In the
general case, the values $I_1$, $I_2$ and $I_3$ of the principal
central moments of inertia (4.2.4) differ. The free rotation of such
a body (for example, a homogeneous rectangular parallelepiped with
edges of different length) is accomplished practically only about two
free axes, corresponding to the extreme values of the principal
central moments of inertia: the maximum and minimum values. The
rotation of the body about its third principal central axis,
corresponding to the intermediate value of the moment of inertia of
the body, proves to be unstable: even very small external action is
capable of leading to substantial deviation of the instantaneous axis
of rotation from its initial direction in the body.

* It is assumed that vector $\mathbf{L}$ is directed along the axis of
the gyroscope. Otherwise, a free gyroscope has a precessional motion:
its axis describes a circular conical surface whose vertex is at
the centre of suspension and whose axis is directed along the
vector $\mathbf{L} = \text{const}$.

If the values of two principal central moments of inertia of
a body are equal: $I_1 = I_2 \neq I_3$, stable free rotation of such
a body (for instance, a homogeneous circular cylinder) is possible
only about the free axis corresponding to the third, differing
value $I_3$ of the moment of inertia. For a homogeneous circular
cylinder such a free axis is its axis of symmetry. But if a long
thin cylinder is rotated by means of a string secured to one end,
stable rotation of the cylinder can be accomplished about the
free axis corresponding to the maximum value of its moment
of inertia. This free axis is perpendicular to the axis of symmetry
of the cylinder.
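
The instability of rotation about the intermediate axis can be
illustrated numerically. The Python sketch below is only an
illustration under stated assumptions: it uses Euler's equations for
a body free of external moments, written in the principal central
axes (these equations are standard in mechanics but are not given in
this section), integrates them by the fourth-order Runge-Kutta
method, and takes hypothetical moments of inertia 1, 2 and 3 and a
small initial disturbance of 0.001.

```python
def euler_rhs(w, inertia):
    """Right-hand sides of Euler's equations with zero external moment."""
    i1, i2, i3 = inertia
    w1, w2, w3 = w
    return ((i2 - i3) * w2 * w3 / i1,
            (i3 - i1) * w3 * w1 / i2,
            (i1 - i2) * w1 * w2 / i3)


def rk4_step(w, inertia, dt):
    """One fourth-order Runge-Kutta step for the angular velocity."""
    def shifted(base, k, h):
        return tuple(b + h * ki for b, ki in zip(base, k))

    k1 = euler_rhs(w, inertia)
    k2 = euler_rhs(shifted(w, k1, dt / 2), inertia)
    k3 = euler_rhs(shifted(w, k2, dt / 2), inertia)
    k4 = euler_rhs(shifted(w, k3, dt), inertia)
    return tuple(wi + dt * (a + 2 * b + 2 * c + d) / 6
                 for wi, a, b, c, d in zip(w, k1, k2, k3, k4))


def max_wobble(axis, inertia=(1.0, 2.0, 3.0), kick=1e-3,
               t_end=40.0, dt=0.01):
    """Largest angular-velocity component off the initial spin axis."""
    w = [kick, kick, kick]   # small disturbance about every axis
    w[axis] = 1.0            # main rotation about the chosen axis
    w = tuple(w)
    worst = 0.0
    for _ in range(int(t_end / dt)):
        w = rk4_step(w, inertia, dt)
        worst = max(worst, max(abs(w[k]) for k in range(3) if k != axis))
    return worst


wobble = [max_wobble(axis) for axis in range(3)]
print(wobble)
```

For the axes of minimum and maximum moment of inertia (indices 0 and
2) the disturbance stays of the order of the initial one; for the
intermediate axis (index 1) it grows until it is comparable with the
main rotation.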


CHAPTER  5  FUNDAMENTALS  OF  THE  SPECIAL 
THEORY  OF  RELATIVITY 

5.1  Postulates  of  the  Special  Theory  of  Relativity 

5.1.1 The special theory of relativity (sometimes called the
restricted relativity theory) is the modern physical theory of space
and time. The special relativity theory and quantum mechanics
(38.1.1) serve as the theoretical basis for all modern physics
and engineering (for example, nuclear physics and nuclear
technology). The special relativity theory is often called
relativistic theory, and specific phenomena described by this theory
are called relativistic effects. Relativistic effects are manifested,
as a rule, at velocities of bodies close to that of light in free
space ($c = 3 \times 10^8$ m/s) and called relativistic velocities.
Relativistic mechanics refers to the mechanics of motion at
relativistic velocities and is based on the special theory of
relativity.

As  in  classical  Newtonian  mechanics  it  is  assumed  in  special 
relativity  theory  that  time  is  uniform  (3.4.2)  and  that  space 
is  homogeneous  (2.7.1)  and  isotropic  (4.4.1). 

5.1.2  The  special  theory  of  relativity  is  based  on  two  main 
principles  accepted  as  the  initial  postulates. 

The  first  postulate  is  a  generalization  of  Galileo’s  mechanical 
principle of relativity (2.8.4) to cover any physical processes.
This postulate, called the principle of relativity, or Einstein's
relativity principle, states: all physical phenomena proceed in
the same way, under the same conditions, in all inertial frames
of  reference  (2.1.2).  In  other  words,  the  principle  of  relativity 
contends  that  physical  laws  are  independent  (invariant)  with 
respect  to  the  chosen  inertial  frame  of  reference;  equations 
expressing  these  laws  have  the  same  form  in  all  inertial  frames 
of  reference.  Consequently,  it  is  impossible  to  ascertain,  by 
means  of  any  physical  experiments  whatsoever,  conducted 
in  a  closed  system  of  bodies  (2.2.4),  whether  the  system  is  at 
rest  or  is  travelling  at  uniform  velocity  in  a  straight  line  with 
respect  to  some  inertial  frame  of  reference.  All  inertial  frames 
of  reference  are  equally  justified  in  physics.  It  is  impossible, 
on  the  basis  of  physical  experiments,  to  select  from  the  host 
of  inertial  reference  frames  any  “preferred”  or  “absolute”  frame 
of  reference  possessing  any  qualitative  distinctions  from  other 
inertial  reference  frames. 

5.1.3 The second postulate expresses the principle of the invariance
of the velocity of light: the velocity of light in free space is
independent of the motion of the source of light. It is the same
in all directions and in all inertial frames of reference, being
one of the most basic physical constants. Experiments show
that $c$, the velocity of light in free space, is the limiting
velocity in nature. The velocity of any particles and bodies, as well
as the velocity of propagation of any interactions or signals, cannot
exceed $c$.

These specific laws governing the process of light propagation
in free space enable this real physical process to be made use of
in establishing a procedure for the chronometry of a frame of
reference, i.e. for the synchronization of clocks located at various
points in space and travelling together with the frame of reference
being considered (5.2.3).

5.1.4 The postulates of the special theory of relativity contradict
the concepts of the properties of space and time that are
accepted in classical mechanics and on which the Galilean
transformations are based (2.8.1). In particular, this refers
to the contention, considered to be “self-evident” in Newtonian
mechanics, that the passage of time is identical in all inertial
frames of reference. From this it follows that the time interval
between any two events is of an absolute nature. For instance,
if two events occur simultaneously according to the clock in
one inertial frame of reference, then, according to classical
concepts, they also occur simultaneously according to the clock
of any other inertial frame.

The above-mentioned contradiction can be illustrated by the
following example (Fig. 5.1). We have two inertial frames of
reference: fixed frame K and frame K' which travels along the
OX axis at constant velocity V. Assume that a flash of light is
produced at point O at the instant of the zero time reference
in both frames K and K' ($t = t' = 0$), when their origins O
and O' of coordinates coincide. By the instant of time $t > 0$,
light, propagating in free space at the velocity $c$, reaches
points at the surface of a hollow sphere in the frame K. This
sphere has its centre at point O and its radius equals $ct$.

It can be assumed that in frame K' the light flash occurred at
the instant of time $t' = 0$ at point O'. Hence, according to the
postulates of the special theory of relativity, by the instant
of time $t' = t$, light in frame K' reaches points at the surface
of a hollow sphere of the same radius $ct$ as in the frame K but
with its centre at point O'. At this instant, however, point O'
is not at point O, but at a distance of $Vt$ from this point. Hence,
any attempt to combine the postulates of the special theory of
relativity with classical conceptions of absolute time that elapses
identically in all frames of reference leads to an absurdity:
the light flash must simultaneously reach points in space that
belong to two different hollow spheres.


5.2 Simultaneity of Events. Synchronization of Clocks

5.2.1 The concept of the simultaneity of two or several events
is extensively employed in making various physical measurements.
For example, to determine the length $l$ of a rod, arranged
along the axis OX of reference frame K and travelling with
respect to this frame, it is necessary to note the values of the
coordinates $x_2(t)$ and $x_1(t)$ of the ends of the rod
simultaneously, i.e. at the same instant of time $t$. Then

$$l = |x_2(t) - x_1(t)|.$$

The  instant  of  time  at  which  some  event  occurs  (for  instance, 
the  launching  or  touchdown  times  of  a  spaceship)  is  determined 
by  noting  a  clock’s  reading  that  is  simultaneous  with  the  event 
being  considered.  This  can  be  simply  done  by  means  of  a  clock 
located  at  the  place  where  the  event  occurs.  Hence,  there  should 
be  many  clocks  located  at  various  points  in  space  in  each  frame 
of  reference.  Obviously,  these  clocks  should  be  in  agreement 
with  one  another,  i.e.  run  synchronously:  their  readings  should 
be  identical  at  each  instant  of  time  t. 

5.2.2 The synchronism of clocks located adjacently, i.e. at
the same place in space, can be checked by the coincidence of
their readings at each arbitrary instant of time. The synchronism
of clocks that are separated in space at the points A and B,
distant from each other, could be checked in a similar manner
if we had at our disposal standard time signals that could
propagate instantaneously from A to B. Experiments indicate,
however, that this method is unfeasible because the velocity
of any signal cannot exceed the velocity of light in free space.

We can, however, proceed in the following manner. We can
convey the clock from B to A, check whether it is synchronous
with the clock located at A, and then convey the clock carefully
back to B. Then we can check whether the clock brought back
to B runs at the same rate as the clock left at A by transmitting
time signals from A to B after definite equal time intervals
according to the clock at A. But it cannot be established by
this procedure whether a shift in the zero time reference has
occurred in the clock reading while it was being conveyed back
to B, i.e. whether the clock brought back to B is now fast or
slow with respect to the clock at A by a constant amount $\Delta t$.

5.2.3 The problem of the synchronism of clocks separated in
space at points A and B can be solved only by an unambiguous
agreement (or definition) about when these clocks are to be
regarded as synchronous. Einstein made use of a real physical
process, the propagation of light in free space, as the basis
for such a definition. He proceeded from the facts that the
velocity of light in free space is the maximum possible velocity
at which signals can be transmitted in nature and that it is
identical in all directions and in all inertial frames of
reference.

Assume that according to the clock at point A the light signal
is transmitted from this point at the instant of time $t_1$ and,
after being reflected at point B, returns to A at the instant of
time $t_3$. Then, by definition, the clock at B is in synchronism
with that at A if they run at the same rate and if, at the instant
the light signal reaches B, the clock located there indicates the
time $t_2 = (t_1 + t_3)/2$.
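
The definition can be expressed as a short Python sketch. The
function names and the example distance (one light-second between
A and B) are assumptions made only for illustration.

```python
C = 299_792_458.0  # velocity of light in free space, m/s


def synchronized_reading(t1, t3):
    """Reading the clock at B must show when the signal arrives there."""
    return (t1 + t3) / 2.0


def clock_offset(t1, t3, reading_at_b):
    """Amount by which the clock at B is fast (+) or slow (-)."""
    return reading_at_b - synchronized_reading(t1, t3)


# Hypothetical case: B is one light-second from A, the signal leaves A
# at t1 = 0 and returns at t3 = 2 s according to the clock at A.
distance = C * 1.0
t1 = 0.0
t3 = t1 + 2.0 * distance / C
t2 = synchronized_reading(t1, t3)
offset = clock_offset(t1, t3, reading_at_b=1.25)
print(t2, offset)
```

Here the clock at B is synchronous with the clock at A if it reads
1 s on arrival of the signal; a clock reading 1.25 s is fast by
0.25 s and must be set back by that amount.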

5.2.4  According  to  the  special  theory  of  relativity,  the  passage 
of  time  may  differ  in  different  inertial  frames  of  reference. 
Consequently,  the  time  interval  between  any  two  definite 
events  is  relative:  it  can  change  in  going  over  from  one  inertial 
reference  frame  to  another.  In  particular,  the  simultaneity  of 
two  events  occurring  at  different  points  separated  in  space  is 
also  relative.  Events  occurring  simultaneously  in  one  inertial 
frame  of  reference  are  not  at  all  simultaneous  in  other  inertial 
frames  that  are  travelling  with  respect  to  the  first  frame.  In 
some  frames  of  reference  one  of  these  events  occurs  before  the 
other  event;  in  other  frames  it  occurs  after  the  other  event. 

Thus, in the example illustrated in Fig. 5.1 (see 5.1.4), the
arrivals of the light flash at points A and B are events that
occur simultaneously in the fixed frame of reference K. In the
travelling frame of reference K' these events are not simultaneous.
The light flash reaches A, receding from the light source
O', later than it reaches B, approaching the source O'.

Events connected by a cause-and-effect relationship, i.e. by
causality, cannot occur simultaneously in any frame of reference
whatsoever, because any effect is due to some process initiated
by the cause. At the same time, no process (physical, chemical,
biological, etc.) can proceed instantaneously. Therefore,
relativity does not contradict causality in any way. In any
inertial frame of reference an event that is an effect always occurs
later than the event causing this effect.


5.3  Lorentz's  Transformations 


5.3.1 It follows from the postulates of the special theory of
relativity, as well as from the homogeneity (2.7.1) and isotropy
(4.4.1) of space and the uniformity of time (3.4.2), that the
relations between the coordinates and times of the same event in
two inertial frames of reference are expressed by the Lorentz
transformations, rather than the Galilean transformations (2.8.1)
used in classical (Newtonian) mechanics. According to the principle
of relativity and the above-mentioned properties of the
symmetry of space and time, the Lorentz transformations should
be linear.

5.3.2 The Lorentz transformations are in their simplest form
if the like axes of the Cartesian coordinates of the fixed (K)
and moving (K') inertial frames of reference are pairwise parallel
to each other and if frame K' travels with respect to frame K
at the constant velocity V along axis OX (Fig. 2.3, see 2.8.1).
If, in addition, the zero time reference in both frames ($t = 0$
and $t' = 0$) is taken as the instant when the origins O and O'
of coordinates of the two systems coincide, then the Lorentz
transformations are of the form

$$x' = \frac{x - Vt}{\sqrt{1 - V^2/c^2}}, \qquad x = \frac{x' + Vt'}{\sqrt{1 - V^2/c^2}},$$

$$y' = y, \qquad y = y',$$

$$z' = z, \qquad z = z',$$

$$t' = \frac{t - Vx/c^2}{\sqrt{1 - V^2/c^2}}, \qquad t = \frac{t' + Vx'/c^2}{\sqrt{1 - V^2/c^2}},$$


where  c  is  the  velocity  of  light  in  free  space. 
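
The direct and inverse transformations for $x$ and $t$ can be written
as the following Python sketch. The function names and the
coordinates of the sample event are hypothetical; the check simply
transforms an event from K to K' and back again.

```python
import math

C = 299_792_458.0  # velocity of light in free space, m/s


def lorentz(x, t, v):
    """Coordinates (x', t') in frame K' of an event (x, t) in frame K."""
    gamma = 1.0 / math.sqrt(1.0 - (v / C) ** 2)
    return gamma * (x - v * t), gamma * (t - v * x / C ** 2)


def lorentz_inverse(x_p, t_p, v):
    """Coordinates (x, t) in frame K of an event (x', t') in frame K'."""
    gamma = 1.0 / math.sqrt(1.0 - (v / C) ** 2)
    return gamma * (x_p + v * t_p), gamma * (t_p + v * x_p / C ** 2)


# Hypothetical event: 1e9 m from origin O, 2 s after the zero time.
x, t = 1.0e9, 2.0
v = 0.6 * C
x_p, t_p = lorentz(x, t, v)
x_back, t_back = lorentz_inverse(x_p, t_p, v)
print(x_p, t_p)
print(x_back, t_back)
```

The inverse transformation differs from the direct one only in the
sign of V, as follows from the equivalence of the two frames.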

5.3.3 The Lorentz transformations show that not only the spatial
coordinates of an event being considered are changed in going
over from one inertial frame of reference to another, but the
corresponding instants of time as well. But a definite interrelation
exists between the spatial coordinates $x'$, $y'$ and $z'$ of
an event and the time instant $t'$ at which it occurs in an arbitrary
inertial frame of reference K'. Hence, the quantity
$[(x')^2 + (y')^2 + (z')^2 - c^2(t')^2]$ is independent of the
velocity V of frame K', i.e. it is identical in all inertial frames
of reference:

$$(x')^2 + (y')^2 + (z')^2 - c^2(t')^2 = x^2 + y^2 + z^2 - c^2t^2.$$


The coordinate $x'$ and time $t'$ cannot be imaginary. It follows,
therefore, from the Lorentz transformations that the velocity
of relative motion of any two inertial frames of reference cannot
exceed the velocity of light in free space ($V < c$).
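
The invariance of the quantity $x^2 + y^2 + z^2 - c^2t^2$ can be
checked numerically. In the Python sketch below the coordinates of
the event and the chosen velocities are arbitrary assumptions, and
the function names are introduced only for this illustration.

```python
import math

C = 299_792_458.0  # velocity of light in free space, m/s


def boost(event, v):
    """Lorentz transformation of an event (x, y, z, t) along axis OX."""
    x, y, z, t = event
    gamma = 1.0 / math.sqrt(1.0 - (v / C) ** 2)
    return (gamma * (x - v * t), y, z, gamma * (t - v * x / C ** 2))


def interval_squared(event):
    """The quantity x^2 + y^2 + z^2 - c^2 t^2 for an event."""
    x, y, z, t = event
    return x * x + y * y + z * z - (C * t) ** 2


# Hypothetical event in frame K (coordinates in metres, time in seconds).
event = (4.0e8, 1.0e8, -2.0e8, 1.5)
reference = interval_squared(event)
transformed = [interval_squared(boost(event, beta * C))
               for beta in (0.1, 0.5, 0.9)]
print(reference)
print(transformed)
```

Apart from rounding error, the same value is obtained in every frame,
although the individual coordinates and the time all change.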

5.3.4  According  to  Einstein’s  principle  of  relativity  (5.1.2), 
physical  laws  must  comply  with  the  condition  of  relativistic 
invariance  (Lorentz  invariance).  This  requirement  implies  that 
equations expressing physical laws retain their form in going
over from one inertial frame of reference to another in accordance
with the Lorentz transformations.

The Lorentz transformations reduce to the Galilean transformations
(2.8.1) at $V \ll c$ or, more precisely, in the limit as
$V/c \to 0$, i.e. as $c \to \infty$. In other words, the Galilean
transformations and classical (Newtonian) mechanics based on them are
founded on the assumption of instantaneous propagation of
interaction. Such an approximate approach is permissible only
in considering the laws of mechanical motion of bodies at velocities
a great many times less than the velocity of light in free
space.
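
How small the difference is at ordinary velocities can be estimated
with a short Python sketch. Since the Lorentz value of $x'$ equals
the Galilean value $x - Vt$ divided by $\sqrt{1 - V^2/c^2}$, the
relative deviation between them is $1/\sqrt{1 - V^2/c^2} - 1$. The
sample velocities are arbitrary assumptions.

```python
import math

C = 299_792_458.0  # velocity of light in free space, m/s


def lorentz_factor(v):
    """The factor 1 / sqrt(1 - V^2/c^2) of the Lorentz transformations."""
    return 1.0 / math.sqrt(1.0 - (v / C) ** 2)


def relative_deviation(v):
    """Relative difference between the Lorentz and Galilean values of x'."""
    return lorentz_factor(v) - 1.0


# Hypothetical velocities: a car, a planet in orbit, and two
# relativistic velocities.
for v in (30.0, 3.0e4, 0.1 * C, 0.6 * C):
    print(v, relative_deviation(v))
```

For a velocity of 30 km/s the deviation is of the order of $10^{-8}$,
whereas at 0.6c it reaches 25 per cent.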


5.4 Relativity of Lengths and Time Intervals. Time Interval Between Two Events

5.4.1 It follows from the Lorentz transformations (5.3.2) that
the linear dimension of a body travelling with respect to an
inertial frame of reference is reduced in the direction of motion.
This change in the longitudinal size of a body in motion is called
the Lorentz-FitzGerald contraction. Assume $l_0$ to be the length
of a rod at rest in reference frame K'. If the rod is located along
axis O'X' (Fig. 5.2), then $l_0 = x'_2 - x'_1$, where $x'_2$ and
$x'_1$ are the coordinates of the ends of the rod. The length $l$ of
the same rod in reference frame K, with respect to which it travels
along axis OX at the velocity V, is equal to the difference in the
coordinates of the ends of the rod, measured at the same instant of
time $t$:

$$l = x_2(t) - x_1(t) = (x'_2 - x'_1)\sqrt{1 - V^2/c^2} = l_0\sqrt{1 - V^2/c^2}.$$

Transverse dimensions of a body are independent of its velocity
and are identical in all inertial frames of reference:

$$y'_2 - y'_1 = y_2 - y_1 \quad \text{and} \quad z'_2 - z'_1 = z_2 - z_1.$$

Thus, the linear dimensions of a body are relative. They are
maximal in the inertial frame of reference with respect to which
the body is at rest. These are said to be the body’s proper
dimensions.
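
A minimal Python sketch of the contraction formula is given below.
The rod length and the velocities are assumed values used only as an
illustration.

```python
import math

C = 299_792_458.0  # velocity of light in free space, m/s


def contracted_length(proper_length, v):
    """Length of a rod moving lengthwise at velocity v, from l = l0*sqrt(1 - v^2/c^2)."""
    return proper_length * math.sqrt(1.0 - (v / C) ** 2)


# Hypothetical rod of proper length 1 m at several velocities.
lengths = {beta: contracted_length(1.0, beta * C)
           for beta in (0.001, 0.6, 0.99)}
print(lengths)
```

At 0.6c the rod is shortened to 0.8 of its proper length, while at
0.001c (about 300 km/s) the contraction is less than one part in a
million.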

5.4.2 The Lorentz-FitzGerald contraction is a kinematic effect
of the special theory of relativity. It is not due to the effect of
any kind of longitudinal forces compressing the body along its
direction of motion. This contraction becomes appreciable only at
velocities approaching that of light in free space. It follows from
the contraction formula that bodies cannot travel at velocities
$V > c$ because at $V = c$ the longitudinal dimension of the body
(in the direction of motion) becomes zero, and at $V > c$ it would
have to become imaginary.

5.4.3 It is evident from the Lorentz transformations that in
the theory of relativity we can specify a definite “instant of
time” only with respect to a single definite inertial frame of
reference. Thus, for instance, a single “instant of time” in
reference frame K (a single definite value of time $t$ in this frame)
corresponds to many values of time $t'$ in frame K', depending
upon the values of coordinate $x$:

$$t' = \frac{t - Vx/c^2}{\sqrt{1 - V^2/c^2}}.$$

Likewise, a single “instant of time” in frame K', i.e. a single
definite value of time $t'$, corresponds to many values of time $t$
in frame K, depending upon the values of coordinate $x'$:

$$t = \frac{t' + Vx'/c^2}{\sqrt{1 - V^2/c^2}}.$$

[Fig. 5.2: frames K and K' with a rod lying along axis OX; the ends
of the rod are at $x_1(t)$ and $x_2(t)$.]

5.4.4 Another important consequence of the Lorentz transformations
is the relativity of the time interval between any two
events (for example, the beginning and end of some process),
i.e. the dependence of this time interval on the choice of the
inertial frame of reference. Assume that two events, 1 and 2,
occur in the moving inertial frame of reference K' at the same
point A, fixed in frame K' (i.e. $x'_2 = x'_1$), at the instants of
time $t'_1$ and $t'_2$. Thus, the time interval between the events is
$\tau_0 = t'_2 - t'_1$.

With respect to fixed inertial frame of reference K, point A
travels with the same velocity V as does frame K'. Hence in
frame K events 1 and 2 occur at different points in space, having
the coordinates $x_1$ and $x_2$, and $x_2 - x_1 = V\tau$, where
$\tau = t_2 - t_1$ is the time interval between events 1 and 2
according to the clock in frame K. It follows from the Lorentz
transformations that

$$\tau = t_2 - t_1 = \frac{t'_2 - t'_1}{\sqrt{1 - V^2/c^2}} = \frac{\tau_0}{\sqrt{1 - V^2/c^2}}.$$

Thus, the time interval between two events is minimal in the
inertial frame of reference with respect to which both events
occur at the same point. The time measured by a clock travelling
together with some object is said to be the proper time
of the object.

5.4.5 The consequence of the Lorentz transformations that is
dealt with in the preceding subsection is evidence of the existence
of the relativistic effect of the slowing of time, called time
dilation, in a moving inertial frame of reference as compared
to a fixed one. A clock travelling at the velocity V with respect
to a given inertial frame runs slower than a fixed clock by a factor
of $1/\sqrt{1 - V^2/c^2}$. Correspondingly, in agreement with the
relativity principle, all physical processes in a moving frame
of reference proceed at a slower rate than in a fixed frame.

The time dilation effect becomes appreciable only at extremely high
velocities V, close to that of light in free space. This effect has
been demonstrated, for example, by experiments conducted with muons
(43.3.3). The muon is an unstable elementary particle. The mean
proper lifetime of muons (according to a clock in the inertial frame
of reference with respect to which they are at rest) is
$\tau_0 = 2.2 \times 10^{-6}$ s. Muons are created in the upper
layers of the atmosphere by the action of primary cosmic rays, and
they travel with respect to the earth at velocities V close to c.
If there were no relativistic effect of time dilation, a muon would
be capable of travelling an average distance, with respect to an
observer on the earth, of only $\tau_0 c \approx 660$ m through the
atmosphere. In other words, muons could not reach the earth's
surface. Actually, they are registered by instruments installed at
the surface of the earth, because the mean lifetime of a travelling
muon, according to the clock of the observer on the earth, is
$\tau = \tau_0/\sqrt{1 - V^2/c^2} > \tau_0$, and the path travelled
by a muon during this time is $\tau V > 660$ m.
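The muon estimate can be checked numerically. The following is a minimal sketch, not part of the handbook: the function names are hypothetical, and the value beta = 0.999 is an assumed illustrative velocity, since the text only says that V is close to c.

```python
import math

C = 299_792_458.0    # velocity of light in free space, m/s
TAU0_MUON = 2.2e-6   # mean proper lifetime of the muon, s (value from the text)


def lorentz_factor(beta):
    """Return 1/sqrt(1 - beta**2), where beta = V/c and 0 <= beta < 1."""
    if not 0.0 <= beta < 1.0:
        raise ValueError("beta must satisfy 0 <= beta < 1")
    return 1.0 / math.sqrt(1.0 - beta * beta)


def mean_path(beta, tau0=TAU0_MUON):
    """Mean path in the earth frame: tau * V with tau = tau0 / sqrt(1 - beta**2)."""
    return lorentz_factor(beta) * tau0 * beta * C


beta = 0.999  # assumed example value, not taken from the text
print(f"path without time dilation: {TAU0_MUON * C:.0f} m")
print(f"path with time dilation at beta = {beta}: {mean_path(beta):.0f} m")
```

With this assumed velocity the mean path comes out at several kilometres instead of about 660 m, which is the qualitative point of the paragraph above.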

5.4.6 The relativistic effect of time dilation in a spaceship
travelling with respect to the earth makes it feasible to undertake
space flights of almost unlimited length and journeys "into the
future". According to the relativity principle, all processes in a
spaceship, including the aging process of the astronauts, proceed
in agreement with the same laws as on the earth. But in the
spaceship time must be measured by a clock travelling together with
it at the velocity V with respect to the earth. If V is close to c
the clock in the ship runs considerably slower than earth clocks
(at the space-launch complex), by a factor of
$1/\sqrt{1 - V^2/c^2}$. For example, at $\beta = V/c = 0.99999$ the
rates of the spaceship and earth clocks differ by a factor of
approximately 224. Consequently, in a time interval of
$\tau_0 = 10$ years in such a spaceship, one can make a space
flight taking $\tau = 2240$ years according to earth clocks, aging
only 10 years during the flight. In this flight the spaceship can
travel a tremendous distance from the earth:
$l = V\tau = \beta c \tau = 2239.98$ light years*. The closer V is
to c, the greater the distance l travelled by the spaceship with
respect to the earth in the same interval $\tau_0$ of proper time
in the spaceship, i.e. the longer the space flight that the
astronauts can make during their lifetime.
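The spaceship figures can be reproduced with a few lines of arithmetic. This is a rough sketch with hypothetical function names; it works in units of years and light years, so that c = 1 light year per year. Note that the unrounded dilation factor at beta = 0.99999 is about 223.6, which the text rounds to 224 before multiplying.

```python
import math


def lorentz_factor(beta):
    """Return 1/sqrt(1 - beta**2) for beta = V/c."""
    return 1.0 / math.sqrt(1.0 - beta * beta)


def earth_time_and_distance(beta, proper_years):
    """Earth-clock duration (years) and distance (light years) of a flight
    that lasts proper_years according to the clock in the spaceship."""
    gamma = lorentz_factor(beta)
    earth_years = gamma * proper_years
    distance_ly = beta * earth_years  # l = beta * c * tau with c = 1 ly/year
    return gamma, earth_years, distance_ly


gamma, earth_years, distance_ly = earth_time_and_distance(0.99999, 10.0)
print(f"factor: {gamma:.1f}")
print(f"earth time: {earth_years:.0f} years, distance: {distance_ly:.0f} light years")
```

The small difference from the 2240 years quoted in the text comes only from rounding the factor to 224.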

If an astronaut completes a space flight at a velocity V close to c
by returning to the earth, he will find that the people remaining
on the earth (including his twin brother) have aged more than he
has during the flight. If V differs from c by a sufficiently small
amount, so that $(1 - V^2/c^2)^{-1/2} \gg 1$, the astronaut can,
during the flight, outlive all his contemporaries and find himself,
upon his return, among future generations.


* A light year is the distance travelled by light in free space in
one year: 1 light year = $9.4605 \times 10^{15}$ m.

5.4.7 It would seem, on the face of it, that, on the basis of the
relativity principle, we should arrive at a conclusion exactly the
opposite of that given in the preceding subsection. The clock on
the earth, travelling at the velocity -V with respect to the
spaceship, should run slower than the clock in the ship. Therefore,
the time required for the flight should be longer for the
astronaut, and not for the people left on the earth. Hence, the
twin that made the flight should age more than his brother.

We have thus found that the difference between the readings of the
clocks at the space-launch complex and in the ship is positive
according to 5.4.6, while, according to the reasoning above, it is
negative. This absurd result is called the clock, or twin, paradox.
Actually, no clock paradox exists. It arises from an improper
application of the relativity principle. This principle asserts the
equivalence not of all frames of reference, but only of all
inertial frames of reference. But the frame of reference of the
spaceship, in contrast to that of the earth, is not an inertial one
throughout the flight. During launching, turning around and braking
on the approach to the earth, the spaceship travels with
acceleration (or deceleration). Consequently, the problem
concerning the running of the clock at the launching complex, which
is always at rest with respect to a single inertial frame, and that
of the clock in the spaceship, is, in principle, not a symmetrical
one. Hence, the earth and ship frames of reference are not equally
justified in the given problem. The reasoning in Sect. 5.4.6 is the
correct one because it is based on the use of an inertial (the
earth) frame of reference. Accordingly, the reasoning given at the
beginning of Sect. 5.4.7, leading to the clock paradox, is
incorrect. In the second case, the general theory of relativity
should be applied, rather than the special theory. When this is
done, it is found that, from the point of view of the astronaut as
well, his clock runs slower than the clock at the launching
complex.

5.4.8 The interval, or space-time interval, between two events,
measured in the inertial frame of reference K', is the quantity

$$s'_{12} = \sqrt{c^2 (t'_{12})^2 - (l'_{12})^2},$$

where $t'_{12} = t'_2 - t'_1$ is the time interval between the two
events being considered (according to the clock in frame K'), and

$$l'_{12} = \sqrt{(x'_2 - x'_1)^2 + (y'_2 - y'_1)^2 + (z'_2 - z'_1)^2}$$

is the distance between the points at which events 1 and 2 occur,
also measured in frame of reference K'.

It follows from the Lorentz transformations that the interval
between two given events, 1 and 2, is invariant with respect to the
choice of the inertial frame of reference, i.e. it does not change
in transferring from moving inertial frame K' to fixed frame K:

$$s'_{12} = s_{12} = \text{inv}, \quad \text{where} \quad s_{12} = \sqrt{c^2 t_{12}^2 - l_{12}^2}.$$

If $s_{12}^2 > 0$, i.e. if $s_{12}$ is a real number, the interval
$s_{12}$ is said to be timelike.

The interval $s_{12}$ is said to be spacelike if $s_{12}^2 < 0$,
i.e. $s_{12}$ is an imaginary number.

5.4.9 It follows from the invariance of the interval with respect
to the choice of the inertial frame K' that in all frames K' the
values of $t'_{12}$ and $l'_{12}$ for the given two events, 1 and 2,
satisfy the equation of a hyperbola:

$$c^2 (t'_{12})^2 - (l'_{12})^2 = s_{12}^2.$$

If $s_{12}^2 > 0$ the relation between $t'_{12}$ and $l'_{12}$ in
various inertial frames of reference K', travelling with respect to
fixed frame K at any possible velocities ($0 \le V < c$), can be
represented graphically as the two branches I and II of a hyperbola
(Fig. 5.3).

Consequently, the sign of the time interval between events 1 and 2,
related by a timelike interval, is absolute. It is independent of
the choice of the frame of reference: in all frames K' the second
event always occurs either after the first, i.e. $t'_{12} > 0$
(branch I), or always before the first, i.e. $t'_{12} < 0$
(branch II). The distance $l'_{12}$ is relative; an inertial frame
K' can be chosen in which $l'_{12} = 0$, i.e. events 1 and 2 occur
at the same place (points A and B on the branches I and II of the
hyperbola).


Fig. 5.3

Two events related by causality should always correspond to a
timelike interval or, in any case, one equal to zero
($s_{12} = 0$). This is due to the fact that the signal by means of
which event 1 (the cause) leads to event 2 (the effect) cannot be
propagated in space at a velocity exceeding that of light in free
space: $l'_{12} \le c\,(t'_2 - t'_1)$.

5.4.10 In the case of events related by a spacelike interval
($s_{12}^2 < 0$), the sign of $t'_{12}$ is relative: $t'_{12} > 0$
(upper part of hyperbola III in Fig. 5.3) in certain inertial
frames K', whereas in others $t'_{12} < 0$ (lower part of hyperbola
III). Point C corresponds to the frame K' in which $t'_{12} = 0$,
i.e. events 1 and 2 occur simultaneously.
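The invariance of the interval and the timelike/spacelike distinction can be illustrated numerically. The sketch below is only an illustration under stated assumptions: units with c = 1, motion along OX only, and hypothetical function names. Because the Lorentz transformations are linear, they can be applied directly to the differences $t_{12}$ and $x_{12}$.

```python
import math


def boost(t, x, V, c=1.0):
    """Lorentz transformation from K to K' (K' moves at velocity V along OX)."""
    g = 1.0 / math.sqrt(1.0 - (V / c) ** 2)
    return g * (t - V * x / c**2), g * (x - V * t)


def interval_squared(dt, dx, dy=0.0, dz=0.0, c=1.0):
    """Square of the interval: c^2 t12^2 - l12^2."""
    return (c * dt) ** 2 - (dx**2 + dy**2 + dz**2)


def classify(s2, tol=1e-12):
    """Name the kind of interval from the sign of its square."""
    if s2 > tol:
        return "timelike"
    if s2 < -tol:
        return "spacelike"
    return "zero"


# Timelike pair: a frame exists in which both events occur at the same place.
dt1, dx1 = boost(5.0, 3.0, V=0.6)
print(classify(interval_squared(5.0, 3.0)), dt1, dx1)

# Spacelike pair: a frame exists in which both events are simultaneous.
dt2, dx2 = boost(3.0, 5.0, V=0.6)
print(classify(interval_squared(3.0, 5.0)), dt2, dx2)
```

In the first case the boosted spatial separation vanishes (points A and B of Fig. 5.3), in the second the boosted time separation vanishes (point C); in both cases the squared interval is the same before and after the transformation.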


5.5  Transformation  of  Velocities  and  Accelerations 
in  Relativistic  Kinematics 


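5.5.1 The velocities v and v' of a particle in two inertial frames
of reference K and K' are

$$\mathbf{v} = \frac{d\mathbf{r}}{dt} = v_x \mathbf{i} + v_y \mathbf{j} + v_z \mathbf{k} \quad \text{and} \quad \mathbf{v}' = \frac{d\mathbf{r}'}{dt'} = v'_{x'} \mathbf{i}' + v'_{y'} \mathbf{j}' + v'_{z'} \mathbf{k}',$$

where $\mathbf{r} = x\mathbf{i} + y\mathbf{j} + z\mathbf{k}$ and
$\mathbf{r}' = x'\mathbf{i}' + y'\mathbf{j}' + z'\mathbf{k}'$ are the
radius vectors of the particle in frames K and K'. The projections
of velocities v and v' on the axes of Cartesian coordinates of
frames K and K' are

$$v_x = \frac{dx}{dt}, \quad v_y = \frac{dy}{dt}, \quad v_z = \frac{dz}{dt}$$

and

$$v'_{x'} = \frac{dx'}{dt'}, \quad v'_{y'} = \frac{dy'}{dt'}, \quad v'_{z'} = \frac{dz'}{dt'}.$$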

If the like axes of the Cartesian coordinates of frames K' and K
are pairwise parallel, and frame K' travels with respect to frame K
at the constant velocity V along axis OX (Fig. 2.3, see 2.8.1) and,
in addition, if at the zero time references in frames K and K'
(t = 0 and t' = 0), the origins O and O' of coordinates of these
frames coincide, the Lorentz transformations in the form given in
5.3.2 are valid. It follows from these transformations that the
relations between the projections of the velocities of the particle
on the axes of the Cartesian coordinates of frames K and K' are of
the form:

$$v'_{x'} = \frac{v_x - V}{1 - V v_x/c^2}, \qquad v_x = \frac{v'_{x'} + V}{1 + V v'_{x'}/c^2},$$

$$v'_{y'} = \frac{v_y \sqrt{1 - V^2/c^2}}{1 - V v_x/c^2}, \qquad v_y = \frac{v'_{y'} \sqrt{1 - V^2/c^2}}{1 + V v'_{x'}/c^2},$$

$$v'_{z'} = \frac{v_z \sqrt{1 - V^2/c^2}}{1 - V v_x/c^2}, \qquad v_z = \frac{v'_{z'} \sqrt{1 - V^2/c^2}}{1 + V v'_{x'}/c^2}.$$

These equations express the law of the composition of velocities in
relativistic kinematics. In the limit as $c \to \infty$ these
equations lead to the ordinary law for the composition of
velocities in classical mechanics (2.8.2):

$$v'_{x'} = v_x - V, \quad v'_{y'} = v_y, \quad v'_{z'} = v_z \quad \text{and} \quad \mathbf{v}' = \mathbf{v} - \mathbf{V}.$$

5.5.2 The relations between the squares of the magnitudes of
vectors v and v' are

$$v'^2 = c^2 \left[ 1 - \frac{(1 - v^2/c^2)(1 - V^2/c^2)}{(1 - V v_x/c^2)^2} \right]$$

and

$$v^2 = c^2 \left[ 1 - \frac{(1 - v'^2/c^2)(1 - V^2/c^2)}{(1 + V v'_{x'}/c^2)^2} \right].$$

In particular, if v' = c, then v = c and vice versa. Hence, if the
velocity of a particle with respect to some inertial frame of
reference is equal to the velocity of light in free space, then it
must have the same magnitude with respect to any other inertial
frame regardless of the velocity of relative motion of these two
frames. In other words, the sum of two velocities, of which one is
equal to c, is always equal to c. This law, detected in the motion
of such elementary particles as photons (37.1.4) and the neutrino
(Table 43.1), demonstrates the limiting nature of the velocity of
light in free space (5.1.3).

5.5.3 It is evident from the equations in the preceding subsection
that a particle, travelling with respect to some inertial frame of
reference at a velocity less than c, has a velocity with respect to
any other inertial frame that is also less than c (for instance, if
v < c, then v' < c, and vice versa). From this it follows, among
other things, that no matter how close to c the velocities of two
particles are, their relative velocity is always less than c.
Assume, for example, that two particles are travelling toward each
other along the axis OX of frame K at the respective velocities
$\mathbf{v}_1 = 0.8c\,\mathbf{i}$ and
$\mathbf{v}_2 = -0.8c\,\mathbf{i}$. The velocity $\mathbf{u}_{21}$
of the second particle with respect to the first is not equal, as
is assumed in classical mechanics, to the vector difference
$\mathbf{v}_2 - \mathbf{v}_1 = -1.6c\,\mathbf{i}$, if only because
the magnitude of this velocity exceeds c. The required velocity is
equal to the velocity of the second particle with respect to
inertial frame K', travelling together with the first particle
($\mathbf{V} = 0.8c\,\mathbf{i}$), i.e.
$\mathbf{u}_{21} = \mathbf{v}'_2$. It follows from the equations in
5.5.1 that

$$v'_{2x'} = \frac{v_{2x} - V}{1 - V v_{2x}/c^2} = -\frac{1.6c}{1 + 0.64} = -0.976c, \qquad v'_{2y'} = v'_{2z'} = 0,$$

i.e. $\mathbf{u}_{21} = -0.976c\,\mathbf{i}'$ and
$|\mathbf{u}_{21}| < c$.
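The worked example can be reproduced with a short function that applies the composition law of 5.5.1. This is a minimal sketch with a hypothetical function name; velocities are given in units of c by default.

```python
import math


def transform_velocity(vx, vy, vz, V, c=1.0):
    """Velocity components in K' of a particle whose components in K are
    (vx, vy, vz); K' moves along OX of K at the velocity V."""
    denom = 1.0 - V * vx / c**2
    root = math.sqrt(1.0 - (V / c) ** 2)
    return (vx - V) / denom, vy * root / denom, vz * root / denom


# Second particle (v2 = -0.8c) seen from the frame of the first (V = 0.8c).
u21 = transform_velocity(-0.8, 0.0, 0.0, V=0.8)
print(f"u21x = {u21[0]:.3f} c")

# A velocity equal to c stays equal to c in the moving frame.
print(transform_velocity(1.0, 0.0, 0.0, V=0.8)[0])
```

The first result agrees with the value -0.976c obtained above; the second illustrates the statement of 5.5.2 that a velocity equal to c is the same in every inertial frame.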

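5.5.4 The projections of the acceleration of a particle on the axes
of the Cartesian coordinates of two inertial frames of reference K
and K' (5.5.1) are related by the following equations:

$$a'_{x'} = \frac{dv'_{x'}}{dt'} = \frac{(1 - V^2/c^2)^{3/2}}{(1 - V v_x/c^2)^3}\, a_x,$$

$$a'_{y'} = \frac{dv'_{y'}}{dt'} = \frac{1 - V^2/c^2}{(1 - V v_x/c^2)^3} \left[ \left( 1 - \frac{V v_x}{c^2} \right) a_y + \frac{V v_y}{c^2}\, a_x \right],$$

$$a'_{z'} = \frac{dv'_{z'}}{dt'} = \frac{1 - V^2/c^2}{(1 - V v_x/c^2)^3} \left[ \left( 1 - \frac{V v_x}{c^2} \right) a_z + \frac{V v_z}{c^2}\, a_x \right]$$

and

$$a_x = \frac{dv_x}{dt} = \frac{(1 - V^2/c^2)^{3/2}}{(1 + V v'_{x'}/c^2)^3}\, a'_{x'},$$

$$a_y = \frac{dv_y}{dt} = \frac{1 - V^2/c^2}{(1 + V v'_{x'}/c^2)^3} \left[ \left( 1 + \frac{V v'_{x'}}{c^2} \right) a'_{y'} - \frac{V v'_{y'}}{c^2}\, a'_{x'} \right],$$

$$a_z = \frac{dv_z}{dt} = \frac{1 - V^2/c^2}{(1 + V v'_{x'}/c^2)^3} \left[ \left( 1 + \frac{V v'_{x'}}{c^2} \right) a'_{z'} - \frac{V v'_{z'}}{c^2}\, a'_{x'} \right].$$


5.6 Basic Law of Relativistic Dynamics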

5.6.1 In contrast to classical mechanics, the mass of a particle is
not constant in relativistic mechanics. It depends upon the
velocity v of the particle. Its value m differs in two inertial
frames of reference moving with respect to each other. The
dependence of mass on velocity is expressed by the equation:

$$m = \frac{m_0}{\sqrt{1 - v^2/c^2}},$$

where $m_0$ is the rest mass of the particle, i.e. its mass
measured in the inertial frame of reference with respect to which
the particle is at rest, and c is the velocity of light in free
space. The mass m is frequently called the relativistic mass.

The effect of the velocity of a particle on its mass becomes
appreciable only at values of v close to c. For example,
$m/m_0 \approx 1.005$ at v/c = 0.1 and $m/m_0 = 2.29$ at v/c = 0.9.
It is evident from the law for the dependence of m on v that
particles with a nonzero rest mass ($m_0 \neq 0$) cannot travel at
velocities greater than or equal to c (i.e. v < c). At the same
time, particles whose rest mass equals zero [photons (37.1.4) and
the neutrino (Table 43.1)] cannot have a velocity other than c.
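The two ratios quoted above follow directly from the mass-velocity equation of 5.6.1. A minimal sketch (the function name is hypothetical) that tabulates $m/m_0$ for a few values of v/c:

```python
import math


def mass_ratio(beta):
    """Return m/m0 = 1/sqrt(1 - beta**2) for beta = v/c, 0 <= beta < 1."""
    if not 0.0 <= beta < 1.0:
        raise ValueError("a particle with nonzero rest mass requires v < c")
    return 1.0 / math.sqrt(1.0 - beta * beta)


for beta in (0.1, 0.5, 0.9, 0.99):
    print(f"v/c = {beta}: m/m0 = {mass_ratio(beta):.3f}")
```

The ratio grows without bound as v/c approaches 1, which is why the function rejects beta = 1 for a particle with nonzero rest mass.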

5.6.2 The momentum p = mv of a particle is a nonlinear function of
its velocity:

$$\mathbf{p} = \frac{m_0 \mathbf{v}}{\sqrt{1 - v^2/c^2}}.$$

Vector p is sometimes called the relativistic momentum of the
particle (to distinguish it from the value $m_0\mathbf{v}$ of its
momentum in classical mechanics). It is obvious that at
$v \ll c$, the momentum
$\mathbf{p} = m\mathbf{v} \approx m_0\mathbf{v}$.

By virtue of the homogeneity of space (2.7.1), the law of
conservation of relativistic momentum is valid in relativistic
mechanics: the momentum of a closed system (2.2.4) does not change
in the course of time.

Following from this law is the law of conservation of relativistic
mass: in any processes that take place in a closed system, its
total relativistic mass remains unchanged.

5.6.3 The basic law of relativistic dynamics is: the rate of change
in the momentum of a particle is equal to the force F acting on it,
i.e.

$$\frac{d\mathbf{p}}{dt} = \mathbf{F}, \quad \text{or} \quad \frac{d}{dt} \left( \frac{m_0 \mathbf{v}}{\sqrt{1 - v^2/c^2}} \right) = \mathbf{F}.$$

Note.  If  the  particle  is  subject  to  the  simultaneous  action  of 
several  forces,  F  is  understood  to  be  the  resultant  force  (2.2.2). 

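5.6.4 An element of work done by force F in the infinitely small
displacement dr of its point of application is

$$\delta W = (\mathbf{F}\,d\mathbf{r}) = (\mathbf{F}\mathbf{v})\,dt.$$

Combining the basic law of relativistic dynamics (5.6.3) and the
dependence of mass on the velocity (5.6.1) we obtain

$$\mathbf{F} = m \frac{d\mathbf{v}}{dt} + \mathbf{v} \frac{dm}{dt}$$

and

$$\frac{dm}{dt} = \frac{m v}{c^2 - v^2} \frac{dv}{dt}.$$

Hence

$$\delta W = (\mathbf{F}\mathbf{v})\,dt = m\,(\mathbf{v}\,d\mathbf{v}) + v^2\,dm = m v\,dv + v^2\,dm = c^2\,dm.$$
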
5.6.5 The acceleration imparted to a particle by the force F is

$$\mathbf{a} = \frac{d\mathbf{v}}{dt} = \frac{\mathbf{F}}{m} - \frac{\mathbf{v}}{m} \frac{dm}{dt},$$

or, since $dm/dt = (\mathbf{F}\mathbf{v})/c^2$ (5.6.4),

$$\mathbf{a} = \frac{1}{m} \left[ \mathbf{F} - \frac{\mathbf{v}\,(\mathbf{F}\mathbf{v})}{c^2} \right].$$

Hence, in contrast to classical mechanics, the acceleration of a
particle in relativistic mechanics does not, in general, coincide
in direction with the force imparting the acceleration. Vector a is
collinear with force F in only two cases:

(a) when force F is perpendicular to the velocity v of the particle
(transverse force) so that $(\mathbf{F}\mathbf{v}) = 0$ and

$$\mathbf{a} = \frac{\mathbf{F}}{m} = \frac{\mathbf{F}}{m_0} \sqrt{1 - \frac{v^2}{c^2}};$$

(b) when force F is parallel to the velocity vector v of the
particle (longitudinal force) so that
$\mathbf{v}\,(\mathbf{F}\mathbf{v}) = v^2 \mathbf{F}$ and

$$\mathbf{a} = \frac{\mathbf{F}}{m} \left( 1 - \frac{v^2}{c^2} \right) = \frac{\mathbf{F}}{m_0} \left( 1 - \frac{v^2}{c^2} \right)^{3/2}.$$

A longitudinal force imparts to a particle an acceleration that is
smaller than that imparted by a transverse force of the same
magnitude: the ratio of the two accelerations is $1 - v^2/c^2$.
This is due to the fact that a transverse force changes the
velocity of a particle only in direction (the magnitude v of the
velocity and the relativistic mass are not changed), whereas a
longitudinal force changes the magnitude of the velocity of a
particle as well as its mass.
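The two special cases can be checked against the general expression for the acceleration, written here as $\mathbf{a} = [\mathbf{F} - \mathbf{v}(\mathbf{F}\mathbf{v})/c^2]/m$, which follows from 5.6.4 and 5.6.5. The sketch below assumes units with c = 1 and $m_0 = 1$ by default; the function name is hypothetical.

```python
import math


def acceleration(F, v, m0=1.0, c=1.0):
    """Acceleration a = (F - v (F.v)/c^2) / m of a particle with velocity v
    acted on by force F; F and v are 3-component tuples."""
    v2 = sum(vi * vi for vi in v)
    m = m0 / math.sqrt(1.0 - v2 / c**2)          # relativistic mass (5.6.1)
    Fv = sum(Fi * vi for Fi, vi in zip(F, v))    # scalar product (F v)
    return tuple((Fi - vi * Fv / c**2) / m for Fi, vi in zip(F, v))


v = (0.6, 0.0, 0.0)
a_transverse = acceleration((0.0, 1.0, 0.0), v)    # F perpendicular to v
a_longitudinal = acceleration((1.0, 0.0, 0.0), v)  # F parallel to v

print(a_transverse[1], a_longitudinal[0])
print("ratio:", a_longitudinal[0] / a_transverse[1])
```

For v = 0.6c the ratio of the longitudinal to the transverse acceleration comes out close to 1 - 0.36 = 0.64, in agreement with the factor $1 - v^2/c^2$.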


5.7  Mass-Energy  Relation 

5.7.1 The increment of the kinetic energy $E_k$ of a particle is
equal to the work done by the force F acting on the particle
(5.6.4):

$$dE_k = \delta W = c^2\,dm,$$

where dm is the corresponding increment of relativistic mass of the
particle. It follows that

$$E_k = (m - m_0)\,c^2 = m_0 c^2 \left[ \frac{1}{\sqrt{1 - v^2/c^2}} - 1 \right],$$

where $m_0$ is the rest mass of the particle. Expanding
$(1 - v^2/c^2)^{-1/2}$ into a Maclaurin series we obtain

$$E_k = \frac{m_0 v^2}{2} \left( 1 + \frac{3}{4} \frac{v^2}{c^2} + \frac{5}{8} \frac{v^4}{c^4} + \dots \right).$$

At $v \ll c$ this equation reduces to the ordinary expression for
the kinetic energy in classical mechanics:

$$E_k \approx \frac{m_0 v^2}{2} \approx \frac{m v^2}{2}.$$
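To see how quickly the classical expression becomes inaccurate, the relativistic and classical kinetic energies can be compared numerically. This is an illustrative sketch in units with $m_0 = 1$ and c = 1; the function names are hypothetical.

```python
import math


def kinetic_relativistic(beta, m0=1.0, c=1.0):
    """E_k = m0 c^2 (1/sqrt(1 - beta^2) - 1), with beta = v/c."""
    return m0 * c**2 * (1.0 / math.sqrt(1.0 - beta * beta) - 1.0)


def kinetic_classical(beta, m0=1.0, c=1.0):
    """E_k = m0 v^2 / 2, with v = beta * c."""
    return 0.5 * m0 * (beta * c) ** 2


def ratio(beta):
    """Relativistic kinetic energy divided by the classical value."""
    return kinetic_relativistic(beta) / kinetic_classical(beta)


for beta in (0.01, 0.1, 0.5, 0.9):
    print(f"v/c = {beta}: E_k(rel)/E_k(class) = {ratio(beta):.4f}")
```

At v/c = 0.1 the two expressions differ by less than one percent, consistent with the leading correction term $\tfrac{3}{4} v^2/c^2$ of the series expansion, while at v/c = 0.9 the relativistic value is more than three times the classical one.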


5.7.2 It follows from the first equation in the preceding
subsection that an increase in the kinetic energy of a body should
be accompanied by a corresponding increase in its relativistic
mass m:

$$dm = \frac{1}{c^2}\,dE_k.$$

Changes in other kinds of energy of a body are also accompanied by
changes in its relativistic mass. If, for example, in heating a
body at rest its internal energy (9.1.2) is increased by the amount
dU, the mass m of the body, equal to its rest mass $m_0$, is
increased by

$$dm = dm_0 = \frac{1}{c^2}\,dU.$$

In the general case, a change in the total energy E of the body by
the amount dE is accompanied by a change in its relativistic mass
by the amount

$$dm = \frac{1}{c^2}\,dE.$$

Accordingly, the following universal relation exists between E
and m:

$$E = mc^2 = \frac{m_0 c^2}{\sqrt{1 - v^2/c^2}}.$$

This is known as the mass-energy relation: the total energy of a
body (or system) is equal to the product of the relativistic mass
of the body (or system) by the square of the velocity of light in
free space.

By virtue of the uniformity of time (3.4.2), the law of the
conservation of energy is valid in relativistic mechanics as it is
in classical mechanics: the total energy of a closed system (2.2.4)
does not change in the course of time.

It  follows  from  the  mass-energy  relation  that  the  laws  of  the 
conservation  of  relativistic  mass  (5.6.2)  and  of  the  total  energy 
are  not  independent  laws. 

5.7.3 The total energy of a particle or a system of particles (for
instance, of an atomic nucleus, atom, molecule or body) at rest is
equal to

$$E_0 = m_0 c^2,$$

where $m_0$ is the rest mass; $E_0$ is called the rest energy of
the particle or system. The values of $m_0$ and $E_0$ are
independent of the choice of the inertial frame of reference. These
quantities are constant characteristics of structureless
(elementary) particles, similar, for example, to their electric
charge and spin (Table 43.1). The rest mass and rest energy of a
system of particles depend upon the composition of the system and
its internal state. For example, the rest mass of an excited
nucleus (or atom) is greater than the rest mass of the same nucleus
(or atom) in the ground state.

5.7.4 The total energy E of a particle and its momentum p are
related by the equations:

$$\mathbf{p} = \frac{E}{c^2}\,\mathbf{v}, \quad \text{or} \quad \frac{E^2}{c^2} - p^2 = m_0^2 c^2, \quad \text{or} \quad E = \sqrt{p^2 c^2 + m_0^2 c^4}.$$

The values of the total energy, relativistic mass and momentum of a
given particle, in contrast to its rest mass $m_0$, are relative,
i.e. they are different in two inertial frames of reference K
(E, m and p) and K' (E', m' and p'). But the difference between the
square of the total energy of a particle divided by $c^2$ and the
square of the momentum of the particle, like the interval between
two events (5.4.8), does not depend upon the choice of the inertial
frame of reference:

$$\frac{E'^2}{c^2} - (p')^2 = \frac{E^2}{c^2} - p^2 = m_0^2 c^2.$$


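5.7.5 In going over from one inertial frame K to another K' that is
travelling with respect to the first at the velocity V = const
along the OX axis (Fig. 2.3, see 2.8.1), the projections of the
momentum of the particle on the coordinate axes and its total
energy are transformed as follows:

$$p'_{x'} = \frac{p_x - VE/c^2}{\sqrt{1 - V^2/c^2}}, \quad p'_{y'} = p_y, \quad p'_{z'} = p_z, \quad E' = \frac{E - V p_x}{\sqrt{1 - V^2/c^2}}$$

and

$$p_x = \frac{p'_{x'} + VE'/c^2}{\sqrt{1 - V^2/c^2}}, \quad p_y = p'_{y'}, \quad p_z = p'_{z'}, \quad E = \frac{E' + V p'_{x'}}{\sqrt{1 - V^2/c^2}}.$$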
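The invariance of $E^2/c^2 - p^2$ under the transformation of energy and momentum can be verified numerically. The sketch below is an illustration only: it assumes motion along OX (so that $p_y = p_z = 0$), units with c = 1, and hypothetical function names.

```python
import math


def energy_momentum(m0, v, c=1.0):
    """Total energy E = m c^2 and momentum p_x = m v of a particle moving
    along OX, where m = m0 / sqrt(1 - v^2/c^2)."""
    g = 1.0 / math.sqrt(1.0 - (v / c) ** 2)
    return g * m0 * c**2, g * m0 * v


def boost_energy_momentum(E, px, V, c=1.0):
    """Transform (E, p_x) from frame K to frame K' moving at V along OX."""
    g = 1.0 / math.sqrt(1.0 - (V / c) ** 2)
    return g * (E - V * px), g * (px - V * E / c**2)


def invariant(E, px, c=1.0):
    """E^2/c^2 - p^2, which should equal m0^2 c^2 in every inertial frame."""
    return E**2 / c**2 - px**2


E, px = energy_momentum(m0=1.0, v=0.6)
for V in (0.6, -0.5, 0.9):
    E2, px2 = boost_energy_momentum(E, px, V)
    print(f"V = {V}: E' = {E2:.4f}, p' = {px2:.4f}, invariant = {invariant(E2, px2):.6f}")
```

For V = 0.6 the new frame moves together with the particle, so the momentum vanishes and the energy reduces to the rest energy $m_0 c^2$; in every frame the invariant stays equal to $m_0^2 c^2$.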


5.7.6 It does not at all follow from the laws of conservation of
relativistic mass (5.6.2) and of total energy (5.7.2) that the rest
mass and rest energy of a closed system cannot change. For
instance, the sum of the rest masses of free protons and neutrons
is always greater than the rest mass of the atomic nucleus they
form.

A quantity introduced to characterize the stability of bound
systems (for example, atomic nuclei, atoms and molecules) is the
binding energy. The binding energy of a system is measured as the
least amount of work that must be done to separate the system into
its component parts (for instance, an atom into its nucleus and
electrons). The binding energy of a system is

$$E_b = \sum_{i=1}^{n} m_{0i} c^2 - M_0 c^2,$$

where $M_0$ is the rest mass of a system of n particles, and
$m_{0i}$ is the rest mass of the ith particle in the free state.
The quantity

$$\Delta m = \sum_{i=1}^{n} m_{0i} - M_0$$

is sometimes called the mass defect of the system.
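As an illustration of the binding energy formula, the following sketch estimates the binding energy of the helium-4 nucleus. The masses and the conversion factor are rounded, commonly tabulated values supplied here as assumptions; they are not taken from this handbook, and the function names are hypothetical.

```python
# Rounded reference values (assumptions, not from the handbook):
U_TO_MEV = 931.494          # energy equivalent of one atomic mass unit, MeV
M_PROTON_U = 1.007276       # rest mass of the free proton, u
M_NEUTRON_U = 1.008665      # rest mass of the free neutron, u
M_HE4_NUCLEUS_U = 4.001506  # rest mass of the helium-4 nucleus, u


def mass_defect_u(free_masses_u, bound_mass_u):
    """Mass defect: sum of free rest masses minus rest mass of the system (u)."""
    return sum(free_masses_u) - bound_mass_u


def binding_energy_mev(free_masses_u, bound_mass_u):
    """Binding energy E_b = (sum of m0i - M0) c^2, expressed in MeV."""
    return mass_defect_u(free_masses_u, bound_mass_u) * U_TO_MEV


nucleons = [M_PROTON_U, M_PROTON_U, M_NEUTRON_U, M_NEUTRON_U]
print(f"mass defect: {mass_defect_u(nucleons, M_HE4_NUCLEUS_U):.6f} u")
print(f"binding energy: {binding_energy_mev(nucleons, M_HE4_NUCLEUS_U):.1f} MeV")
```

With these inputs the mass defect is about 0.03 u, which corresponds to a binding energy of roughly 28 MeV.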


CHAPTER  6  GRAVITATION 


6.1  Law  of  Universal  Gravitation 


6.1.1 Isaac Newton's law of universal gravitation states that the
force of mutual attraction between any two particles is directly
proportional to the product of their masses and inversely
proportional to the square of the distance between them. Such
forces are called gravitational forces. If $m_1$ and $m_2$ are the
masses of the particles being considered and $\mathbf{r}_1$ and
$\mathbf{r}_2$ are the radius vectors of these particles, then,
according to the law of universal gravitation, the force
$\mathbf{F}_{12}$ attracting the first particle toward the second
particle, and the force $\mathbf{F}_{21}$ attracting the second
particle toward the first particle are equal in magnitude and
opposite in direction; they are equal to:

$$\mathbf{F}_{12} = -G \frac{m_1 m_2}{r_{12}^3}\,\mathbf{r}_{12} \quad \text{and} \quad \mathbf{F}_{21} = -G \frac{m_1 m_2}{r_{21}^3}\,\mathbf{r}_{21}.$$

Here $\mathbf{r}_{12} = \mathbf{r}_1 - \mathbf{r}_2$ and
$\mathbf{r}_{21} = \mathbf{r}_2 - \mathbf{r}_1$ are radius vectors
from the second particle to the first and from the first to the
second, respectively, and
$r_{12} = r_{21} = |\mathbf{r}_{12}| = |\mathbf{r}_{21}|$ is the
distance between the particles. The proportionality factor G is
called the constant of universal gravitation, or the gravitational
constant. In value the gravitational constant is equal to the force
of mutual attraction of two particles of unit mass with a distance
of unit length between them. It has been found from experiments
that

$$G = (6.6720 \pm 0.0041) \times 10^{-11}\ \text{N·m}^2/\text{kg}^2.$$


Fig. 6.1


6.1.2 The gravitational interaction of two bodies of arbitrary size
and shape (Fig. 6.1) is described by the equation

$$\mathbf{F}_{12} = -G \int_{(V_1)} \rho_1\,dV_1 \int_{(V_2)} \frac{\rho_2\,\mathbf{r}_{12}}{r_{12}^3}\,dV_2,$$

where $\mathbf{r}_{12}$ is the radius vector from the infinitely
small element $dV_2$ of volume of the second body to the infinitely
small element $dV_1$ of volume of the first body, $\rho_1$ and
$\rho_2$ are the densities of the two elements, and integration is
carried out throughout the whole volume of both bodies.

The calculations of force $\mathbf{F}_{12}$ are substantially
simplified in two cases:

(a) when there is a spherically symmetrical distribution of the
masses in the two bodies, i.e. both bodies are of spherical shape
and the density of each body depends only upon the distance to its
centre (in particular, the bodies can be homogeneous);

(b) when one of the bodies is of negligibly small size with respect
to the other body in which the distribution of its mass is
spherically symmetrical.

In these cases

$$\mathbf{F}_{12} = -G \frac{m_1 m_2}{r_{12}^3}\,\mathbf{r}_{12},$$

where $m_1$ and $m_2$ are the masses of the bodies, and
$\mathbf{r}_{12}$ is the radius vector connecting the centres of
mass (2.3.3) of the second and first bodies.

6.1.3 As a first approximation, it can be assumed that the earth is
of spherical shape and the distribution of its mass is spherically
symmetrical. Hence the gravitational force F, exerted by the earth
on a body of mass m, is directed toward the centre of the earth and
its magnitude is

$$F = G \frac{m M_\oplus}{r^2},$$

where $M_\oplus$ is the mass of the earth and r is the distance
from the body to the earth's centre (the size of any body on earth
is negligibly small compared to the radius of the earth).

6.1.4 In application to such microscopic objects as elementary
particles, gravitational interaction is practically of no
importance whatsoever. It is found to be extremely weak in
comparison to all other kinds of interaction: strong,
electromagnetic and weak interaction (43.2.5 through 43.2.8). For
example, the electric force of mutual repulsion of two electrons
exceeds their gravitational force by over $10^{42}$ times! Even for
ordinary macroscopic objects on the earth the forces of
gravitational interaction are extremely small. Two homogeneous
spheres, for instance, with a mass of 1000 kg each and a distance
of one metre between their centres are attracted to each other with
a force of only $7 \times 10^{-5}$ N.
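Both numerical statements can be checked with a few lines of arithmetic. In the sketch below G is the value quoted in 6.1.1; the Coulomb constant, elementary charge and electron mass are rounded reference values supplied as assumptions, and the function names are hypothetical.

```python
G = 6.672e-11            # gravitational constant, N*m^2/kg^2 (value from 6.1.1)
K_COULOMB = 8.9876e9     # Coulomb constant, N*m^2/C^2 (assumed reference value)
E_CHARGE = 1.6022e-19    # elementary charge, C (assumed reference value)
M_ELECTRON = 9.1094e-31  # electron rest mass, kg (assumed reference value)


def gravitational_force(m1, m2, r):
    """Magnitude of the attraction between two particles or two
    spherically symmetrical bodies whose centres are a distance r apart."""
    return G * m1 * m2 / r**2


def electric_to_gravitational_ratio():
    """Ratio of the electric repulsion to the gravitational attraction of
    two electrons; the distance cancels because both forces vary as 1/r^2."""
    return K_COULOMB * E_CHARGE**2 / (G * M_ELECTRON**2)


print(f"two 1000 kg spheres at 1 m: {gravitational_force(1000.0, 1000.0, 1.0):.2e} N")
print(f"electric / gravitational for two electrons: {electric_to_gravitational_ratio():.1e}")
```

The first result rounds to the $7 \times 10^{-5}$ N quoted above, and the second is a few times $10^{42}$, consistent with "over $10^{42}$ times".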

But, at the same time, gravitational forces are the determining
factors in the motion of objects that are investigated in astronomy
and astronautics (spaceships, planets and their satellites,
planetary systems, stars, etc.). This is due, primarily, to the
huge size of astronomical bodies and, in the second place, to the
small electromagnetic forces of interaction of such bodies, which,
as a whole, are practically electroneutral.


6.2 Gravitational Field


6.2.1 Gravitational interaction between bodies is due to the
gravitational field set up by the bodies. A distinctive feature of
a gravitational field is that a particle placed in this field is
subject to a force proportional to the mass of the particle.

The force characteristic of a gravitational field is its intensity,
the vector quantity $\mathbf{g}$, which is equal to the ratio of
the force $\mathbf{F}$, exerted on the particle by the field in
which it is placed, to the mass m of the particle:

$$\mathbf{g} = \frac{\mathbf{F}}{m}.$$

The intensity of a gravitational field is independent of the mass m
of the particle. It is a function of the coordinates (x, y, z) of
points of the field being considered. In the case of an unsteady
field the intensity also depends upon the time t.

A gravitational field is steady, or stationary (2.2.1), if the body
that sets it up is fixed in the frame of reference chosen to
describe the field. The intensity of a steady gravitational field
depends only upon the coordinates:
$\mathbf{g} = \mathbf{g}(x, y, z)$.

It follows from Newton's second law (2.4.3) that the forces of a
gravitational field impart an acceleration a to a free particle in
the field. This acceleration equals the intensity of the field:

$$\mathbf{a} = \frac{\mathbf{F}}{m} = \mathbf{g}.$$

6.2.2 It follows from the law of universal gravitation (6.1.1) that
the intensity of the gravitational field of a fixed particle of
mass M, located at the origin of coordinates, equals

$$\mathbf{g} = -G \frac{M}{r^3}\,\mathbf{r},$$

where $\mathbf{r}$ is the radius vector of the point being
considered in the field.

A gravitational field is a conservative, or potential, one (3.1.6)
because the force acting on a particle of mass m brought into the
field is a central force (3.3.4):

$$\mathbf{F} = m\mathbf{g} = -G \frac{mM}{r^3}\,\mathbf{r}.$$

Correspondingly, the potential energy of a particle in such a field
is equal to (3.3.4):*

$$E_p = \int_r^\infty F_r\,dr = -GmM \int_r^\infty \frac{dr}{r^2} = -G \frac{mM}{r}.$$

It is equally correct to regard the quantity $E_p$ as the potential
energy of a particle of mass M in a gravitational field set up by a
particle of mass m or, finally, as the mutual potential energy of
two particles due to their gravitational interaction.

6.2.3 A gravitational field complies with the principle of
superposition of fields: in superposing several (n) gravitational
fields their intensities at each point in space are added
vectorially, i.e. the intensity of the resultant field is

$$\mathbf{g}_f = \sum_{i=1}^{n} \mathbf{g}_{fi},$$

where gfi is the intensity of only the ith field at the point being
considered in space.

The intensity of the gravitational field of an arbitrary system
consisting of n fixed particles is

$$\mathbf{g}_f = -G \sum_{i=1}^{n} \frac{m_i}{\rho_i^3}\,\boldsymbol{\rho}_i,$$

where mi is the mass of the ith particle and ρi = r − ri is the
radius vector from the ith particle, whose radius vector is equal
to ri, to the point in the field specified by radius vector r.
Consequently, the potential energy of a particle of mass m in
this gravitational field is

$$E_p = -Gm \sum_{i=1}^{n} \frac{m_i}{\rho_i}.$$


* Here and throughout Sects. 6.2 and 6.3 an infinitely distant
point is chosen as the reference point for the potential energy,
i.e. it is assumed that Ep(∞) = 0.

In particular, if the gravitational field is set up by a body whose
mass M is distributed in a spherically symmetrical manner
(6.1.2), then outside this body

$$\mathbf{g}_f = -G\,\frac{M}{r^3}\,\mathbf{r} \quad\text{and}\quad E_p = -G\,\frac{mM}{r},$$

where r is a radius vector from the centre of the body to the
point being considered in the field. These equations are valid,
for example, for the gravitational field of the earth.
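
The principle of superposition of 6.2.3 can be sketched numerically as a vector sum over the particles of the system. In the Python sketch below the helper names, the value of G and the two-particle test system are assumptions made purely for illustration.

```python
import math

G = 6.674e-11  # gravitational constant, N*m^2/kg^2 (rounded, assumed value)


def field_of_system(particles, r):
    """Vector sum of -G*m_i*rho_i/|rho_i|^3, where rho_i = r - r_i.

    particles is a list of (m_i, r_i) pairs with r_i a 3-tuple.
    """
    total = [0.0, 0.0, 0.0]
    for m_i, r_i in particles:
        rho = [r[k] - r_i[k] for k in range(3)]
        dist = math.sqrt(sum(c * c for c in rho))
        for k in range(3):
            total[k] -= G * m_i * rho[k] / dist**3
    return tuple(total)


def potential_energy_in_system(m, particles, r):
    """E_p = -G*m*sum(m_i/|rho_i|), with E_p = 0 at infinity."""
    total = 0.0
    for m_i, r_i in particles:
        dist = math.sqrt(sum((r[k] - r_i[k]) ** 2 for k in range(3)))
        total += m_i / dist
    return -G * m * total


# Hypothetical system: two equal particles placed symmetrically on the x-axis.
system = [(5.0, (-1.0, 0.0, 0.0)), (5.0, (1.0, 0.0, 0.0))]

g_mid = field_of_system(system, (0.0, 0.0, 0.0))  # intensities cancel here
g_off = field_of_system(system, (0.0, 1.0, 0.0))  # resultant points toward the pair
e_mid = potential_energy_in_system(1.0, system, (0.0, 0.0, 0.0))
print(g_mid, g_off, e_mid)
```

At the midpoint the two intensities cancel, while the potential energies simply add, which illustrates the difference between vectorial and algebraic addition.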

6.2.4 By virtue of the conservative nature (potentiality) of the
gravitational field (3.1.6), its energy characteristic, the potential,
can be introduced. The potential of a gravitational field is the
scalar φ, equal to the ratio of the potential energy Ep of a
particle, placed at the point being considered in the field, to the
mass m of the particle:

$$\varphi = \frac{E_p}{m}.$$


The potential φ is independent of the mass m of the particle;
it is a function of the coordinates of points of the gravitational
field. For example, the potential of a gravitational field set up
by a fixed particle of mass M is

$$\varphi = -\frac{GM}{r},$$

where r is the distance from the source of the field to the point
being considered.

The potential of a gravitational field set up by an arbitrary
system of n fixed particles is

$$\varphi = -G \sum_{i=1}^{n} \frac{m_i}{\rho_i},$$

where ρi is the distance from a particle of mass mi to the point
being considered in the field. Thus, in superposing gravitational
fields, their potentials are added together algebraically,
i.e. the potential φ at any point of a resultant field is equal to
the algebraic sum of the potentials at the same point for all
the superposed fields taken separately:

$$\varphi = \sum_{i=1}^{n} \varphi_i.$$

Note. In applying this equation it is necessary that the same
reference point be chosen for the potentials φi of all the fields
being superposed: φi(∞) = 0 (see the footnote to Sect. 6.2.2).

6.2.5 An element of work done by the forces of a gravitational
field in an infinitely small displacement dr of a particle of mass m
in this field is

$$dW = (\mathbf{F}\,d\mathbf{r}) = m\,(\mathbf{g}_f\,d\mathbf{r}).$$

But, on the other hand, the work dW is equal to the decrease
in the potential energy of the particle in the gravitational
field:

$$dW = -dE_p = -m\,d\varphi.$$

Consequently, the potential and intensity of a gravitational
field are related by the equation

$$d\varphi = -(\mathbf{g}_f\,d\mathbf{r}) = -(g_{fx}\,dx + g_{fy}\,dy + g_{fz}\,dz),$$

where gfx, gfy and gfz are the projections of vector gf on the
axes of rectangular Cartesian coordinates. Since

$$d\varphi = \frac{\partial\varphi}{\partial x}\,dx + \frac{\partial\varphi}{\partial y}\,dy + \frac{\partial\varphi}{\partial z}\,dz,$$

then

$$g_{fx} = -\frac{\partial\varphi}{\partial x}, \quad g_{fy} = -\frac{\partial\varphi}{\partial y} \quad\text{and}\quad g_{fz} = -\frac{\partial\varphi}{\partial z}.$$

Hence

$$\mathbf{g}_f = -\left(\frac{\partial\varphi}{\partial x}\,\mathbf{i} + \frac{\partial\varphi}{\partial y}\,\mathbf{j} + \frac{\partial\varphi}{\partial z}\,\mathbf{k}\right) = -\mathrm{grad}\,\varphi,$$

i.e. the intensity of a gravitational field is numerically equal
and opposite in direction to the gradient of the potential of
the field.

The relation between φ and gf can also be written in the form

$$d\varphi = -g_f\,dl\cos\alpha = -g_{fl}\,dl \quad\text{or}\quad g_{fl} = -\frac{d\varphi}{dl},$$

where α is the angle between vectors gf and dr, dl = |dr|,
and gfl is the projection of vector gf onto the direction of
vector dr. Thus, the projection of the intensity vector of a
gravitational field onto some direction is numerically equal and
opposite in sign to the change in the field potential per unit
length in the same direction.
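
The relation between intensity and potential gradient in 6.2.5 can be checked numerically. The Python sketch below differentiates the point-mass potential −GM/r by central differences and compares the result with the closed-form intensity. The value of GM, the step size and the sample point are assumptions chosen only for illustration.

```python
import math

GM = 3.986e14  # product G*M for the earth, m^3/s^2 (rounded, assumed value)


def potential(x, y, z):
    """Potential of a point mass at the origin, with zero potential at infinity."""
    return -GM / math.sqrt(x * x + y * y + z * z)


def intensity_from_potential(x, y, z, h=10.0):
    """Approximate g_f = -grad(phi) by central differences with step h (metres)."""
    gx = -(potential(x + h, y, z) - potential(x - h, y, z)) / (2 * h)
    gy = -(potential(x, y + h, z) - potential(x, y - h, z)) / (2 * h)
    gz = -(potential(x, y, z + h) - potential(x, y, z - h)) / (2 * h)
    return (gx, gy, gz)


def intensity_exact(x, y, z):
    """Closed-form intensity -GM*r/|r|^3 for comparison."""
    r = math.sqrt(x * x + y * y + z * z)
    return tuple(-GM * c / r**3 for c in (x, y, z))


point = (4.0e6, 3.0e6, 5.0e6)  # arbitrary sample point, metres
numeric = intensity_from_potential(*point)
exact = intensity_exact(*point)
print(numeric, exact)
```

The two results agree to well within the truncation error of the finite-difference step.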

6.2.6 The nonrelativistic theory of gravitation discussed above
and based on Newton’s law of universal gravitation is only
approximate. It describes with sufficient accuracy only
comparatively weak gravitational fields, having a potential |φ| ≪ c²,
where c² = 9 × 10¹⁶ m²/s² is the square of the velocity of light
in free space. It is, in particular, applicable to the gravitational
fields of the earth and sun because the absolute values of the
potentials of these fields at the surfaces of the earth and sun
are equal to 6.3 × 10⁷ and 1.9 × 10¹¹ m²/s², respectively.
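
The weak-field criterion of 6.2.6 can be checked with a few lines of Python, taking the surface potential of a spherically symmetrical body as −GM/R. The masses, radii and the values of G and c below are rounded reference values assumed for this sketch, not data from this handbook.

```python
G = 6.674e-11  # gravitational constant, N*m^2/kg^2 (rounded, assumed value)
C = 2.998e8    # velocity of light in free space, m/s (rounded, assumed value)

# (mass in kg, radius in m); rounded values assumed for illustration
BODIES = {
    "earth": (5.972e24, 6.371e6),
    "sun": (1.989e30, 6.957e8),
}


def surface_potential(M, R):
    """Potential -G*M/R at the surface of a spherically symmetrical body."""
    return -G * M / R


potentials = {name: surface_potential(M, R) for name, (M, R) in BODIES.items()}
ratios = {name: abs(phi) / C**2 for name, phi in potentials.items()}

for name in BODIES:
    print(name, potentials[name], ratios[name])
```

Both ratios |φ|/c² come out many orders of magnitude below unity, which is why the nonrelativistic theory is adequate for these fields.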

6.2.7 The modern (relativistic) theory of gravitation, which is
a unified theory of space, time and gravitation, was formulated
by Albert Einstein. He called it the general theory of relativity.
The existence of a close relationship between space and time
had already been indicated in his special theory of relativity.
This relationship is evident in the Lorentz transformations
(5.3.2) and the invariance of the interval between two events
(5.4.8). It was found necessary, in describing physical processes,
to employ a four-dimensional space-time continuum, in which
the position of a point is specified by three space coordinates
and the time coordinate ct.

According to the relativistic theory of gravitation, the
geometrical properties (metric) of the space-time continuum
depend upon the distribution of gravitating masses and their
motion in space. Bodies setting up a gravitational field “distort”
real three-dimensional space and differently change the passage
of time at various points in this space. This means that the
bodies cause a deviation of the metric of space-time from that
of the “flat” space-time described by Euclidean geometry and
dealt with in the special theory of relativity. Therefore, the
motion of a body in a gravitational field can be regarded as
inertial motion, but in a “distorted” (non-Euclidean) space-time
continuum. Accordingly, a particle, subject to the action
of a gravitational field, travels neither in a straight line nor
at uniform velocity in a real three-dimensional space.

It was shown in the relativistic theory of gravitation that the
principle of superposition (6.2.3) does not hold for arbitrary
gravitational fields. This principle, as well as the whole
nonrelativistic theory of gravitation, is sufficiently accurate only
in weak fields (|φ| ≪ c²) and for motion in these fields at low
velocities v ≪ c.


6.3  Kepler's  Laws.  Space  Velocities 

6.3.1 The motion of the planets of the solar system along
their orbits around the sun complies with the three laws of
Kepler. These laws can be derived from Newton’s law of
universal gravitation, dealing, as a first approximation, with
the sun and planets as particles. In the central force field of
gravitation of the sun, a planet of mass m is subject to the
action of the gravitational force

$$\mathbf{F} = -G\,\frac{mM_s}{r^3}\,\mathbf{r},$$

where Ms is the mass of the sun and r is the radius vector of
the planet from the centre of forces O, which is taken as the
origin of coordinates.

The moment of force F about the centre of forces is
M = [rF] = 0. Hence, the angular momentum L of the planet
about the same point O does not change with time (4.3.1):
L = [rmv] = const.

Consequently, the planet travels along a plane path (orbit),
whose plane is perpendicular to vector L. According to Sect. 1.3.6,
L = [rmvφ], where vφ is the transverse velocity of the planet.
Therefore, the orbital motion of the planet satisfies the
condition

$$r^2\,\frac{d\varphi}{dt} = \frac{L}{m} = \text{const},$$

where r and φ are the polar coordinates of the planet.

A second condition is imposed by the law of conservation
of mechanical energy: Ek + Ep = E = const. According to
1.3.6 and 6.2.2,

$$E_k = \frac{m}{2}\left[\left(\frac{dr}{dt}\right)^2 + \left(r\,\frac{d\varphi}{dt}\right)^2\right] = \frac{m}{2}\left[\left(\frac{dr}{dt}\right)^2 + \left(\frac{L}{mr}\right)^2\right]$$

and

$$E_p = -G\,\frac{mM_s}{r},$$

so that the second condition is of the form

$$\left(\frac{dr}{dt}\right)^2 + \left(\frac{L}{mr}\right)^2 - \frac{2GM_s}{r} = \frac{2E}{m}.$$
6.3.2 The equation of the path of the planet (in the polar
coordinates r and φ) is

$$r = \frac{p}{1 + e\cos\varphi},$$

where p = L²/(Gm²Ms) and e = √(2EL²/(G²m³Ms²) + 1). The
total mechanical energy of the planet E < 0, so that e < 1
and the path (orbit) of the planet is an ellipse.

Kepler’s first law states that all the planets of the solar system
travel along elliptic orbits of which the sun occupies one focus.

It follows from the first condition (6.3.1) that the areal
velocity (1.3.6) of the planet is constant:

$$\sigma = \frac{1}{2}\,r^2\,\frac{d\varphi}{dt} = \frac{L}{2m} = \text{const}.$$

Kepler’s second law states that the radius vector of a planet
sweeps out equal areas in equal lengths of time.
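
To make the orbit equation of 6.3.2 concrete, the Python sketch below computes the parameter p and eccentricity e from the energy E and angular momentum L, using p = L²/(Gm²Ms) and e = √(2EL²/(G²m³Ms²) + 1). The function names, the value of G, and the earth-like planet mass, perihelion distance and perihelion speed are assumptions introduced for the example.

```python
import math

G = 6.674e-11     # gravitational constant, N*m^2/kg^2 (rounded, assumed value)
M_SUN = 1.989e30  # mass of the sun, kg (rounded, assumed value)


def orbit_parameters(m, E, L, M=M_SUN):
    """Return (p, e) of the conic r = p/(1 + e*cos(phi)) from E and L."""
    p = L**2 / (G * m**2 * M)
    e_squared = 2 * E * L**2 / (G**2 * m**3 * M**2) + 1
    # Guard against a tiny negative value caused by rounding on circular orbits.
    e = math.sqrt(max(e_squared, 0.0))
    return p, e


def radius(p, e, phi):
    """Distance from the focus at polar angle phi."""
    return p / (1 + e * math.cos(phi))


# Hypothetical earth-like planet at perihelion (velocity perpendicular to r).
m = 5.972e24    # kg
r0 = 1.471e11   # m
v0 = 3.029e4    # m/s

L = m * r0 * v0
E = 0.5 * m * v0**2 - G * m * M_SUN / r0

p, e = orbit_parameters(m, E, L)
r_min = radius(p, e, 0.0)
r_max = radius(p, e, math.pi)
print(p, e, r_min, r_max)
```

Since E < 0 for these inputs, the eccentricity is below unity and the starting point is recovered as the smallest distance from the sun.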

6.3.3 According to Kepler’s second law, the orbital period, or
period of revolution, T of a planet about the sun is equal to
the ratio of the area A of the orbit to the areal velocity σ:

$$T = \frac{A}{\sigma} = \frac{\pi ab}{\sigma},$$

where a = p/(1 − e²) and b = a√(1 − e²) are the major and
minor semiaxes of the elliptic orbit. Thus

$$T^2 = \frac{\pi^2 p}{L^2/4m^2}\,a^3 = \frac{4\pi^2}{GM_s}\,a^3.$$

This equation is an expression of Kepler’s third law: the squares
of the periods of revolution of the planets about the sun are
proportional to the cubes of the major semiaxes of the elliptic
orbits of those planets.
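
Kepler's third law in the form T² = 4π²a³/(GMs) lends itself to a quick numerical check. The Python sketch below evaluates the period for a major semiaxis of one astronomical unit; the constants are rounded values assumed for illustration.

```python
import math

G = 6.674e-11     # gravitational constant, N*m^2/kg^2 (rounded, assumed value)
M_SUN = 1.989e30  # mass of the sun, kg (rounded, assumed value)
AU = 1.496e11     # major semiaxis of the earth's orbit, m (rounded, assumed value)


def orbital_period(a, M=M_SUN):
    """Period T = 2*pi*sqrt(a^3/(G*M)) of an elliptic orbit with major semiaxis a."""
    return 2 * math.pi * math.sqrt(a**3 / (G * M))


T_earth_days = orbital_period(AU) / 86400.0
period_ratio = orbital_period(4 * AU) / orbital_period(AU)
print(T_earth_days, period_ratio)
```

Quadrupling the major semiaxis multiplies the period by eight, in line with the proportionality of T² to a³.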

6.3.4 The orbital velocity is the minimum velocity that it is
necessary to impart to a body to convert it into an artificial
satellite of the earth. It is also called the circular orbital velocity
because it is equal to the velocity of an artificial satellite in
circular orbit about the earth in the absence of atmospheric
friction. The orbital velocity is

$$v_1 = \sqrt{\frac{GM_E}{r}},$$

where ME is the mass of the earth, and r is the radius of the
circular orbit. At the earth’s surface v1 = 7.9 km/s.

6.3.5 The escape velocity is the minimum velocity that it is
necessary to impart to a body to enable it, without any
additional forces, to overcome the earth’s gravitational attraction
and become an artificial satellite of the sun. It is also called
the parabolic velocity because it launches the body on a
parabolic orbit in the earth’s gravitational field (in the absence
of atmospheric friction). The escape velocity is

$$v_2 = \sqrt{\frac{2GM_E}{r}},$$

where ME is the mass of the earth and r is the distance from
the launching site of the body to the centre of the earth. At the
earth’s surface v2 = 11.2 km/s.

6.3.6 The solar escape velocity is the minimum velocity that
it is necessary to impart to a spacecraft, launched at the earth’s
surface, to overcome the sun’s gravitational attraction and
leave the solar system. This velocity is v3 = 16.7 km/s.
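
The values quoted in 6.3.4 and 6.3.5 for the earth's surface can be reproduced from the standard expressions √(GM/r) for the circular orbital velocity and √(2GM/r) for the escape velocity. The Python sketch below does so; the constants are rounded values assumed for the example, and the solar escape velocity of 6.3.6 is not computed because it also depends on the earth's orbital motion.

```python
import math

G = 6.674e-11       # gravitational constant, N*m^2/kg^2 (rounded, assumed value)
M_EARTH = 5.972e24  # mass of the earth, kg (rounded, assumed value)
R_EARTH = 6.371e6   # mean radius of the earth, m (rounded, assumed value)


def orbital_velocity(r, M=M_EARTH):
    """Circular orbital velocity sqrt(G*M/r) at distance r from the centre."""
    return math.sqrt(G * M / r)


def escape_velocity(r, M=M_EARTH):
    """Escape (parabolic) velocity sqrt(2*G*M/r) at distance r from the centre."""
    return math.sqrt(2 * G * M / r)


v1 = orbital_velocity(R_EARTH)
v2 = escape_velocity(R_EARTH)
print(v1 / 1000.0, v2 / 1000.0)  # in km/s
```

The two velocities differ by a factor of √2 at any given distance r.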


CHAPTER  7  MOTION  IN  NONINERTIAL  FRAMES 
OF  REFERENCE 

7.1  Kinematics  of  Relative  Motion 

7.1.1 It is assumed in classical (Newtonian) mechanics that
distances and time intervals remain unchanged in going over
from one frame of reference to any other frame that is in motion
in even the most arbitrary manner with respect to the first frame.
For example, assume that K is an inertial frame with its origin
at point O* and S is a noninertial frame with its origin at
point O (Fig. 7.1). In the general case, the motion of frame S
with respect to frame K can be regarded as the sum of two
motions: translational motion at the velocity v0 of point O
and rotation about this point at the angular velocity Ω. Radius
vectors r* and r of arbitrary particle M, measured in frames K
and S, are related by the equation

$$\mathbf{r}^* = \mathbf{r}_0^* + \mathbf{r},$$

where r0* is the radius vector of point O measured in frame of
reference K.

7.1.2 The motion of particle M with respect to any inertial
frame of reference K, conditionally assumed to be fixed, is
called the absolute motion of particle M. The motion of the
same particle with respect to the noninertial frame of
reference S is said to be relative motion.

The relative velocity vr of particle M, i.e. its velocity with
respect to frame S, equals

$$\mathbf{v}_r = \frac{dx}{dt}\,\mathbf{i} + \frac{dy}{dt}\,\mathbf{j} + \frac{dz}{dt}\,\mathbf{k},$$

where x, y and z are the Cartesian coordinates of particle M,
and i, j and k are the unit vectors of the coordinate axes in
frame S.

The absolute velocity of particle M, i.e. its velocity v in the
frame of reference K, equals

$$\mathbf{v} = \frac{d\mathbf{r}^*}{dt} = \mathbf{v}_0 + x\,\frac{d\mathbf{i}}{dt} + y\,\frac{d\mathbf{j}}{dt} + z\,\frac{d\mathbf{k}}{dt} + \mathbf{v}_r,$$

where v0 = dr0*/dt is the absolute velocity of point O. Since
the unit vectors of moving frame S can change in frame of
reference K only due to the rotation of frame S about point O
at the angular velocity Ω,

$$\frac{d\mathbf{i}}{dt} = [\boldsymbol{\Omega}\,\mathbf{i}], \quad \frac{d\mathbf{j}}{dt} = [\boldsymbol{\Omega}\,\mathbf{j}], \quad \frac{d\mathbf{k}}{dt} = [\boldsymbol{\Omega}\,\mathbf{k}]$$

and

$$\mathbf{v} = \mathbf{v}_t + \mathbf{v}_r,$$

where vt = v0 + [Ωr] is the velocity of transportation of
particle M. It is also called the bulk, or drift, velocity and is equal
to the absolute velocity of the point of moving frame of
reference S (i.e. rigidly fixed in this frame) in which particle M
is located at the given instant of time.

7.1.3 The relative acceleration ar of particle M (its acceleration
with respect to frame S) equals

$$\mathbf{a}_r = \frac{d^2x}{dt^2}\,\mathbf{i} + \frac{d^2y}{dt^2}\,\mathbf{j} + \frac{d^2z}{dt^2}\,\mathbf{k}.$$

The absolute acceleration of particle M, i.e. its acceleration
with respect to frame K, equals

$$\mathbf{a} = \frac{d\mathbf{v}}{dt} = \mathbf{a}_t + \mathbf{a}_C + \mathbf{a}_r.$$

Here

$$\mathbf{a}_t = \frac{d\mathbf{v}_0}{dt} + \left[\frac{d\boldsymbol{\Omega}}{dt}\,\mathbf{r}\right] + [\boldsymbol{\Omega}\,[\boldsymbol{\Omega}\,\mathbf{r}]]$$

is the acceleration of transportation of particle M. It is equal
to the absolute acceleration of the point in moving frame S
in which particle M is located at the given instant of time, and

$$\mathbf{a}_C = 2\,[\boldsymbol{\Omega}\,\mathbf{v}_r]$$

is the Coriolis acceleration of particle M. The Coriolis
acceleration has its maximum value if the relative velocity vr of the
particle is perpendicular to the vector Ω of angular velocity
of the moving frame of reference. It is equal to zero if the angle
between vectors vr and Ω equals zero or π, or if at least one of
these vectors equals zero.
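
The composition rules of 7.1.2 and 7.1.3 reduce to a few vector products. The Python sketch below computes the absolute velocity v0 + [Ωr] + vr and the Coriolis acceleration 2[Ωvr] for given vectors; the helper names and the sample vectors are assumptions made for illustration.

```python
def cross(a, b):
    """Vector product [a b] of two 3-vectors."""
    return (
        a[1] * b[2] - a[2] * b[1],
        a[2] * b[0] - a[0] * b[2],
        a[0] * b[1] - a[1] * b[0],
    )


def add(*vectors):
    """Componentwise sum of any number of 3-vectors."""
    return tuple(sum(components) for components in zip(*vectors))


def scale(k, a):
    """Product of scalar k and vector a."""
    return tuple(k * c for c in a)


def absolute_velocity(v0, omega, r, v_r):
    """v = v_t + v_r with the velocity of transportation v_t = v0 + [omega r]."""
    return add(v0, cross(omega, r), v_r)


def coriolis_acceleration(omega, v_r):
    """a_C = 2 [omega v_r]."""
    return scale(2.0, cross(omega, v_r))


# Hypothetical frame rotating about its z-axis; origin O at rest in frame K.
v = absolute_velocity((0.0, 0.0, 0.0), (0.0, 0.0, 1.0), (1.0, 0.0, 0.0), (0.5, 0.0, 0.0))
a_c_perpendicular = coriolis_acceleration((0.0, 0.0, 2.0), (3.0, 0.0, 0.0))
a_c_parallel = coriolis_acceleration((0.0, 0.0, 2.0), (0.0, 0.0, 3.0))
print(v, a_c_perpendicular, a_c_parallel)
```

The second and third results illustrate the two limiting cases named in 7.1.3: the Coriolis acceleration is largest when vr is perpendicular to Ω and vanishes when the two vectors are parallel.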


7.2  Inertial  Forces 

7.2.1 Newton’s laws of motion are not valid in noninertial
frames of reference. In particular, a particle can change the
state of its motion with respect to noninertial frame S without
any action being exerted on the particle by other bodies. A ball,
for instance, suspended by a string from the ceiling of a railway
coach of a train travelling at uniform velocity in a straight
line, deviates backward when the motion of the train is
accelerated and forward when the motion is decelerated, i.e. it
begins to move with respect to a noninertial frame of reference
attached to the coach. But no horizontal forces act on the ball
when it deviates.

7.2.2 The fundamental law of dynamics for a particle in
noninertial frames of reference can be derived from Newton’s second
law and the relation between the absolute and relative
accelerations of a particle. It follows from Sect. 7.1.3 that the product
of the mass m of a particle by its relative acceleration equals

$$m\mathbf{a}_r = m\mathbf{a} - m\mathbf{a}_t - m\mathbf{a}_C.$$

According to Newton’s second law, formulated to apply to the
absolute motion of a particle, i.e. its motion with respect to
inertial frame K,

$$m\mathbf{a} = \mathbf{F},$$

where F is the vector sum of all forces acting on the particle.
Hence, the fundamental equation of dynamics of a particle’s
relative motion is of the form

$$m\mathbf{a}_r = \mathbf{F} - m\mathbf{a}_t - m\mathbf{a}_C.$$

This equation can be reduced to a form similar to the
fundamental law of dynamics for the absolute motion of a particle:

$$m\mathbf{a}_r = \mathbf{F} + \mathbf{I}_t + \mathbf{I}_C.$$

The vector quantities It = −mat and IC = −maC have the
dimensionality of force and are called the inertial force of
transportation and the Coriolis inertial force, respectively.

7.2.3 It follows from Sect. 7.1.3 that in the general case the
inertial force of transportation is the sum of three terms:

$$\mathbf{I}_t = -m\,\frac{d\mathbf{v}_0}{dt} - m\left[\frac{d\boldsymbol{\Omega}}{dt}\,\mathbf{r}\right] - m\,[\boldsymbol{\Omega}\,[\boldsymbol{\Omega}\,\mathbf{r}]].$$

The last term on the right-hand side of this equation,

$$\mathbf{I}_{cf} = -m\,[\boldsymbol{\Omega}\,[\boldsymbol{\Omega}\,\mathbf{r}]],$$

is called the centrifugal force of inertia, or simply the centrifugal
force, because this vector is perpendicular to the instantaneous
axis of rotation (to vector Ω) of the noninertial frame S and
its direction is away from this axis. Numerically the
centrifugal force equals

$$I_{cf} = m\Omega^2\rho,$$

where ρ is the distance from a particle of mass m to the
instantaneous axis of rotation of the frame of reference.

The inertial force of transportation coincides with the
centrifugal force when the noninertial frame of reference travels
with translational motion at constant velocity (v0 = const)
and rotates at constant angular velocity (Ω = const).

7.2.4 The Coriolis inertial force is

$$\mathbf{I}_C = 2m\,[\mathbf{v}_r\,\boldsymbol{\Omega}].$$

This force acts on the particle only when the noninertial frame
of reference rotates and the particle moves with respect to
this frame. Thus, for example, particles of water in rivers
of the Northern Hemisphere that flow meridionally are subject
to the action of a Coriolis inertial force. This force is
perpendicular to the velocity of flow of the river and leads to
undermining of the right bank in the direction of flow.

The Coriolis inertial force does no work in the relative motion
of a particle because the force is perpendicular to the velocity
of relative motion of the particle. Hence, the Coriolis inertial
force is an example of the gyroscopic forces (3.1.7).
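
The centrifugal and Coriolis inertial forces of 7.2.3 and 7.2.4 can be evaluated directly from their vector-product definitions, −m[Ω[Ωr]] and 2m[vrΩ]. In the Python sketch below the helper names and the numerical inputs are assumptions chosen so that the results are easy to verify by hand.

```python
def cross(a, b):
    """Vector product [a b] of two 3-vectors."""
    return (
        a[1] * b[2] - a[2] * b[1],
        a[2] * b[0] - a[0] * b[2],
        a[0] * b[1] - a[1] * b[0],
    )


def dot(a, b):
    """Scalar product (a b) of two 3-vectors."""
    return sum(x * y for x, y in zip(a, b))


def scale(k, a):
    """Product of scalar k and vector a."""
    return tuple(k * c for c in a)


def centrifugal_force(m, omega, r):
    """I_cf = -m [omega [omega r]]."""
    return scale(-m, cross(omega, cross(omega, r)))


def coriolis_force(m, v_r, omega):
    """I_C = 2m [v_r omega]."""
    return scale(2.0 * m, cross(v_r, omega))


# Hypothetical data: frame rotating about its z-axis at 3 rad/s, particle of mass 2 kg.
m = 2.0
omega = (0.0, 0.0, 3.0)
r = (4.0, 0.0, 5.0)    # distance from the axis of rotation is 4 m
v_r = (1.0, 2.0, 0.0)

i_cf = centrifugal_force(m, omega, r)
i_c = coriolis_force(m, v_r, omega)
power = dot(i_c, v_r)  # rate of work of the Coriolis force in relative motion
print(i_cf, i_c, power)
```

The centrifugal force comes out directed away from the axis with magnitude mΩ²ρ = 2 · 9 · 4 = 72 N, and the scalar product of the Coriolis force with the relative velocity is zero, so that this force does no work.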

7.2.5 The action of inertial forces on a particle in a noninertial
frame of reference is real and can be measured in this frame
by, for example, a spring dynamometer. But, in contrast
to ordinary forces of interaction between bodies, it is impossible
to indicate which specific bodies act on the particle being
considered to produce the inertial forces. This feature of
inertial forces is due to the fact that the very occurrence of the
vector quantities It and IC in the fundamental equation of the
dynamics of relative motion results only from the noninertial
nature of the frame of reference used to describe the relative
motion of the particle. The addition of the inertial forces It
and IC to the force F, which characterizes the action on the
particle of all the other bodies, enables the fundamental
equation of the dynamics of relative motion to be written in a form
that resembles the expression for Newton’s second law in an
inertial frame of reference.

There can be no closed systems of bodies in noninertial frames
of reference because the inertial forces are always external
forces for any of the bodies in the system. Therefore, the laws
of conservation of momentum (2.7.1), of angular
momentum (4.4.1) and of energy (3.4.2) are not valid in noninertial
frames of reference.


7.3  Relative  Motion  in  a  Frame  of  Reference  Fixed 
to  the  Earth.  Gravity  Force  and  Weight  of  a  Body 

7.3.1 A frame of reference fixed to the earth is noninertial
for two reasons: firstly, due to the diurnal, or daily, rotation
of the earth at constant angular velocity Ω (where Ω = 2π
rad per 24 h = 7.3 × 10⁻⁵ rad/s) and, in the second place,
due to the effects of the gravitational field of the sun, moon,
planets and other astronomical bodies on the earth. This
gravitational field is practically uniform within the limits of the
earth and imparts the same acceleration of translational motion
to the terrestrial frame of reference and all bodies moving with
respect to this frame. This acceleration is a0 = dv0/dt = gf,
where gf is the intensity of the field (6.2.1).

It follows from the equations in Sects. 7.2.2 and 7.2.3 that
the equation of relative motion of a particle of mass m in a
frame of reference fixed to the earth is of the form

$$m\mathbf{a}_r = \mathbf{F} + \mathbf{F}_{gr} + \mathbf{I}_{cf} + \mathbf{I}_C,$$

where Icf and IC are the centrifugal and Coriolis inertial
forces, Fgr is the gravitational attraction of the earth for the
particle (6.1.3), and F is the sum of all the other forces acting
on the particle, with the exception of the gravitational forces.

7.3.2 The gravity force of a body is the force P, applied to
the body and equal to the vector sum of the force Fgr of
gravitational attraction of the earth for the body (6.1.3)
and the centrifugal inertial force Icf due to the diurnal
rotation of the earth (Fig. 7.2):

$$\mathbf{P} = \mathbf{F}_{gr} + \mathbf{I}_{cf},$$

i.e.

$$\mathbf{P} = -G\,\frac{mM_E}{r^3}\,\mathbf{r} - m\,[\boldsymbol{\Omega}\,[\boldsymbol{\Omega}\,\mathbf{r}]].$$

Here m and ME are the masses of the body and the earth,
r is the radius vector drawn from the centre of the earth to the
place where the body is located, Ω is the angular velocity of
diurnal rotation of the earth, and G is the gravitational
constant (6.1.1).

The gravity force causes an unsupported or unfastened body
to fall to the earth. It equals the force with which the body
presses against a horizontal support (or pulls on a vertical
suspension) due to the gravitational attraction of the earth.
It can be measured in the terrestrial frame of reference by
means of a spring dynamometer. The point of application
of the gravity force acting on a body, i.e. the point of
application of the resultant of all the gravity forces acting on all
the particles of the body, is called the centre of gravity of the
body. It coincides with its centre of mass (2.3.3).

7.3.3 The gravity force of a body is independent of its velocity
of relative motion. It is proportional to the mass m of the body
and can be presented in the form

$$\mathbf{P} = m\mathbf{g},$$

where g is the acceleration due to gravity, or free fall acceleration
(see 7.3.5). At a given place on earth, vector g is the same for
all bodies; it varies from place to place.

The gravity force of a body coincides with the gravitational
attraction of the earth for the body only at the earth’s North
and South poles because there the centrifugal inertial force
Icf = 0. The greatest difference between the gravity force and
the gravitational attraction is observed at the equator, where
the force Icf reaches its maximum value and is opposite in
direction to Fgr. Even at the equator, however, the gravity
force and the force of gravitational attraction differ by only
0.35 per cent. At all points of the earth’s surface, except the
equator and the poles, forces P and Fgr fail to coincide in
direction as well (Fig. 7.2), but the maximum angle between them
does not exceed 6 minutes of arc. The gravity force decreases
with the altitude. Near the earth’s surface this decrease is
approximately 0.034 per cent per kilometre above the surface.
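
The size of the centrifugal correction at the equator can be estimated in a few lines. The Python sketch below compares Ω²R with the gravitational attraction per unit mass GM/R² at the equatorial radius, treating the earth as a spherically symmetrical body. The constants are rounded values assumed for the example, and the angular velocity is taken as 2π rad per 24 h.

```python
import math

G = 6.674e-11        # gravitational constant, N*m^2/kg^2 (rounded, assumed value)
M_EARTH = 5.972e24   # mass of the earth, kg (rounded, assumed value)
R_EQUATOR = 6.378e6  # equatorial radius of the earth, m

OMEGA = 2 * math.pi / (24 * 3600)  # angular velocity of diurnal rotation, rad/s

attraction = G * M_EARTH / R_EQUATOR**2  # gravitational force per unit mass, m/s^2
centrifugal = OMEGA**2 * R_EQUATOR       # centrifugal force per unit mass, m/s^2
percent = 100.0 * centrifugal / attraction
print(OMEGA, attraction, centrifugal, percent)
```

Under these assumptions the centrifugal term amounts to roughly a third of one per cent of the attraction, in agreement with the figure quoted for the equator.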

7.3.4 The acceleration g near the earth’s surface varies from
9.78 m/s² at the equator to 9.83 m/s² at the North and South
poles. This is due, in the first place, to the dependence of the
centrifugal inertial force on the geographical latitude of the
location and, in the second, to the nonspherical shape of the
earth, which is slightly flattened at the poles of its axis of
rotation, having the shape of an ellipsoid of rotation, or
spheroid (the polar and equatorial radii of the earth are
Rp = 6357 km and Re = 6378 km). The standard value of free
fall acceleration, accepted in establishing a system of units
and for barometric computation, is 9.80665 m/s².

7.3.5 The free fall of a body is its motion due to the action of
the gravitational field alone.

The free fall acceleration of a body falling to the earth,
measured in a noninertial frame of reference rotating together
with the earth, can be found from the equation of motion (7.3.1)
by putting F = 0, Fgr + Icf = mg and IC = 2m [vrΩ]. Then

$$\mathbf{a}_r = \mathbf{g} + 2\,[\mathbf{v}_r\,\boldsymbol{\Omega}].$$

If vr = 0, ar = g.

Consequently, vector g is equal to the free fall acceleration
of the body, measured with respect to the terrestrial frame of
reference at the instant when the relative velocity of the body
equals zero. For this reason, vector g is called the free fall
acceleration.

If the relative velocity of a freely falling body vr ≠ 0, its
acceleration with respect to the earth is not g: g = ar + 2 [Ωvr].
At velocities vr < 680 m/s, however, the values of g and ar
differ by less than one per cent. It can therefore be assumed
in many cases that with respect to an observer on the earth
free fall of a body is caused only by the gravity force acting
on and imparting the acceleration g to the body. In a similar
manner the action of the Coriolis inertial force on a freely falling
body can be treated as a comparatively small perturbation.
Thus, for instance, a freely falling body is deviated by the
Coriolis force to the east of the direction assumed by a plumb
line, i.e. from the direction of the vector P = mg. This
deviation for a body falling freely without any initial velocity from
the height h is equal, at the latitude φ, to

$$s = \frac{2}{3}\,\Omega h \sqrt{\frac{2h}{g}}\,\cos\varphi.$$

If, for example, h = 100 m and φ = 45°, s = 1.55 cm.
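
The eastward deviation of a freely falling body can be evaluated from the standard first-order expression s = (2/3) Ω h √(2h/g) cos φ. The Python sketch below does this; the function name is hypothetical, Ω is taken as 2π rad per 24 h and g as the standard value 9.80665 m/s², both assumptions of the example.

```python
import math

OMEGA = 2 * math.pi / (24 * 3600)  # angular velocity of diurnal rotation, rad/s
G_STANDARD = 9.80665               # standard free fall acceleration, m/s^2


def eastward_deviation(h, latitude_deg, g=G_STANDARD, omega=OMEGA):
    """Deviation s = (2/3)*omega*h*sqrt(2h/g)*cos(latitude), in metres.

    h is the height of fall in metres; the body starts with zero relative velocity.
    """
    latitude = math.radians(latitude_deg)
    return (2.0 / 3.0) * omega * h * math.sqrt(2.0 * h / g) * math.cos(latitude)


s_cm = 100.0 * eastward_deviation(100.0, 45.0)  # fall from 100 m at latitude 45 degrees
s_pole_cm = 100.0 * eastward_deviation(100.0, 90.0)
print(s_cm, s_pole_cm)
```

For a fall of 100 m at a latitude of 45° the deviation is about 1.55 cm; it vanishes at the poles, where the plumb line is parallel to the axis of rotation.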

7.3.6 The weight of a body is the force Q with which it acts,
due to the earth’s gravity, on a support or suspension that
prevents it from falling freely. It is assumed that the body
and its support (or suspension) are fixed with respect to the
frame of reference in which the weight is determined. The
force −Q is exerted by the support or suspension on the body.
It follows from the fundamental equation of dynamics of
relative motion (7.2.2), where ar = vr = 0 and F = Fgr − Q,
that

$$\mathbf{Q} = \mathbf{F}_{gr} + \mathbf{I}_t,$$

where Fgr is the force of gravitational attraction of the earth
for the body, and It is the inertial force of transportation due
to the noninertial nature of the frame of reference.

Example 1. The weight of a body in a frame of reference fixed
to the earth is equal to the gravity force of the body (7.3.2):

$$\mathbf{Q} = \mathbf{F}_{gr} + \mathbf{I}_{cf} = \mathbf{P}.$$

Example 2. The weight of a body in a frame of reference
attached to a lift (elevator) which travels with translational motion
with respect to the earth with the acceleration a0:

$$\mathbf{Q} = \mathbf{P} - m\mathbf{a}_0.$$

If the lift falls freely, a0 = g, the free fall acceleration, and
the weight of the body in the lift is Q = 0, i.e. the body is
in the state of weightlessness.
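
Example 2 can be illustrated with a one-dimensional Python sketch in which all vectors are projected onto a vertical axis directed upward, so that Q = P − ma0 becomes Q = m(g − a0) with g negative. The function name and the 70 kg test mass are assumptions of the example.

```python
G_STANDARD = 9.80665  # standard free fall acceleration, m/s^2


def weight_in_lift(m, a0, g=-G_STANDARD):
    """Vertical projection (upward positive) of the weight Q = P - m*a0 = m*(g - a0)."""
    return m * (g - a0)


q_rest = weight_in_lift(70.0, 0.0)               # lift at rest or in uniform motion
q_upward = weight_in_lift(70.0, 2.0)             # lift accelerating upward at 2 m/s^2
q_free_fall = weight_in_lift(70.0, -G_STANDARD)  # lift falling freely, a0 = g
print(q_rest, q_upward, q_free_fall)
```

The negative sign of the first two results means that the weight is directed downward; its magnitude grows when the lift accelerates upward and falls to zero in free fall.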

7.3.7 Weightlessness, or zero gravity, is the state of a
mechanical system in which the external gravitational field acting
on the system does not cause any pressure of the parts against
one another or their deformation. Such a state occurs in a
mechanical system complying with the following three conditions:
(a) no other external forces, except those of the gravitational
field, act on the system; (b) the size of the system is such that
within its limits the external gravitational field can be regarded
as being uniform; and (c) the system is in translational motion.
The state of weightlessness is characteristic, for instance, of
bodies in a spaceship because, during the main part of its
path in a gravitational field, it is in free flight, i.e. coasting
with the engines turned off.


7.4  Principle  of  Equivalence 

7.4.1  The  inertial  forces  acting  on  bodies  in  a  noninertial 
frame  of  reference  are  proportional  to  the  masses  of  the  bodies 
and,  other  conditions  being  equal,  impart  the  same  relative 
accelerations  (7.1.3)  to  them.  In  other  words,  all  bodies  free 
of  external  action  travel  in  an  “inertial  force  field”,  i.e.  with 
respect  to  a  noninertial  frame  of  reference,  in  absolutely  the 
same  way,  provided  that  the  initial  conditions  of  their  motion 
are  also  the  same.  A  similar  behaviour  is  observed  for  bodies 
travelling  with  respect  to  an  inertial  frame  of  reference  and 
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subject  to  the  action  of  the  forces  of  a  gravitational  field.  At 
each  point  in  the  field  these  forces,  like  the  inertial  forces,  are 
proportional  to  the  masses  of  the  bodies  and  impart  the  same 
free  fall  acceleration  to  them.  This  acceleration  is  equal  to 
the  field  intensity  (6.2.1)  at  the  point  being  considered. 

In a noninertial frame of reference, for example, one fixed to a lift
(elevator) travelling with uniform acceleration vertically
upward at the acceleration of transportation a_0 = const, all
free bodies fall in the absence of a gravitational field with the
same relative acceleration a_r = −a_0. Free bodies behave in
exactly the same way in the same lift when it travels at uniform
velocity in a uniform gravitational field of the intensity
g = −a_0. Thus, on the basis of experiments conducted on the
free fall of bodies within a tightly closed lift, it is impossible
to establish whether the lift is travelling at uniform velocity
in a gravitational field of the intensity g = a_r (in particular,
the lift may also be at rest in this field) or whether it
is travelling with constant acceleration of transportation
a_0 = −a_r in the absence of a gravitational field.
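The indistinguishability of the two cases can be checked numerically. The Python sketch below is an illustration only: the function names, the value a_0 = 9.81 m/s^2, the lift velocity and the time grid are assumptions chosen for the example. It computes the height of a released body above the lift floor in both situations.

```python
def height_in_accelerating_lift(a0, t, h0):
    """Lift accelerates upward at a0; there is no gravitational field.

    Seen from an inertial frame, the body released at rest stays where
    it is, while the floor rises by a0 * t**2 / 2.
    """
    body = h0
    floor = 0.5 * a0 * t**2
    return body - floor


def height_in_uniform_lift(g, t, h0, v_lift):
    """Lift moves at constant velocity v_lift in a uniform field of magnitude g."""
    body = h0 + v_lift * t - 0.5 * g * t**2  # body shares the lift's velocity at release
    floor = v_lift * t
    return body - floor


a0 = 9.81                      # m/s^2, assumed value
times = [0.0, 0.1, 0.2, 0.5]   # s
case_a = [height_in_accelerating_lift(a0, t, h0=2.0) for t in times]
case_b = [height_in_uniform_lift(a0, t, h0=2.0, v_lift=3.0) for t in times]

for t, ha, hb in zip(times, case_a, case_b):
    print(f"t = {t:.1f} s   accelerating lift: {ha:.5f} m   lift in field: {hb:.5f} m")
```

In both cases the height above the floor decreases as a_0 t^2/2, so an observer inside the closed lift records the same free fall.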

7.4.2 The local equivalence principle states that the gravitational
field in a limited region of space is equivalent to an
“inertial force field” in a properly chosen noninertial frame
of reference. The region of space should be small enough so that
the gravitational field in it can be assumed to be uniform.

The  equivalence  principle  is  not  to  be  understood  as  a  state¬ 
ment  of  the  identity  of  inertial  forces  and  Newton’s  gravita¬ 
tional  forces  that  act  between  bodies.  As  a  matter  of  fact,  the 
intensity  of  a  true  gravitational  field,  set  up  by  bodies,  decreases 
as  the  bodies  recede  farther  and  farther  away,  and  approaches 
zero  at  infinity.  Gravitational  fields  “equivalent”  to  iner¬ 
tial  forces  do  not  comply  with  this  condition.  For  example, 
the  intensity  of  a  gravitational  field,  “equivalent”  to  the  centri¬ 
fugal  inertial  forces  in  a  rotating  frame  of  reference  increases 
without  limit  at  points  farther  and  farther  away  from  the  axis 
of  rotation.  The  intensity  of  a  field  “equivalent”  to  the  inertial 
forces  of  transportation  in  a  frame  of  reference  having  transla¬ 
tional  motion  is  the  same  at  all  points  of  the  field. 
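The contrast between a true field and one “equivalent” to centrifugal inertial forces can be illustrated with a short Python sketch. It assumes the standard expressions GM/r^2 for the Newtonian field of a spherical body and ω^2 r for the centrifugal field. The Earth's gravitational parameter and angular velocity are approximate values used only as an example, and the function names are hypothetical.

```python
GM_EARTH = 3.986e14   # m^3/s^2, approximate gravitational parameter of the Earth (assumed)
OMEGA = 7.292e-5      # rad/s, approximate angular velocity of the Earth's rotation (assumed)


def newtonian_intensity(r):
    """Magnitude of the true gravitational field at distance r from the centre."""
    return GM_EARTH / r**2


def centrifugal_intensity(r_axis):
    """Magnitude of the field 'equivalent' to centrifugal inertial forces."""
    return OMEGA**2 * r_axis


distances = [1.0e7, 1.0e8, 1.0e9, 1.0e10]   # m
true_field = [newtonian_intensity(r) for r in distances]
inertial_field = [centrifugal_intensity(r) for r in distances]

for r, g_true, g_cf in zip(distances, true_field, inertial_field):
    print(f"r = {r:.1e} m   true field: {g_true:.3e}   centrifugal: {g_cf:.3e}  (m/s^2)")
```

The first column of intensities falls off with distance, while the second grows in proportion to the distance from the axis.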

7.4.3 A true gravitational field, in contrast to one “equivalent”
to inertial forces, exists in both noninertial and inertial frames
of reference. No possible choice of a noninertial frame of reference
can completely eliminate a true gravitational field,
i.e. compensate for it throughout space by means of a “field
of inertial forces”. This follows in any case, even if only from
the different behaviour of “fields of inertial forces” and true
gravitational fields at infinity. A true gravitational field can
be eliminated only locally, i.e. for a small region of space
within whose limits the field can be assumed uniform, and for
a short time interval during which the field can be assumed
constant. The noninertial frame of reference corresponding to
this operation should travel with an acceleration of transportation
equal to the free fall acceleration of bodies in the region
of the true gravitational field being considered. Thus, in a spaceship
in free flight in a gravitational field, the gravitational
forces of attraction are compensated for by the inertial forces
of transportation and do not lead to the relative motion of
bodies in the ship.


PART  TWO  FUNDAMENTALS 
OF  MOLECULAR  PHYSICS 
AND  THERMODYNAMICS 


CHAPTER  8  IDEAL  GASES 

8.1 Subject Matter of Molecular Physics.
Thermal Motion

8.1.1  The  branch  of  physics  called  molecular  physics  deals  with 
the  dependence  of  the  structure  and  physical  properties  of 
bodies  on  the  nature  of  the  motion  of  and  the  interaction  be¬ 
tween  the  particles  making  up  these  bodies. 

Molecular  physics  is  based  on  the  molecular-kinetic  theory 
of  the  structure  of  matter.  According  to  this  theory  all  bodies 
consist  of  submicroscopic  particles— atoms,  molecules  or  ions — 
in  continuous  chaotic  motion,  which  is  called  thermal  motion. 
Experimental  confirmations  of  the  molecular-kinetic  theory 
are:  the  Brownian  movement  (11.7.1),  the  transport  phenomenon 
in  various  aggregate  states  of  matter  and  other  phenomena. 

8.1.2  The  molecular-kinetic  theory  of  the  structure  of  matter 
is  used  to  advantage  in  various  branches  of  physics.  A  great 
variety  of  physical  phenomena  that  proceed  due  to  the  interac¬ 
tion  and  motion  of  particles  of  matter  are  dealt  with  in  this 
theory  from  a  unified  point  of  view.  This  theory,  for  example, 
enables  the  mechanism  of  elasticity  in  solids  (40.3.1)  to  be 
understood,  reveals  the  cause  of  internal  friction  in  gases  (10.8.4) 
and  liquids,  explains  the  difference  between  real  gases  (12.1.2) 
and  ideal  gases  (8.4.1),  and  elucidates  the  mechanism  of  elec¬ 
trical  conduction  in  conductors  of  electric  current  of  various 
kinds,  and  the  electrical  and  magnetic  properties  of  matter. 

8.1.3  The  thermal  motion  of  particles  of  matter  in  various 
states  of  aggregation  is  not  the  same.  It  depends  upon  the 
forces  of  attraction  or  repulsion  between  the  atoms,  molecules 
and  ions. 

There are practically no attractive forces between the atoms
and molecules in sufficiently dilute gases. This is due to the
fact that the particles of such gases are located at distances
from one another exceeding the range of molecular action
(12.2.3). The particles of such gases travel at uniform velocity
and in a straight line until they collide with one another or
with the walls of the container. These collisions are of a random
nature. Each molecule in the volume of the gas may collide
with any of the nearby particles, changing the direction of its
motion in any arbitrary manner. The collision of a molecule
(or atom) of the gas with a wall may occur at any angle θ (where
0 ≤ θ ≤ π/2). As a result, the thermal motion of molecules
of gas is a purely random one and, on the average, the same
number of molecules are travelling at any instant of time in
any arbitrary direction within the gas.

8.1.4  Solid  crystalline  bodies  are  characterized  by  considerable 
forces  of  interaction  between  their  particles  (atoms,  molecules 
and  ions).  The  joint  action  of  the  forces  of  attraction  and  repul¬ 
sion  between  the  particles  (12.1.3)  leads  to  the  vibration  of 
the  particles  of  solids  about  their  equilibrium  positions,  which 
are  called  the  sites,  or  points,  of  the  crystal  lattice  (40.1.1). 

Intermolecular interaction and violations of periodicity in
crystals (40.1.1) make these vibrations anharmonic (28.1.3).

8.1.5 The thermal motion of molecules of a liquid is of a kind
intermediate between the two preceding types of motion (8.1.3
and 8.1.4). For a definite length of time the molecules of a liquid
vibrate about a certain equilibrium position and are in a temporarily
settled state. At the end of this time the equilibrium
position of the molecules is displaced and a new settled arrangement
is formed. Simultaneously, a slow displacement of the
molecules and their vibrations occur within small volumes
of the liquid (see also Sect. 13.2.3).


8.2  Statistical  and  Thermodynamic  Methods 
of  Investigation 

8.2.1 There is a huge number of atoms (or molecules) in any
body. For instance, in 1 cm³ of a gas whose properties are close
to those of an ideal gas there are $2.7 \times 10^{19}$ molecules at standard
conditions. In the condensed states, liquid and solid,
the amount is of the order of $10^{22}$ particles/cm³. If we assume
that each atom (or molecule) obeys Newton’s second law (2.4.1)
it is obviously quite impossible, not only to solve the differential
equations of motion of the separate particles, but
even to write these equations. Therefore, the behaviour of
a separate molecule (or atom) of a body, for example, its path
and the sequence of changes in its state, cannot be investigated
by the methods used in classical mechanics.

8.2.2  The  macroscopic  properties  of  systems  consisting  of  a  very 
great  number  of  particles  are  studied  by  the  statistical  method. 
It  is  based  on  the  application  of  probability  theory  to  definite 
models  of  the  structure  of  the  systems  being  investigated. 
The  branch  of  theoretical  physics  in  which  the  physical  prop¬ 
erties  of  systems  are  studied  by  the  statistical  method  is 
called statistical physics (statistical mechanics). Special laws,
known as statistical laws, are manifested in the joint behaviour
of large numbers of particles. Certain average values of the
physical  quantities  can  be  found  for  a  system  consisting  of 
a  great  many  particles.  These  values  characterize  the  set  of 
particles  as  a  whole.  In  gases,  for  instance,  there  are  average 
values  of  the  velocity  of  thermal  motion  of  the  molecules  (10.3.6) 
and  of  their  energy  (10.2.4).  In  a  solid  there  is  an  average  energy 
per  degree  of  freedom  of  the  vibrational  motion  of  its  particles 
(41.7.2),  etc.  All  properties  of  a  system  of  particles  are  due, 
not  only  to  the  individual  properties  of  the  particles  themselves, 
but  to  the  features  of  their  joint  motion  and  average  values  of 
their  dynamic  characteristics  (average  velocities,  average 
energies,  etc.). 

In  addition  to  statistical  laws,  there  are  also  dynamical  laws, 
describing  the  motion  of  separate  particles.  The  relationship 
between  dynamical  and  statistical  laws  is  manifested  in  the 
fact that the laws of motion of individual particles have an
effect on the description of a system of particles, which is being
investigated by the statistical method.
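How an average value becomes a stable characteristic of a large set of particles can be illustrated with a small Python experiment. This is a sketch under stated assumptions: the velocity component of each particle is drawn from a Gaussian distribution with unit spread in arbitrary units. That distribution is not introduced in this section and serves only as a convenient model, and the helper `mean_square` is a hypothetical name.

```python
import random


def mean_square(samples):
    """Average of the squares of the sampled velocity components."""
    return sum(x * x for x in samples) / len(samples)


rng = random.Random(0)   # fixed seed so that the run is reproducible
sigma = 1.0              # assumed spread of a velocity component, arbitrary units

# The average fluctuates strongly for few particles and settles for many.
for n in (10, 1_000, 100_000):
    velocities = [rng.gauss(0.0, sigma) for _ in range(n)]
    print(n, mean_square(velocities))

large_sample = [rng.gauss(0.0, sigma) for _ in range(100_000)]
large_mean_square = mean_square(large_sample)
```

For the largest samples the mean square is close to sigma squared, although the velocity of any individual particle remains unpredictable.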

8.2.3  Another  procedure  for  studying  physical  phenomena 
is  the  thermodynamic  method,  which  is  not  concerned  with 
either  the  internal  structure  of  the  substances  making  up  the 
bodies  (or  systems)  being  investigated  or  with  the  nature  of 
the  motion  of  the  separate  particles.  The  thermodynamic 
method  is  based  on  the  study  of  the  various  conversions  of 
energy  that  occur  in  a  system.  The  conditions  of  these  conver¬ 
sions  and  the  relations  between  the  various  kinds  of  energy 
enable  the  physical  properties  of  systems  to  be  studied  in 
a great variety of processes in which the systems participate.
The branch of physics in which research in the physical properties
of systems is conducted by means of the thermodynamic
method is called thermodynamics (phenomenological thermodynamics).
Thermodynamics is based on two experimentally
established principles, or laws, of thermodynamics (9.3.1 and
11.3.2), as well as the Nernst heat theorem, or third law of
thermodynamics (11.8.2).

8.3  Thermodynamic  Variables.  Equations  of  State. 
Thermodynamic  Processes 

8.3.1 Thermodynamics deals with thermodynamic systems. These
are macroscopic objects (bodies and fields) that can exchange
energy  with  one  another  as  well  as  with  the  external  medium, 
i.e.  bodies  and  fields  that  are  external  with  respect  to  the 
given  system. 

8.3.2  The  physical  quantities  introduced  to  describe  the  state 
of  a  thermodynamic  system  are  called  thermodynamic  variables, 
or  parameters  (state  variables),  of  the  system.  Usually  the  pres¬ 
sure,  specific  volume  and  temperature  are  taken  as  the  thermo¬ 
dynamic  variables. 

The pressure p is a physical quantity equal to the force acting
on unit area of a body’s surface in the direction normal to
the surface:

$$ p = \frac{dF_n}{dA}, $$

where $dF_n$ is the numerical value of the normal force acting
on an infinitely small part of the surface of the body, the area
of this part being dA.

The specific volume v is the reciprocal of the density ρ of a body:
v = 1/ρ. The specific volume of a homogeneous body is equal
to the volume of unit mass of the body.

8.3.3 The concept of temperature is of significance for equilibrium
states of a thermodynamic system (8.3.4). An equilibrium
state (state of thermodynamic equilibrium) is a state of
a system which does not change with time (steady state), the
steady nature of the state not being due to processes occurring
in the external medium. An equilibrium state is set up in
a system subject to constant external conditions and is maintained
in the system for an arbitrary length of time. The temperature
is the same in all parts of a thermodynamic system
that is in a state of thermodynamic equilibrium. If two bodies
with different temperatures come into contact, internal energy
(9.1.2) is transferred from the body at the higher temperature
to that at the lower temperature. The process ceases when the
temperatures of the two bodies become equal.

8.3.4 The temperature of an equilibrium system is a measure
of  the  intensity  of  thermal  motion  of  its  molecules  (or  atoms 
or  ions).  In  an  equilibrium  system  of  particles  that  obeys  the 
laws  of  classical  statistical  mechanics  (8.2.2),  the  average 
kinetic  energy  of  the  thermal  motion  of  the  particles  is  pro¬ 
portional  to  the  absolute  temperature  (10.2.5). 

Temperature  can  only  be  measured  indirectly,  making  use 
of  the  fact  that  many  physical  properties  of  bodies,  lending 
themselves  to  direct  measurement,  depend  on  the  temperature. 
A  change  in  the  temperature  of  a  body  leads  to  changes  in  its 
length  and  volume,  density,  electric  resistance,  elastic  prop¬ 
erties,  etc.  A  change  in  any  of  these  properties  can  be  used 
to  measure  temperature.  For  this  purpose,  it  is  necessary  for 
a  certain  body,  said  to  be  temperature-indicating,  that  the 
functional  dependence  of  the  given  property  on  the  temperature 
be  known.  Temperature  scales  based  on  temperature-indicating 
bodies  are  said  to  be  empirical. 

In the international centigrade (Celsius) scale, temperature is
measured in °C (degree centigrade or degree Celsius) and is
denoted by t. It is assumed that at a standard pressure of
$1.01325 \times 10^5$ Pa (Appendix I) the melting point of ice and the
boiling point of water are equal to 0°C and 100°C, respectively.

In  the  thermodynamic  (absolute)  temperature  scale,  the  tem¬ 
perature  is  measured  in  kelvins  (K)  and  is  denoted  by  T. 

The  relation  between  the  absolute  temperature  and  the  tem¬ 
perature  measured  by  the  Celsius  scale  is:  T  =  273.15  +  t. 

The temperature T = 0 (or t = −273.15°C) is called absolute
zero temperature (the impossibility of reaching absolute zero
is discussed in Sect. 11.8.4).
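The relation T = 273.15 + t between the two scales can be written as a pair of small Python helpers. This is a minimal sketch: the function names are hypothetical, and the check against absolute zero is a choice made for this example.

```python
ZERO_CELSIUS_IN_KELVIN = 273.15   # offset between the two scales, from T = 273.15 + t


def celsius_to_kelvin(t):
    """Convert a Celsius temperature t to the absolute temperature T."""
    if t < -ZERO_CELSIUS_IN_KELVIN:
        raise ValueError("temperature below absolute zero")
    return t + ZERO_CELSIUS_IN_KELVIN


def kelvin_to_celsius(T):
    """Convert an absolute temperature T to the Celsius temperature t."""
    if T < 0.0:
        raise ValueError("absolute temperature cannot be negative")
    return T - ZERO_CELSIUS_IN_KELVIN


print(celsius_to_kelvin(0.0))     # melting point of ice at standard pressure
print(celsius_to_kelvin(100.0))   # boiling point of water at standard pressure
print(kelvin_to_celsius(0.0))     # absolute zero on the Celsius scale
```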

8.3.5  The  state  variables  of  a  system  are  classified  as  being 
either  external  or  internal.  External  state  variables  are  physical 
quantities  depending  upon  the  position  in  space  and  various 
properties  of  bodies  that  are  external  with  respect  to  the  given 
system.  For  a  gas,  for  instance,  the  volume  V  of  the  vessel  in 
which the gas is contained is an external variable because the
volume depends upon the position of external bodies: the
walls of the vessel. For a dielectric in an electric field, the
intensity of the field, set up by certain external sources, is an
external variable. If a liquid is held in an open vessel, the
atmospheric pressure, for example, is an external variable.
The internal variables of a system are physical quantities that
depend both on the position of bodies external with respect
to the system, and on the coordinates and velocities of the
particles that make up the system. Internal variables of a gas,
for instance, are its pressure and energy because they depend
upon the coordinates and velocities of the moving molecules
and on the density of the gas.

8.3.6  The  state  variables  of  a  system  that  is  in  equilibrium 
(8.3.3)  are  not  independent.  The  internal  variables  of  such 
a  system  depend  only  on  its  external  variables  and  on  the 
temperature.  The  equilibrium  state  of  a  simple  system*  of 
given  chemical  composition  and  mass  M  is  determined  by 
specifying  two  variables:  the  volume  V  and  the  temperature  T. 
The equation of state (thermal equation of state) of a simple
system is the functional dependence of the equilibrium pressure
p in the system on the volume and temperature:

$$ p = f(V, T). $$

The equation of state is obtained experimentally in thermodynamics.
In statistical physics (8.2.2), it is derived theoretically.
This constitutes the interrelation between the statistical
(8.2.2) and thermodynamical (8.2.3) methods of investigation.

* An example of a simple system is a gas in the absence of
external fields. Mixtures of chemically homogeneous gases of
constant composition, chemically pure liquids, etc. are also
simple systems.

8.3.7 If any of the external variables of the system is changed,
the state of a thermodynamic system is changed. This is called
a thermodynamic process. A thermodynamic process is said
to be an equilibrium (quasi-static) process if the
system passes at an infinitely slow rate through a continuous
succession of infinitely close thermodynamic equilibrium
states (8.3.3). All processes that do not comply with these
conditions are said to be nonequilibrium ones. Real processes
are nonequilibrium ones because they proceed at a finite rate.
But the slower they proceed, the closer they are to equilibrium
processes.

Isoprocesses  are  thermodynamic  processes  that  proceed  in 
a  system  of  constant  mass  with  a  constant  value  of  one  of  the 
state  variables. 

An isothermal process proceeds at constant temperature
(T = const).

An isochoric (isovolumic) process proceeds at constant volume
(V = const).

An isobaric process proceeds at constant pressure (p = const).

An adiabatic process is a thermodynamic process that occurs
in a system without heat exchange (9.2.4) with external bodies
(see also Sect. 9.5.10).

8.3.8 State functions are physical quantities characterizing the
state of a system. Variation in the state functions during thermodynamic
processes is independent of the kind of process
that occurs. State functions are uniquely determined by the
values of the variables of the initial and final states of the
system. The simplest state functions of a system are its internal
energy U (9.1.2) and its entropy S (11.4.2).


8.4  Equation  of  State  of  an  Ideal  Gas 

8.4.1 An ideal, or perfect, gas is one whose molecules do not
interact with one another at a distance and are of vanishingly
small size. In real gases (12.1.2) the molecules are subject to
the forces of intermolecular interaction (12.1.3).

In collisions with one another and in striking the walls of
the vessel the molecules of an ideal gas behave as perfectly
elastic spheres (1.1.4) of diameter d (effective diameter of the
molecule), which depends upon the chemical nature of the gas.
The existence of an effective diameter d (where $d \sim 10^{-10}$ m)
signifies that forces of mutual repulsion (12.1.3) act between
the molecules. Intermolecular forces of attraction (12.1.3)
decrease drastically with an increase in the distance r between
molecules and are practically negligible at $r > R_m \sim 10^{-9}$ m.
In real gases at low densities the average distance $\langle r \rangle$ between
molecules exceeds $R_m$ and, to a good approximation, they can
be regarded as ideal. Hydrogen, helium, oxygen and nitrogen
are assumed to be ideal gases at densities corresponding to
standard conditions.

8.4.2 For a given mass of an ideal gas, the ratio of the product
of the numerical values of pressure and volume to the absolute
temperature is a constant value (Clapeyron equation):

$$ \frac{pV}{T} = C = \text{const}. $$

The numerical value of the gas constant C depends upon the
choice of the units of measurement of p, V and T.

If v is the specific volume of the gas (8.3.2) and M is its mass,
then V = Mv and the Clapeyron equation assumes the form

$$ pv = BT, $$

where B = C/M is the specific gas constant referred to unit
mass.

8.4.3 From the definition of the mole (Appendix I), it is evident
that moles of different gases contain the same number of molecules.
This is called Avogadro's number $N_A$ (Appendix II).

The molar mass μ of a gas or any body is the physical quantity
equal to the ratio of the mass of the gas (or any body) to the
number of moles N it contains: μ = M/N.

The molar mass is proportional to the relative mass of the
molecules of a gas: $\mu = 10^{-3}\, m/m_0$, where m is the mass of
a molecule of the given gas and $m_0$ is the atomic mass unit
based on the carbon scale (Appendix II). The coefficient $10^{-3}$ is
required because molar mass is measured in kg/mol in the SI
(International System of Units).

The molar volume $V_\mu$ is the physical quantity equal to the
ratio of the volume V of a gas to the number of moles N it
contains: $V_\mu = V/N$. The mass of a mole is numerically equal
to μ and therefore $V_\mu = \mu v$, where v is the specific volume
(8.3.2).

8.4.4 The equation of state for a mole of ideal gas is

$$ \frac{pV_\mu}{T} = \mu B = R \quad \text{or} \quad pV_\mu = RT. $$

Here R is the universal gas constant, being the gas constant
per mole of gas. The universal nature of R follows from Avogadro's
law, which states that moles of all ideal gases occupy
the same volume at the same pressure and the same temperature.
Under standard conditions (T = 273.15 K and
$p = 1.0132 \times 10^5$ Pa = 1 atm = 760 mmHg) a mole of any
gas has a volume of $V_\mu = 22.415 \times 10^{-3}$ m³. From this we can
calculate the numerical values of R for various systems of
units (Appendix II).
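The calculation of R from the standard conditions can be sketched in a few lines of Python. The constant names are hypothetical. The pressure is taken as 1.01325 × 10^5 Pa, the standard pressure quoted in 8.3.4, and the molar volume as 22.415 × 10^-3 m³/mol.

```python
P_STANDARD = 1.01325e5   # Pa, standard pressure (8.3.4)
T_STANDARD = 273.15      # K, standard temperature
V_MOLAR = 22.415e-3      # m^3/mol, molar volume of an ideal gas at standard conditions

# Equation of state for one mole: p * V_mu = R * T, hence R = p * V_mu / T.
R = P_STANDARD * V_MOLAR / T_STANDARD
print(f"R = {R:.4f} J/(mol K)")
```

The result is close to 8.31 J/(mol·K), the value of R in SI units.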

If a volume V of gas has the mass M kg, i.e. if it contains M/μ
moles, then $V = (M/\mu)\, V_\mu$ and the equation of state of the
gas assumes the form called the Mendeleev-Clapeyron equation:

$$ pV = \frac{M}{\mu} RT. $$
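As a worked illustration of the Mendeleev-Clapeyron equation, the Python sketch below computes the pressure of a given mass of gas. The function name is hypothetical, R = 8.314 J/(mol·K) is a rounded value, and oxygen with μ = 0.032 kg/mol is chosen purely as an example.

```python
R = 8.314   # J/(mol K), universal gas constant (rounded, assumed value)


def ideal_gas_pressure(M, mu, V, T):
    """Pressure from p V = (M / mu) R T.

    M  - mass of the gas, kg
    mu - molar mass, kg/mol
    V  - volume, m^3
    T  - absolute temperature, K
    """
    return (M / mu) * R * T / V


# One mole of oxygen (0.032 kg) in the molar volume at 273.15 K
# should give approximately the standard pressure.
p = ideal_gas_pressure(M=0.032, mu=0.032, V=22.415e-3, T=273.15)
print(f"p = {p:.0f} Pa")
```

Doubling the mass at fixed V and T doubles the pressure, as the equation requires.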

8.4.5 Boltzmann's constant k is a physical quantity equal
to the ratio of the universal gas constant R to Avogadro’s
number $N_A$, i.e. $k = R/N_A$. The values of k in various systems
of units are listed in Appendix II.

The equation of state of an ideal gas, expressed by means of
Boltzmann’s constant, is of the form

$$ p = \frac{k N_A T}{V_\mu} = k n_0 T, $$

where $n_0 = N_A/V_\mu$ is the number of gas molecules in unit
volume (molecule concentration).

At  constant  temperature,  the  pressure  of  a  gas  is  proportional 
to  the  concentration  of  its  molecules  (or  to  the  density  of  the 
gas). 
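The relation p = k n_0 T gives a quick estimate of the molecule concentration at standard conditions, which can be compared with the figure of about 2.7 × 10^19 molecules per cm³ quoted in 8.2.1. The following Python sketch uses rounded values of R and Avogadro's number as assumptions, and the variable names are hypothetical.

```python
R = 8.314            # J/(mol K), universal gas constant (rounded, assumed)
N_A = 6.022e23       # 1/mol, Avogadro's number (rounded, assumed)

k = R / N_A          # Boltzmann's constant, J/K

P_STANDARD = 1.01325e5   # Pa
T_STANDARD = 273.15      # K

# From p = k * n0 * T the concentration is n0 = p / (k * T).
n0_per_m3 = P_STANDARD / (k * T_STANDARD)
n0_per_cm3 = n0_per_m3 * 1.0e-6   # 1 m^3 = 10^6 cm^3

print(f"k  = {k:.4e} J/K")
print(f"n0 = {n0_per_cm3:.2e} molecules per cm^3")
```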


CHAPTER  9  FIRST  LAW  OF  THERMODYNAMICS 

9.1  Total  and  Internal  Energy  of  a  System 

9.1.1  An  arbitrary  thermodynamic  system  (8.3.1),  in  any 
thermodynamic  state  (8.3.3),  has  a  total  energy  E,  which  is 
the  sum  of: 

(a) the kinetic energy $E_k^{\text{mech}}$ of mechanical motion of the system
as a whole (or of its macroscopic parts);

(b) the potential energy $E_p^{\text{ext}}$ (3.3.1) of the system in all external
force fields (for instance, electromagnetic or gravitational
fields); and

(c) the internal energy U:

$$ E = E_k^{\text{mech}} + E_p^{\text{ext}} + U. $$

9.1.2 The internal energy of a body or of a thermodynamic
system (8.3.1) is the energy depending only upon the thermodynamic
state of the body (or system). For a fixed (stationary)
system, not located in any external force fields, the internal
energy coincides with the total energy. The internal energy
also coincides with the rest energy of a body (or system) (5.7.3)
and  includes  the  energy  of  all  the  types  of  internal  motion  of 
a  body  (or  system)  and  the  energy  of  interaction  of  all  the 
particles  (atoms,  molecules,  ions,  etc.)  making  up  the  body 
(or  system). 

For  example,  the  internal  energy  of  a  gas  consisting  of  poly¬ 
atomic  molecules  (ammonia,  carbon  dioxide,  etc.)  consists  of: 

(a)  kinetic  energy  of  thermal  translational  and  rotary  motion 
of  the  molecules; 

(b)  kinetic  and  potential  energy  of  vibration  of  the  atoms  and 
molecules; 

(c)  potential  energy  due  to  intermolecular  interactions; 

(d)  energy  of  the  electron  shells  of  the  atoms  and  ions; 

(e)  kinetic  energy  and  potential  energy  of  interaction  of  the 
nucleons  (42.1.1)  in  the  nuclei  of  the  atoms. 

Terms  (d)  and  (e)  do  not  ordinarily  change  in  processes  that 
proceed  at  not  very  high  temperatures  so  that  ionization  and 
excitation  play  no  essential  role.  Under  these  conditions, 
terms  (d)  and  (e)  are  not  taken  into  account  in  the  internal 
energy  balance.  Term  (c)  is  also  neglected  for  an  ideal  gas 
(8.4.1). 

9.1.3 The internal energy is a single-valued function of the
thermodynamic state of a system. The value of the internal
energy in any state is independent of the process by means of
which the system reached this state. The change in internal
energy in the transition of a system from state 1 to state 2 equals
$\Delta U = U_2 - U_1$ and is independent of the kind of transition
1 → 2 that occurs. If the system accomplishes a cycle (11.1.1)
the total change in its internal energy is equal to zero:

$$ \oint dU = 0. $$

As is well known, such a relation mathematically signifies
that an element of change in the internal energy dU is a complete
(exact) differential. This property is also possessed by the
entropy (11.4.2) and other state functions (8.3.8) (cf. Sect. 9.2.5).

9.1.4 In a system that is in a state of thermodynamic equilibrium
the internal energy depends only upon the temperature
and the external variables (8.3.5). In particular, for a simple
system (8.3.6) of constant mass M, the internal energy is a function
of the temperature T and volume V of the system (calorific
equation of state of a simple system):

$$ U = \varphi(V, T). $$

Example 1. The internal energy of an ideal gas (8.4.1) depends
only on its absolute temperature and is proportional to the
mass M of the gas:

$$ U = \int_0^T C_V\,dT + U_0 = M \left( \int_0^T c_V\,dT + u_0 \right), $$

where $C_V$ and $c_V = C_V/M$ are the heat capacity (9.5.1) and
specific heat (9.5.2) of the gas at constant volume (in an isochoric
process) (8.3.7), and $u_0 = U_0/M$ is the internal energy of unit
mass of the gas at T = 0 K. For monatomic gases at ordinary
temperatures $C_V$ is independent of T, and $U = C_V T + U_0$.

Example 2. The internal energy of a van der Waals gas (12.2.1)
is:

$$ U = \int_0^T C_V\,dT - \frac{M^2}{\mu^2}\,\frac{a}{V} + U_0, $$

where M is the mass of the gas, μ is its molar mass (8.4.3),
and a is the van der Waals coefficient (12.2.3).
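The difference between the two examples shows up in an isothermal change of volume. If the heat capacity at constant volume depends on the temperature only, the integral term cancels, so the internal energy of an ideal gas stays constant while that of a van der Waals gas changes by (M/μ)² a (1/V₁ − 1/V₂). The Python sketch below evaluates this change. The function name is hypothetical, and the coefficient a ≈ 0.364 Pa·m⁶/mol² for carbon dioxide is an approximate tabulated value used only for illustration.

```python
def vdw_isothermal_energy_change(M, mu, a, V1, V2):
    """Change of internal energy of a van der Waals gas at constant temperature.

    Assumes the heat capacity at constant volume depends on T only, so the
    integral term cancels and only the term -(M/mu)**2 * a / V contributes.
    """
    n = M / mu   # number of moles
    return n**2 * a * (1.0 / V1 - 1.0 / V2)


A_CO2 = 0.364   # Pa m^6 / mol^2, approximate van der Waals coefficient of CO2 (assumed)

# One mole of CO2 (0.044 kg) expanding isothermally from 1 litre to 2 litres.
dU = vdw_isothermal_energy_change(M=0.044, mu=0.044, a=A_CO2, V1=1.0e-3, V2=2.0e-3)
print(f"dU = {dU:.1f} J")
```

The change is positive on expansion because work is done against the intermolecular forces of attraction. For an ideal gas (a = 0) the same function returns zero.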

9.1.5 In thermodynamics, the internal energy is determined to
within a constant term $U_0$, whose value depends
upon the choice of the reference point for the quantity U,
i.e. of the state with zero internal energy. The quantity $U_0$
plays practically no role in thermodynamical calculations
in which the changes ΔU of internal energy, independent of
$U_0$, are determined (see also Sect. 8.1.2).


9.2  Heat  and  Work 

9.2.1  An  energy  exchange  between  a  thermodynamic  system 
and  external  bodies  takes  place  in  two  ways:  either  by  doing 
work  or  by  means  of  heat  exchange.*  The  amount  of  energy 
transferred to the system by external bodies in force interaction
between the system and the bodies is called work done
on the system. The amount of energy transferred to the system
by external bodies by heat transfer is called the quantity of
heat imparted to the system.**

9.2.2  If  the  thermodynamic  system  is  fixed  (stationary),  a  neces¬ 
sary  condition  to  do  work  is  displacement  of  the  external  bodies 
interacting  with  it,  i.e.  it  is  necessary  to  change  the  external 
state  variables  of  the  system  (8.3.5).  In  the  absence  of  external 
force  fields,  energy  exchange  between  the  fixed  system  and 
the  external  medium  may  take  place  by  doing  work  when  the 
volume  and  shape  of  the  system  are  changed.  According  to  the 
law  of  conservation  of  energy,  the  work  W'  done  on  the  system 
by  external  forces  is  numerically  equal  and  opposite  in  sign 
to  the  work  W  that  the  system  does  on  the  external  medium, 
i.e.  against  the  external  forces: 

W'  =  — W. 

* A discussion of the third way of energy exchange, mass transfer,
is beyond the scope of the present handbook.

** Sometimes the term “heat” is employed in place of the term
“quantity of heat” if this leads to no confusion. It should be
indicated, however, that heat, generally speaking, is to be understood
as a form of energy transfer (9.2.4).

9.2.3 Expansion work is the work done by the system in overcoming
the external pressure. An element of expansion work
$\delta W = p_{\text{ext}}\,dV$, where $p_{\text{ext}}$ is the uniformly distributed external
pressure, and dV is an element of change of the system’s
volume. If the expansion process is an equilibrium (quasi-static)
one (8.3.7), $p_{\text{ext}} = p$, where p is the pressure in the
system. Then $\delta W = p\,dV$.

The significance of the different ways used in denoting the
elements of change $\delta W$ and $dU$ is explained in Sects. 9.4.3
and 9.1.3. The work done in equilibrium expansion of a system
from volume $V_1$ to volume $V_2$ is

$$W = \int_{V_1}^{V_2} p\,dV.$$

Example. The work done by a gas in a vessel with a weightless
movable piston (Fig. 9.1). The pressure of the gas $p > 0$ and,
therefore, in its expansion ($dV > 0$) the gas performs positive
work ($\delta W > 0$). In compressing the gas ($dV < 0$ and
$\delta W < 0$), positive work is done on the gas by the forces
of external pressure. Here the gas does negative work.

The graphical representation of work is discussed in 9.4.2.
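
The expansion-work integral of 9.2.3 can be sketched numerically. The fragment below is only an illustration: the function name `expansion_work`, the midpoint rule, the constant-pressure process and all numerical values are assumptions, not taken from the handbook. It shows that the same integral gives positive work in expansion and negative work in compression.

```python
def expansion_work(pressure, v1, v2, steps=10_000):
    """Approximate W = integral of p dV from v1 to v2 (midpoint rule).

    `pressure` is a function p(V) describing an equilibrium process.
    """
    dv = (v2 - v1) / steps
    return sum(pressure(v1 + (i + 0.5) * dv) for i in range(steps)) * dv


def constant_pressure(volume):
    # Assumed illustrative process: p = 1.0e5 Pa at every volume.
    return 1.0e5


# Expansion (dV > 0): the gas does positive work.
w_expand = expansion_work(constant_pressure, 1.0e-3, 2.0e-3)
# Compression (dV < 0): the gas does negative work.
w_compress = expansion_work(constant_pressure, 2.0e-3, 1.0e-3)
print(w_expand, w_compress)
```

Any other equilibrium process can be tried by passing a different function p(V) in place of the hypothetical `constant_pressure`.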

9.2.4 Heat exchange takes place between bodies (or parts of
bodies) heated to different temperatures. There are three types
of heat exchange: convection, conduction and radiation
(radiative heat exchange).

Fig. 9.1

Convective heat exchange is the transfer of energy in the form
of heat between the unequally heated parts of gases or liquids
or between gases, liquids and solids. Convective heat exchange
in liquids takes place by motion of parts of the liquid with
respect to one another or with respect to solids. For instance,
in a radiator of a hot-water heating system, energy of the hot
water flowing through the radiator is transferred by convection
to the less heated walls of the radiator.

The  phenomenon  of  heat  conduction  consists  in  the  transfer  of 
energy  in  the  form  of  heat  from  one  part  of  a  nonuniformly 
heated  body  to  another  part.  Thus,  for  example,  energy  is 
transferred  through  the  walls  of  a  heating  radiator  from  the 
more  heated  inner  surfaces  to  the  less  heated  outer  sur¬ 
faces. 

Radiative  heat  exchange  takes  place  without  direct  contact  of 
the bodies exchanging energy and consists in the emission and
absorption of the energy of an electromagnetic field by the
bodies.  An  immense  amount  of  energy  is  delivered  from  the 
sun  to  the  earth’s  surface  by  radiative  heat  exchange. 

9.2.5  Work  and  heat  are  the  energy  characteristics  of  processes 
for  changing  the  state  of  thermodynamic  systems  and  are 
meaningful  only  when  such  processes  occur.  Depending  upon 
the  kind  of  process  the  system  undergoes  in  its  transition  from 
state  1  to  state  2,  it  is  necessary  to  perform  different  amounts 
of  work  and  to  deliver  different  quantities  of  heat  to  the  system. 
A  comparison  with  Sect.  8.1.3  (on  the  change  in  the  internal 
energy  in  processes)  indicates  that  work  and  heat  are  not  forms 
of  energy  and  it  is  incorrect  to  speak  of  a  “stockpile  of  work” 
or a “reserve of heat” in a body. For the same reasons, an element
of the quantity of heat $\delta Q$ and an element of work $\delta W$
are not complete differentials (cf. Sect. 9.1.3).

9.2.6  The  performance  of  work  on  a  system  can  change  any 
type  of  its  energy.  For  example,  when  a  gas  is  rapidly  com¬ 
pressed  in  a  vessel  by  means  of  a  moving  piston  (9.2.3),  the 
work  done  by  external  forces  on  the  gas  increases  the  internal 
energy  of  the  gas.  In  the  inelastic  collision  of  two  bodies  (3.5.3), 
a  part  of  the  work  is  expended  to  change  the  kinetic  energy 
of  the  bodies  (3.2.2)  and  a  part  to  change  their  internal  energy. 

If  energy  is  delivered  to  a  system  in  the  form  of  heat,  it  is 
used  only  to  increase  the  internal  energy  of  the  system.  This 
is  due  to  the  fact  that  in  any  kind  of  heat  exchange  (9.2.4), 
energy  is  transferred  directly  between  chaotically  moving 
particles  of  the  bodies.  This  changes  their  internal  energy. 
In  the  process  of  heat  conduction,  for  instance,  in  a  nonuniform- 
ly  heated  rigid  body  the  particles  of  the  body  located  in  its 
more  heated  parts  transmit  some  of  their  energy  to  particles 
located in its less heated parts. The result is an equalization
of  the  internal  energies  of  the  various  parts  of  the  body,  an 
equalization  of  their  temperatures  and  the  discontinuation 
of  the  heat  conduction  process. 

It  follows  from  the  aforesaid  that  there  is  a  qualitative  dif¬ 
ference  and  lack  of  equivalence  between  work  and  heat  as 
a  form  of  energy  transmission.  Frequently,  these  two  forms 
of  transmitting  energy  exist  simultaneously.  For  example,  in 
heating  a  gas  in  a  vessel  with  a  moving  piston,  the  volume  of 
the  gas  is  increased  and  work  is  done  against  the  external  pres¬ 
sure  simultaneously. 

9.2.7 A thermodynamical system is said to be closed (or isolated)
if there is no exchange of energy in any form between it and
the external medium. Such systems comply with the law of
conservation of energy: the total energy of the system remains
unchanged regardless of the processes that occur within it.

9.2.8 A thermodynamical system is said to be adiabatically
isolated (i.e. thermally isolated) if no heat exchange takes place
between  it  and  the  external  medium.  Such  a  system  can  per¬ 
form  work  on  external  bodies.  Moreover,  external  forces  can 
do  work  on  the  system.  An  example  of  such  a  system  is  a  cylin¬ 
der  with  a  moving  piston,  filled  with  gas  and  surrounded  all 
over  by  a  dense  layer  of  heatproof  felt.  The  absence  of  heat 
exchange  with  the  external  medium  does  not  exclude  the  possi¬ 
bility  of  the  gas  doing  expansion  work  (9.2.3),  or  the  external 
pressure  doing  compression  work  on  the  gas. 

The properties of a system approximate those of an adiabatically
isolated one if the change in its state is so rapid that
there is no time, during the process, for heat exchange with
the  external  medium.  (For  example,  rapid  expansion  of  a  com¬ 
pressed  gas  released  from  a  gas  cylinder  when  the  outlet  valve 
is  opened  for  a  short  time.) 


9.3  First  Law  of  Thermodynamics 

9.3.1 The first law of thermodynamics states that the change
in the internal energy $\Delta U_{1-2}$ of a system that occurs in the
process $1 \to 2$ of transition of the system from state 1 to state 2
is equal to the sum of the work $W'_{1-2}$ done by external forces
on the system and the quantity of heat $Q_{1-2}$ transmitted to the
system:

$$\Delta U_{1-2} = W'_{1-2} + Q_{1-2},$$

where $W'_{1-2} = -W_{1-2}$, and $W_{1-2}$ is the work done by the
system to overcome external forces in the process $1 \to 2$. Hence

$$Q_{1-2} = \Delta U_{1-2} + W_{1-2}.$$

The quantity of heat transferred to a system is expended in
changing the internal energy of the system and in the work
done by the system to overcome the external forces.

For an element of the quantity of heat $\delta Q$, an element of work
$\delta W$ and an infinitely small change $dU$ of the internal energy,
the first law of thermodynamics assumes the form

$$\delta Q = dU + \delta W.$$

9.3.2 If $\delta Q > 0$, then heat is being transferred to the system;
if $\delta Q < 0$, then heat is being removed from the system. In the
finite process $1 \to 2$, elements of the quantity of heat can have
both signs, and the total quantity of heat $Q_{1-2}$ in the process
$1 \to 2$ is equal to the algebraic sum of the quantities of heat
transferred during all the parts of this process:

$$Q_{1-2} = \int_1^2 \delta Q.$$

It is customary to consider $\delta W > 0$ when the system performs
work on external bodies, and $\delta W < 0$ when work is performed
by external bodies on the system. The work $W_{1-2}$ done by the
system in the finite process $1 \to 2$ is equal to the algebraic sum
of the amounts of work $\delta W$ performed by the system during
all the parts of this process:

$$W_{1-2} = \int_1^2 \delta W.$$

9.3.3 If the system, for instance the working medium (11.1.1)
in a periodic-action engine, completes a cycle $1 \to 1$ (11.1.1),
then $\Delta U_{1-1} = 0$ and $W_{1-1} = Q_{1-1}$. It is impossible to build
an operating engine that could perform more work than the
energy transferred to the system from outside. Such an engine
is called a perpetual motion machine of the first kind. The
impossibility of constructing a perpetual motion machine of the
first kind is also a statement of the first law of thermodynamics.
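
The sign conventions of 9.3.1 to 9.3.3 can be checked with a small bookkeeping sketch. The function name `internal_energy_change` and the three-step cycle with its numerical values are hypothetical; the sketch only illustrates that, when the internal energy returns to its initial value, the net work done by the system equals the net heat transferred to it.

```python
def internal_energy_change(q_to_system, w_by_system):
    """First law with the handbook's signs: delta U = Q - W."""
    return q_to_system - w_by_system


# Hypothetical cycle: (heat given to the system, work done by the system),
# both in joules, for three consecutive parts of the process.
cycle_steps = [(500.0, 200.0), (-100.0, 150.0), (-150.0, -100.0)]

delta_u_total = sum(internal_energy_change(q, w) for q, w in cycle_steps)
q_total = sum(q for q, _ in cycle_steps)
w_total = sum(w for _, w in cycle_steps)
print(delta_u_total, q_total, w_total)
```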

9.4  Graphical  Representation  of  Thermodynamic 
Processes  and  Work 

9.4.1 The equation of state (8.3.B) of a thermodynamic system
enables the value of any state variable, for example $p$, to be
determined from any values of two other variables, such as $V$
and $T$. Therefore, thermodynamic processes can be graphically
represented in various two-dimensional coordinate systems.
In addition to the most extensively used $pV$ diagrams, $pT$
and $VT$ diagrams are also used.

In Fig. 9.2, the thermodynamic process in the $pV$ diagram
is represented by the curve $C_1C_2$, and the points $C_1$ $(p_1, V_1)$
and $C_2$ $(p_2, V_2)$ indicate the initial and final states of the
thermodynamic system.

Only  equilibrium  processes  (8.3.7)  can  be  represented  graphi¬ 
cally.  State  variables  for  the  whole  body  (or  system)  cannot 
be  specified  for  nonequilibrium  processes  (8.3.7)  because  they 
differ  at  different  parts  of  the  body  (or  system).  Hence  such 
graphical  representation  of  nonequilibrium  processes  is  im¬ 
possible. 

9.4.2 An element of work $\delta W$ done by a system (9.2.3) in an
equilibrium process (8.3.7) can be measured by the area of the
trapezoid having one curvilinear side, hatched in Fig. 9.2.
The work $W_{1-2}$ done by the system in the process $C_1C_2$ is equal to

$$W_{1-2} = \int_{V_1}^{V_2} p\,dV$$

and is measured by the area that is bounded by the curve
representing process $C_1C_2$, the axis of abscissas and the ordinates
$p_1$ and $p_2$ of points $C_1$ and $C_2$. The amount of work $W_{1-2}$ done
depends on how the system passes from state $C_1$ to state $C_2$,
i.e. upon the kind of process $C_1C_2$ the system undergoes in
changing its state. In the $pV$ diagram of Fig. 9.3 the amounts
of work done by the system in the processes $C_1L_1C_2$, $C_1L_2C_2$
and $C_1L_3C_2$, and equal to $W_{L_1}$, $W_{L_2}$ and $W_{L_3}$, respectively,
are measured by three areas that differ in size:
$W_{L_1} > W_{L_2} > W_{L_3}$.

After the system has completed the cycle $C_1L_1C_2L_3C_1$ (11.1.1),
the total amount of work $W_{C_1-C_1}$ is not equal to zero. The
positive expansion work in process $C_1L_1C_2$ exceeds the negative
work done in the compression process $C_2L_3C_1$. The resultant
positive work is measured by the hatched area in Fig. 9.3.

9.4.3 The work $W$ and the quantity of heat $Q$ are not state
functions (8.3.8). In various processes $1 \to 2$, in which the
state of a system is changed, various quantities of heat are
transferred to the system, and various amounts of work are
done. Elements of the changes in the physical quantities $\delta Q$
and $\delta W$ are not complete differentials (9.1.3).
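
That work depends on the path can be illustrated with the simplest possible case. In the sketch below the two paths are built from isobaric and isochoric segments (an assumption made for easy arithmetic; the curves of Fig. 9.3 are not reproduced), and the states, the function name `isobaric_work` and all values are hypothetical.

```python
def isobaric_work(p, v_start, v_end):
    """Work done by the system at constant pressure: W = p (V_end - V_start)."""
    return p * (v_end - v_start)


# Assumed initial and final states C1 = (p1, V1) and C2 = (p2, V2), SI units.
p1, v1 = 3.0e5, 1.0e-3
p2, v2 = 1.0e5, 3.0e-3

# Path A: isobaric expansion at p1, then isochoric pressure drop (no work).
w_path_a = isobaric_work(p1, v1, v2)
# Path B: isochoric pressure drop (no work), then isobaric expansion at p2.
w_path_b = isobaric_work(p2, v1, v2)
# Cycle: from C1 to C2 along path A, back to C1 along path B reversed.
w_cycle = w_path_a - w_path_b
print(w_path_a, w_path_b, w_cycle)
```

The two paths join the same states but give different amounts of work, and the work over the cycle equals the area of the rectangle enclosed between them.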


9.5  Heat  Capacity  of  Matter.  Applying  the  First  Law 
of  Thermodynamics  to  Isoprocesses  in  an  Ideal  Gas 

9.5.1 The heat capacity $C$ of a body is a physical quantity,
numerically equal to the ratio of the quantity of heat $\delta Q$
transferred to the body to the change $dT$ in the temperature of the
body in the thermodynamic process being considered:

$$C = \frac{\delta Q}{dT}.$$

The value of $C$ depends upon the mass of the body, its chemical
composition, thermodynamic state and the process in which
the heat $\delta Q$ is transferred to it.

9.5.2 The specific heat $c$ is the heat capacity per unit mass of
a substance. For a homogeneous body $c = C/M$, where $M$ is
the mass of the body.

The molar heat capacity $C_\mu$ is the heat capacity of a single mole
(Appendix I) of the substance: $C_\mu = \mu c$, where $\mu$ is the molar
mass (8.4.3) of the substance.

9.5.3 The element of the quantity of heat $\delta Q$ that must be
transferred to a body to change its temperature from $T$ to
$T + dT$ is

$$\delta Q = C\,dT.$$

For a homogeneous body $\delta Q = Mc\,dT = (M/\mu)\,C_\mu\,dT$, where
$M$ is the mass of the body, $\mu$ is its molar mass and $M/\mu$ is the
number of moles in the body.

9.5.4 For equilibrium (8.3.7) isoprocesses in gases (8.3.7), the
first law of thermodynamics (9.3.1) is of the form

$$\frac{M}{\mu} C_\mu\,dT = dU + p\,dV.$$

In the isochoric process (8.3.7) of heating or cooling a gas
(straight lines 1-2 and 1-3 in Fig. 9.4), no element of work
$\delta W = p\,dV$ is done (because $dV = 0$). The whole quantity of
heat $\delta Q$ transferred to the gas is expended in changing its
internal energy: $\delta Q = dU$. If $C_{V\mu}$ is the molar heat capacity
at constant volume,

$$dU = \frac{M}{\mu} C_{V\mu}\,dT. \qquad (*)$$

Fig. 9.4

Within a definite temperature range it can be assumed that
$C_{V\mu} \approx \text{const}$ (for greater detail see Sect. 10.7.4) and that the
change in the internal energy of the gas $\Delta U_{1-2}$, as its
temperature is changed from $T_1$ to $T_2$, is equal to
$\Delta U_{1-2} = U_2 - U_1 = (M/\mu)\,C_{V\mu}\,(T_2 - T_1)$ and is due to the
quantity of heat $Q_{1-2}$ transferred to the gas in the isochoric process:

$$Q_{1-2} = \frac{M}{\mu} C_{V\mu}\,(T_2 - T_1). \qquad (**)$$

At $T_2 > T_1$, $Q_{1-2} > 0$ and a definite quantity of heat is
transferred to the gas; at $T_2 < T_1$, $Q_{1-2} < 0$ and a definite
quantity of heat is conducted away from the gas.

9.5.5 For an ideal gas, equations (*) and (**) express the change
in its internal energy in any process for changing the state of
the gas in the temperature range $(T_2 - T_1)$. The internal energy
of  an  ideal  gas  depends  only 
on  its  chemical  composition, 
mass  and  temperature. 

In  real  gases  (12.1.2)  the  inter¬ 
nal  energy  includes  the  poten¬ 
tial  energy  of  interaction  be¬ 
tween  the  molecules  (9.1.2), 
which  depends  upon  the  dis¬ 
tance  between  them.  This  part 
of  the  internal  energy  of  a 
real  gas  is  changed  with  changes 
in  its  volume.  Therefore, 
equations  (*)  and  (**)  express 
the  change  in  the  internal 
energy  of  a  real  gas  only  when  it  is  heated  or  cooled  in  an 
isochoric  (constant-volume)  process. 

9.5.6 In an arbitrary equilibrium process (8.3.7) that takes
place in an ideal gas, the first law of thermodynamics assumes
the form

$$\frac{M}{\mu} C_\mu\,dT = \frac{M}{\mu} C_{V\mu}\,dT + p\,dV,$$

where $C_\mu$ is the molar heat capacity of an ideal gas in the given
process. The isobaric process (8.3.7) of heating (straight line
1-2 in Fig. 9.5) or cooling (straight line 1-3 in Fig. 9.5) a gas
can be carried out, for instance, in a vessel (cylinder) closed by
a movable piston subject to a constant external pressure. An
element of work $\delta W$ done by an ideal gas in an isobaric process is

$$\delta W = p\,dV = \frac{M}{\mu} R\,dT,$$

in which use is made of the expression $dV = \frac{M}{\mu}\frac{R}{p}\,dT$ from
the Mendeleev-Clapeyron equation (8.4.4) at $p = \text{const}$.

Fig. 9.5

The universal gas constant $R$ (8.4.4) is numerically equal
to the work done by one mole of an ideal gas when it is heated
isobarically by one degree:

$$R = \frac{\delta W}{(M/\mu)\,dT}.$$

The work $W_{1-2}$ done by the gas in isobaric expansion 1-2 is

$$W_{1-2} = \int_{V_1}^{V_2} p\,dV = p\,(V_2 - V_1).$$

It is represented by the hatched area in Fig. 9.5. For an ideal
gas the work $W_{1-2}$ is also equal to

$$W_{1-2} = \frac{M}{\mu} R\,(T_2 - T_1).$$

9.5.7 The element of the quantity of heat $\delta Q$ transferred to
the gas in the isobaric process is

$$\delta Q = \frac{M}{\mu} C_{p\mu}\,dT,$$

where $C_{p\mu}$ is the molar heat capacity at constant pressure.
If the quantity $C_{p\mu}$ can be assumed constant in the temperature
range $(T_2 - T_1)$, then the quantity of heat $Q_{1-2}$ transferred to
(or conducted away from) the gas is

$$Q_{1-2} = \frac{M}{\mu} C_{p\mu}\,(T_2 - T_1).$$

9.5.8 The molar heat capacities $C_{p\mu}$ and $C_{V\mu}$ of a gas are
related by the Mayer equation

$$C_{p\mu} - C_{V\mu} = R.$$

For specific heats $c_p$ and $c_V$, the Mayer relation is of the form

$$c_p - c_V = R/\mu,$$

where $\mu$ is the molar mass (8.4.3) of the gas.

For the heat capacities $C_p$ and $C_V$

$$C_p - C_V = \frac{M}{\mu} R,$$

where $M$ is the mass of the gas and $M/\mu$ is the number of moles
it contains.

The Mayer relation indicates that in heating a gas isobarically
by one degree a greater quantity of heat must be transferred
to the gas than in isochoric heating by the same amount.
This difference between the quantities of heat should be
equal to the work done by the gas in its isobaric expansion.
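
The statement that the extra heat needed in isobaric heating equals the isobaric expansion work can be checked numerically. The sketch below assumes an ideal gas with $C_{V\mu} = 3R/2$ (the value for a monatomic gas, chosen only for illustration), two moles of gas and a rounded value of $R$; the function names are hypothetical.

```python
R = 8.314  # J/(mol*K), universal gas constant (rounded)


def heat_isochoric(n_moles, c_v_molar, t1, t2):
    """Q = (M/mu) C_V (T2 - T1) at constant volume."""
    return n_moles * c_v_molar * (t2 - t1)


def heat_isobaric(n_moles, c_v_molar, t1, t2):
    """Q = (M/mu) C_p (T2 - T1), with C_p from the Mayer equation."""
    c_p_molar = c_v_molar + R
    return n_moles * c_p_molar * (t2 - t1)


def work_isobaric(n_moles, t1, t2):
    """W = (M/mu) R (T2 - T1) for an ideal gas at constant pressure."""
    return n_moles * R * (t2 - t1)


n = 2.0            # number of moles M/mu (assumed)
c_v = 1.5 * R      # assumed: monatomic ideal gas
q_v = heat_isochoric(n, c_v, 300.0, 301.0)
q_p = heat_isobaric(n, c_v, 300.0, 301.0)
w_p = work_isobaric(n, 300.0, 301.0)
print(q_v, q_p, q_p - q_v, w_p)
```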

9.5.9 The isothermal process (8.3.7) of expanding (or
compressing) a gas takes place in heat exchange (9.2.4) between
the gas and the external medium at a constant temperature
difference. For this to be feasible, the heat capacity (9.5.1)
of the external medium should be sufficiently great and
the process of expansion (or compression) should proceed
extremely slowly. First-order phase transitions (12.3.3), such as
boiling and condensation, are isothermal processes if they
proceed at constant external pressure.

Fig. 9.6

In an isothermal process in an ideal gas the internal energy
of the gas remains unchanged (9.5.5) and the whole quantity
of heat $Q_{1-2}$ transferred to the gas is expended in the work
$W_{1-2}$ done by the gas to overcome the external forces:

$$Q_{1-2} = W_{1-2} = \int_{V_1}^{V_2} p\,dV = \frac{M}{\mu} RT \ln\frac{V_2}{V_1}.$$

Here $M/\mu$ is the number of moles contained in mass $M$ of the
gas, $T$ is the constant temperature of the gas, and $V_1$ and $V_2$
are the initial and final volumes of the gas. If the gas
isothermally expands ($V_2 > V_1$), heat is being transferred to it
($Q_{1-2} > 0$) and it is doing positive work ($W_{1-2} > 0$) represented
by the hatched area in Fig. 9.6. In isothermal compression of
the gas (process 1-3 in Fig. 9.6), the work done by the gas is
negative ($W_{1-3} < 0$). Positive work ($W'_{1-3} = -W_{1-3} > 0$) is
being done by the external forces to compress the gas. In
compression, a certain quantity of heat is conducted away from the
gas ($Q_{1-3} < 0$). The heat capacity of a substance in an
isothermal process equals infinity ($dT = 0$ and $\delta Q \neq 0$).
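
A short numerical sketch of the isothermal work formula follows. The function name `isothermal_work`, the rounded value of $R$ and the chosen amounts (one mole, 300 K, doubling or halving of the volume) are assumptions for illustration only.

```python
import math

R = 8.314  # J/(mol*K), universal gas constant (rounded)


def isothermal_work(n_moles, temperature, v1, v2):
    """W = (M/mu) R T ln(V2/V1) for an ideal gas at constant temperature."""
    return n_moles * R * temperature * math.log(v2 / v1)


# Expansion 1-2: the volume doubles, the gas does positive work.
w_12 = isothermal_work(1.0, 300.0, 1.0e-3, 2.0e-3)
# Compression 1-3: the volume is halved, the gas does negative work.
w_13 = isothermal_work(1.0, 300.0, 1.0e-3, 0.5e-3)
# The internal energy does not change, so Q equals W in each process.
q_12, q_13 = w_12, w_13
print(w_12, w_13)
```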

9.5.10 An adiabatic process (8.3.7) proceeds under the
condition that $\delta Q = 0$. It is essential that the condition $Q = 0$
does not define this process, because it does not specify the
absence of heat exchange with the external medium. It only
indicates that the algebraic sum of the quantities of heat
transferred to and conducted away from the gas in various parts
of the process equals zero. In an adiabatic process, work is
done by an ideal gas at the expense of its internal energy:

$$\delta W = -dU = -\frac{M}{\mu} C_{V\mu}\,dT,$$

where $C_{V\mu}$ is the molar heat capacity of the gas at constant
volume, $M/\mu$ is the number of moles contained in mass $M$ of
the gas, and $dT$ is an element of change in the temperature
of the gas. If the gas adiabatically expands ($\delta W = p\,dV > 0$),
it is cooled ($dT < 0$). In adiabatic compression the gas is
heated: $\delta W = p\,dV < 0$ and $dT > 0$.

9.5.11 The adiabatic, or Poisson, equation, $pV^\kappa = \text{const}$, is
valid for an equilibrium adiabatic process (8.3.7).

Making use of the Mendeleev-Clapeyron equation (8.4.4), the
relation between $p$ and $T$, and also between $V$ and $T$, in an
adiabatic process can be found from the adiabatic equation:

$$p\,T^{\kappa/(1-\kappa)} = \text{const} \quad \text{and} \quad V\,T^{1/(\kappa-1)} = \text{const}.$$

In these equations the dimensionless quantity
$\kappa = C_{p\mu}/C_{V\mu} = c_p/c_V > 1$ is the adiabatic exponent. The full
curve in Fig. 9.7, called an adiabat or adiabatic curve, represents
an adiabatic process in a $pV$ diagram. The dashed curve on this
diagram, called an isotherm, represents an isothermal process
at the temperature corresponding to the initial state 1 of the
gas. In an adiabatic process the pressure varies with the volume
more rapidly than in an isothermal process (the curve is
steeper). In adiabatic expansion the temperature of the gas falls
and its pressure decreases faster than in the corresponding
isothermal expansion. In adiabatic compression of the gas its
pressure increases at a higher rate than in the corresponding
isothermal compression. The reason is that the increased pressure
is due both to the decrease in the volume of the gas and to the
increase in its temperature.

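9.5.12 The work $W_{1-2}$ done by the gas in the adiabatic process
$1 \to 2$ is measured by the hatched area in Fig. 9.7.

The amount of work $W_{1-2}$ done in an adiabatic process is
expressed by

$$W_{1-2} = -\frac{M}{\mu} C_{V\mu} \int_{T_1}^{T_2} dT = \frac{M}{\mu} C_{V\mu}\,(T_1 - T_2),$$

$$W_{1-2} = \frac{M}{\mu} \frac{R}{\kappa - 1}\,(T_1 - T_2),$$

$$W_{1-2} = \frac{p_1 V_1}{\kappa - 1} \left(1 - \frac{T_2}{T_1}\right),$$

$$W_{1-2} = \frac{p_1 V_1}{\kappa - 1} \left[1 - \left(\frac{V_1}{V_2}\right)^{\kappa - 1}\right].$$

In these equations $M/\mu$ is the number of moles contained in
mass $M$ of the gas, $C_{V\mu}$ is the molar heat capacity at constant
volume, $\kappa$ is the adiabatic exponent, and $p$, $V$ and $T$ are the
state variables of the gas in the states 1 and 2 as indicated.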
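
The expressions for adiabatic work are equivalent for an ideal gas, which can be confirmed with a brief numerical sketch. The values below (one mole, $\kappa = 5/3$ as for a monatomic gas, a doubling of the volume) and the variable names are illustrative assumptions. The relation $C_{V\mu} = R/(\kappa - 1)$ used in the code follows from the Mayer equation together with $\kappa = C_{p\mu}/C_{V\mu}$.

```python
R = 8.314  # J/(mol*K), universal gas constant (rounded)

n_moles = 1.0            # number of moles M/mu (assumed)
kappa = 5.0 / 3.0        # adiabatic exponent, assumed monatomic ideal gas
t1 = 300.0               # initial temperature, K (assumed)
v1, v2 = 1.0e-3, 2.0e-3  # initial and final volumes, m^3 (assumed)

p1 = n_moles * R * t1 / v1               # Mendeleev-Clapeyron equation
p2 = p1 * (v1 / v2) ** kappa             # Poisson equation p V^kappa = const
t2 = t1 * (v1 / v2) ** (kappa - 1.0)     # from V T^(1/(kappa-1)) = const

c_v_molar = R / (kappa - 1.0)            # Mayer equation with kappa = Cp/Cv

# Three ways of writing the work done by the gas in the process 1 -> 2.
w_from_temperatures = n_moles * c_v_molar * (t1 - t2)
w_from_p1v1 = p1 * v1 / (kappa - 1.0) * (1.0 - t2 / t1)
w_from_volumes = p1 * v1 / (kappa - 1.0) * (1.0 - (v1 / v2) ** (kappa - 1.0))
print(t2, p2, w_from_temperatures, w_from_p1v1, w_from_volumes)
```

The final state also satisfies the equation of state, $p_2 V_2 = (M/\mu) R T_2$, and the gas cools in expansion, as stated in 9.5.10.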

9.5.13 Table 9.1 lists summarized information on the
isoprocesses in gases.


CHAPTER 10 KINETIC THEORY OF GASES


10.1  Certain  Information  on  Classical 
Statistical  Physics 

10.1.1  The  kinetic  theory  of  gases  deals  with  the  structure  and 
physical  properties  of  gases  and  is  based  on  the  statistical 
method  of  investigation  (8.2.2).  Besides  those  mentioned  in 
Sect.  (8.2.1),  the  following  propositions  form  the  foundation 
for  classical  statistical  physics. 

10.1.2  A  system  of  particles  complies  with  the  laws  of  conser¬ 
vation  of  energy  (9.2.7),  momentum  (2.7.1)  and  of  angular 
momentum  (4.4.1).  A  system  of  charged  particles  complies 
with  the  law  of  conservation  of  electric  charge  (14.1.3). 

10.1.3  All  physical  processes  in  a  system  of  particles  proceed 
continuously  in  space  and  time.  A  space-time  description  of  any 
physical  phenomena  in  classical  mechanics  and  classical  sta¬ 
tistical  physics  assumes  the  possibility  of  continuous  variation 
of  all  physical  quantities  that  characterize  the  state  of  the 
system.  For  instance,  the  velocity  and  energy  of  any  particle 
can  vary  continuously  due  to  the  action  of  various  forces. 

10.1.4  Each  particle  in  a  system  is  regarded  as  being  “tagged”. 
It can be distinguished from all other such particles
(distinguishability of identical particles in classical statistical physics).

10.1.5  Any  particle  of  a  system  can  have  arbitrary  values  for 
its  coordinates  and  momenta  (or  velocities)  independently 
of  the  values  of  these  quantities  for  other  particles  of  the  system. 
If  an  infinitely  small  volume  dx  dy  dz  is  chosen  in  the  coor¬ 
dinate  system  XYZ ,  any  particle  can  be  located  within  this 
volume  regardless  of  the  presence  in  this  volume  of  any  arbi¬ 
trary  number  of  other  particles.  Likewise,  any  particle  may 
be  located  within  an  arbitrary  element  of  “volume”  dpx  dpy  dpz 
of  momentum  space  (or  dux  duy  duz  of  velocity  space)  regardless 
of  the  presence  in  these  “volumes”  of  any  arbitrary  number  of 
other  particles.  This  implies  that  any  particle  can  have  momen¬ 
tum  components  along  the  coordinate  axes  within  the  limits 
from $p_x$ to $p_x + dp_x$, from $p_y$ to $p_y + dp_y$ and from $p_z$ to
$p_z + dp_z$ (and, respectively, velocity components of the
particle along the coordinate axes within the limits from $u_x$ to
$u_x + du_x$, $u_y$ to $u_y + du_y$ and $u_z$ to $u_z + du_z$). The minimum
values of the volumes $dx\,dy\,dz$ and $dp_x\,dp_y\,dp_z$ are not restricted
in any manner.

Sections  10.1.2  and  10.1.3  apply  not  only  to  classical  statistical 
physics,  they  are  characteristic  of  all  of  classical  physics. 
Sections  10.1.4  and  10.1.5  apply  only  to  classical  statistical 
physics;  they  do  not  hold  for  quantum  statistics  (41.1.1). 


10.2  Basic  Equation  of  the  Kinetic  Theory  of  Gases 

10.2.1  The  pressure  of  a  gas  (8.3.2)  in  a  vessel  is  the  result 
of  the  collisions  of  the  gas  molecules  with  the  walls  of  the 
vessel.  The  pressure  of  a  gas  is  the  macroscopic  effect  of  the 
thermal  motion  of  molecules  (8.1.1).  As  the  molecules  of  the 
gas  strike  the  walls  of  the  vessel  (8.4.1)  they  impart  their 
momentum  (2.3.4)  to  the  walls.  This  change  in  momentum  of 
the  molecules  exerts  the  pressure  on  the  walls.  For  an  ideal 
gas the collisions between the molecules within the vessel have
no  effect  on  the  pressure  exerted  by  the  gas  on  the  walls.  Owing 
to  the  chaotic  nature  of  thermal  motion  of  the  molecules  (8.1.3) 
the  pressure  of  the  gas  is  the  same  on  all  the  walls  and,  by 
definition  (8.3.2),  is  the  average  force  exerted  normal  to  unit 
area  of  the  wall  surface. 

10.2.2 The basic equation of the kinetic theory of gases is

$$pV = \frac{2}{3} E_k,$$

where $p$ is the pressure of the gas, $V$ is its volume and
$E_k = \sum_{i=1}^{N} \frac{m_i u_i^2}{2}$ is the total kinetic energy of
translational motion of all the $N$ molecules of gas in the vessel.

10.2.3 In a homogeneous gas $m_i = m$, i.e. the mass of each
molecule is the same, but their velocities $u_i$ differ (10.3.1),
and

$$E_k = \frac{m}{2} \sum_{i=1}^{N} u_i^2.$$

It proves expedient to introduce the root-mean-square velocity
$v_{rms}$ of translational motion of the gas molecules:

$$v_{rms} = \sqrt{\frac{1}{N} \sum_{i=1}^{N} u_i^2},$$

where $N$ is the total number of molecules in volume $V$ of the
gas. The root-mean-square velocity pertains to the whole
population of molecules, but is meaningless in application
to a single molecule or to a small number of molecules. Making
use of $v_{rms}$ the expression for $E_k$ assumes the form

$$E_k = \frac{1}{2} N m v_{rms}^2 \quad \text{and} \quad pV = \frac{1}{3} N m v_{rms}^2 = \frac{1}{3} M v_{rms}^2,$$

where $M = Nm$ is the mass of the gas.

Fig. 10.1

The basic equation for the pressure of a gas is

$$p = \frac{2}{3} E_{k0} = \frac{1}{3} n_0 m v_{rms}^2 = \frac{1}{3} \rho v_{rms}^2,$$

where $E_{k0} = E_k/V$, $n_0 = N/V$ is the number of molecules in
unit volume, and $\rho = n_0 m$ is the density of the gas.

10.2.4 From a comparison of the Mendeleev-Clapeyron equation
(8.4.4) with the basic equation (10.2.3), $RT = \mu v_{rms}^2/3$, it follows
that

$$v_{rms} = \sqrt{\frac{3RT}{\mu}} = \sqrt{\frac{3kT}{m}} = \sqrt{3pv},$$

where $k$ is Boltzmann’s constant (8.4.5), $m$ is the mass of a
molecule, $N_A$ is Avogadro’s number (Appendix II), so that
$\mu = mN_A$ and $R = kN_A$, $p$ is the pressure of the gas, and $v$ is
its specific volume.
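
The equivalence of the expressions for the root-mean-square velocity can be sketched numerically. The molar mass of 0.028 kg/mol (roughly that of nitrogen), the temperature, the pressure and the function names are illustrative assumptions; the constants are the exact SI defining values of $k$ and $N_A$.

```python
import math

K_B = 1.380649e-23    # J/K, Boltzmann's constant
N_A = 6.02214076e23   # 1/mol, Avogadro's number
R = K_B * N_A         # J/(mol*K), universal gas constant


def v_rms_from_molar_mass(temperature, molar_mass):
    """v_rms = sqrt(3 R T / mu)."""
    return math.sqrt(3.0 * R * temperature / molar_mass)


def v_rms_from_molecule_mass(temperature, molecule_mass):
    """v_rms = sqrt(3 k T / m)."""
    return math.sqrt(3.0 * K_B * temperature / molecule_mass)


def v_rms_from_pressure(pressure, density):
    """v_rms = sqrt(3 p v), with specific volume v = 1/density."""
    return math.sqrt(3.0 * pressure / density)


mu = 0.028   # kg/mol, assumed (approximately nitrogen)
T = 300.0    # K, assumed
p = 1.0e5    # Pa, assumed
rho = p * mu / (R * T)   # ideal-gas density from the equation of state

v_a = v_rms_from_molar_mass(T, mu)
v_b = v_rms_from_molecule_mass(T, mu / N_A)
v_c = v_rms_from_pressure(p, rho)
print(v_a, v_b, v_c)
```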

10.2.5 The average kinetic energy of translational motion of the
molecules in an ideal gas is

$$\langle E_k \rangle = \frac{m v_{rms}^2}{2} = \frac{3}{2} kT.$$

The average kinetic energy $\langle E_k \rangle$ is proportional to the
absolute temperature and is independent of any other variables
(Fig.  10.1).  The  absolute,  or  thermodynamic,  temperature  is 
a  measure  of  the  average  kinetic  energy  of  translational  motion 
of  the  molecules  in  an  ideal  gas.  This  consequence  of  the  kinetic 
theory  of  gases  is  not  valid  in  the  region  of  ultralow  tempera¬ 
tures,  close  to  absolute  zero  (8.3.4). 
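
The following sketch illustrates that the average translational energy per molecule depends on the temperature alone. The two molecular masses (roughly those of a helium atom and a nitrogen molecule) and the function names are assumptions made for the example.

```python
import math

K_B = 1.380649e-23  # J/K, Boltzmann's constant


def mean_translational_energy(temperature):
    """Average kinetic energy per molecule: (3/2) k T."""
    return 1.5 * K_B * temperature


def energy_from_rms_speed(molecule_mass, temperature):
    """The same energy computed as m v_rms^2 / 2."""
    v_rms = math.sqrt(3.0 * K_B * temperature / molecule_mass)
    return 0.5 * molecule_mass * v_rms ** 2


e_300 = mean_translational_energy(300.0)
e_600 = mean_translational_energy(600.0)
e_light = energy_from_rms_speed(6.6e-27, 300.0)    # roughly a helium atom
e_heavy = energy_from_rms_speed(4.65e-26, 300.0)   # roughly a nitrogen molecule
print(e_300, e_600, e_light, e_heavy)
```

Doubling the temperature doubles the energy, while changing the molecular mass at fixed temperature leaves it unchanged.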


10.3  Maxwell's  Molecular  Velocity  and  Energy 
Distribution  Law  (Maxwell's  Distribution  Law) 

10.3.1  The  law  of  the  distribution  of  molecular  velocities  for  an 
ideal  gas,  theoretically  established  by  Maxwell,  determines  the 
number  dn  of  molecules  of  a  homogeneous  monatomic  ideal 
gas,  out  of  the  total  number  n0  of  its  molecules  in  unit  volume, 
that  has  a  velocity,  at  a  given  temperature,  lying  within  the 
range from $u$ to $u + du$. This law is valid for gases in a state
of  thermodynamic  equilibrium  (8.3.3).  The  velocity  distribu¬ 
tion  of  the  molecules  of  such  a  gas  is  a  steady-state  value. 
The  Maxwellian  distribution  is  based  on  pairwise  collisions  of 
the  randomly  moving  gas  molecules.  This  assumes  uniform 
distribution  of  the  molecules  throughout  the  volume  of  the 
vessel, i.e. constant density of the gas.

10.3.2 In its most customary form, the law of the distribution
of molecular speeds (magnitudes of the molecular velocities)
is presented as

$$dn = n_0 \left(\frac{m}{2\pi kT}\right)^{3/2} e^{-\frac{mu^2}{2kT}}\, 4\pi u^2\,du.$$

Here  u  is  the  magnitude  of  the  molecular  velocity  (speed  of 
a  molecule),  m  is  the  mass  of  a  molecule,  k  is  Boltzmann’s 


constant,  and  T  is  the  absolute  (thermodynamic)  temperature. 
Molecular  velocity  distribution  curves  for  various  temperatures 
Tt  <  T2  <  T3  are  shown  in  Fig.  10.2.  As  is  evident  from  these 
curves,  the  most  probable  velocity  up  (10.3.4)  increases  with 
temperature  and  the  fraction  of  molecules  travelling  at  this 
velocity  decreases. 
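
The behaviour of the curves can be reproduced with a short numerical sketch. It evaluates the speed distribution per molecule, $(1/n_0)\,dn/du$, for nitrogen at three temperatures, using the most probable speed $\sqrt{2kT/m}$ of Sect. 10.3.4. The gas, the temperatures and the simple midpoint integration are assumptions chosen for illustration.

```python
import math

K_B = 1.380649e-23    # Boltzmann constant, J/K
N_A = 6.02214076e23   # Avogadro constant, 1/mol


def maxwell_speed_pdf(u, m, temperature):
    """Fraction of molecules per unit speed interval, (1/n0) dn/du."""
    a = m / (2.0 * K_B * temperature)
    return 4.0 * math.pi * (a / math.pi) ** 1.5 * u * u * math.exp(-a * u * u)


def integrate(f, lo, hi, steps=20000):
    """Composite midpoint rule on [lo, hi]."""
    h = (hi - lo) / steps
    return h * sum(f(lo + (i + 0.5) * h) for i in range(steps))


m_n2 = 28.0e-3 / N_A              # assumed gas: nitrogen
temperatures = [200.0, 400.0, 800.0]

peaks = []
for temperature in temperatures:
    u_p = math.sqrt(2.0 * K_B * temperature / m_n2)   # most probable speed
    peaks.append((u_p, maxwell_speed_pdf(u_p, m_n2, temperature)))

# The distribution is normalized: the integral over all speeds equals unity.
total = integrate(lambda u: maxwell_speed_pdf(u, m_n2, 400.0), 0.0, 5000.0)

for temperature, (u_p, height) in zip(temperatures, peaks):
    print(f"T = {temperature:5.0f} K: u_p = {u_p:6.1f} m/s, peak = {height:.3e} s/m")
print(f"normalization at 400 K: {total:.6f}")
```

As the temperature rises the maximum shifts to higher speeds and becomes lower, while the area under each curve stays equal to unity.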

10.3.3  Another  form  of  the  Maxwellian  distribution  is 

$$dn = n_0 \left( \frac{m}{2\pi kT} \right)^{3/2} e^{-\frac{m u^2}{2kT}} \, du_x \, du_y \, du_z,$$

where $u_x$, $u_y$ and $u_z$ are projections of the molecular velocity
on the coordinate axes. Owing to the random nature of thermal
motion of the molecules (8.1.1), distributions of the molecules
according to the projections $u_i$ of their velocities on the
coordinate axes ($i = x, y, z$) are mutually independent. Hence

$$dn = n_0 f(u_x) f(u_y) f(u_z) \, du_x \, du_y \, du_z,$$

where

$$f(u_i) = \left( \frac{m}{2\pi kT} \right)^{1/2} e^{-\frac{m u_i^2}{2kT}}$$

is a distribution function of the projections of the molecular
velocities.

The Maxwellian distribution is isotropic. This is evident from
the fact that the function $f(u_x, u_y, u_z)$ depends only upon
the speed (magnitude of the velocity), and because the function
$f(u_i)$ is the same along all the axes.

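10.3.4 The law of the distribution of molecular velocities
(10.3.2) can be used to determine the most probable velocity
$u_p$ of the molecules, which corresponds to the maximum of the
function

$$f(u) = \frac{dn}{du} = 4\pi n_0 \left( \frac{m}{2\pi kT} \right)^{3/2} e^{-\frac{m u^2}{2kT}} u^2.$$

The condition

$$\left[ \frac{d}{du} \left( e^{-\frac{m u^2}{2kT}} u^2 \right) \right]_{u = u_p} = 0$$

yields

$$u_p = \sqrt{\frac{2kT}{m}} = \sqrt{\frac{2RT}{\mu}} = \sqrt{2R} \, \sqrt{\frac{T}{\mu}},$$

where $\mu$ is the molar mass of the gas and $R$ is the universal gas constant.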

10.3.5  A  third  form  of  the  law  of  the  distribution  of  molecular 
velocities  is 

$$dn = \frac{4}{\sqrt{\pi}} \, n_0 \, e^{-u^2/u_p^2} \left( \frac{u}{u_p} \right)^2 \frac{du}{u_p}.$$


The fraction $dn/n_0$ of the molecules of a gas having velocities
lying within the range from $u$ to $u + du$ is numerically equal
to the hatched area $dS$ under the curve in Fig. 10.3, which
represents the dependence of $(u_p/n_0)(dn/du)$ on $u/u_p$:

$$dS = \left( \frac{u_p}{n_0} \frac{dn}{du} \right) \frac{du}{u_p} = \frac{dn}{n_0}.$$

The area under the curve in Fig. 10.3, i.e. bounded by the
curve and the axis of abscissas, is equal to unity. This area
represents the fraction of the molecules having all possible
velocity values from 0 to $\infty$.

10.3.6 The arithmetic mean velocity $\langle u \rangle$ of translational motion
of the molecules in an ideal gas, calculated by means of the
distribution law (10.3.2), is

$$\langle u \rangle = \sqrt{\frac{8kT}{\pi m}} = \sqrt{\frac{8RT}{\pi \mu}}.$$
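
The three characteristic speeds of the Maxwellian distribution can be compared directly. The sketch below assumes oxygen (molar mass 32 g/mol) at 300 K; the root-mean-square speed $\sqrt{3RT/\mu}$ is the standard result referred to in Sect. 10.2.4, and the function names are illustrative.

```python
import math

R_GAS = 8.314462618   # universal gas constant, J/(mol K)


def most_probable_speed(temperature, molar_mass):
    """u_p = sqrt(2 R T / mu), in m/s."""
    return math.sqrt(2.0 * R_GAS * temperature / molar_mass)


def mean_speed(temperature, molar_mass):
    """<u> = sqrt(8 R T / (pi mu)), in m/s."""
    return math.sqrt(8.0 * R_GAS * temperature / (math.pi * molar_mass))


def rms_speed(temperature, molar_mass):
    """u_rms = sqrt(3 R T / mu), in m/s."""
    return math.sqrt(3.0 * R_GAS * temperature / molar_mass)


T_ROOM = 300.0    # assumed temperature, K
MU_O2 = 32.0e-3   # assumed molar mass of oxygen, kg/mol

u_p = most_probable_speed(T_ROOM, MU_O2)
u_mean = mean_speed(T_ROOM, MU_O2)
u_rms = rms_speed(T_ROOM, MU_O2)
print(f"u_p = {u_p:.1f}, <u> = {u_mean:.1f}, u_rms = {u_rms:.1f} m/s")
print(f"<u>/u_p = {u_mean / u_p:.4f}, u_rms/u_p = {u_rms / u_p:.4f}")
```

The ratios $\langle u \rangle / u_p = 2/\sqrt{\pi}$ and $u_{\mathrm{rms}}/u_p = \sqrt{3/2}$ do not depend on the gas or on the temperature.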


10.3.7 The energy distribution of the molecules in an ideal
gas determines the fraction $dn_{E_k^m}/n_0$ of molecules, of their
total number $n_0$, having kinetic energies $E_k^m = mu^2/2$ lying
within the range from $E_k^m$ to $E_k^m + dE_k^m$:

$$dn_{E_k^m} = \frac{2}{\sqrt{\pi}} \, n_0 \, (kT)^{-3/2} \, e^{-\frac{E_k^m}{kT}} \sqrt{E_k^m} \, dE_k^m.$$

Here $dn_{E_k^m}/n_0 = f(E_k^m) \, dE_k^m$, where $f(E_k^m)$ is the distribution
function for the molecular energies in an ideal gas.

Example. The average kinetic energy $\langle E_k^m \rangle$ of a molecule in
an ideal gas is

$$\langle E_k^m \rangle = \int_0^\infty E_k^m f(E_k^m) \, dE_k^m = \frac{2}{\sqrt{\pi} \, (kT)^{3/2}} \int_0^\infty E_k^m \, e^{-\frac{E_k^m}{kT}} \sqrt{E_k^m} \, dE_k^m = \frac{3}{2} kT$$

(cf. Sect. 10.2.4).
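
The integral in this example can be verified numerically. The sketch below works in the reduced variable $x = E_k^m/kT$, in which the energy distribution becomes $(2/\sqrt{\pi})\sqrt{x}\,e^{-x}$; the midpoint rule and the cut-off at $x = 60$ are assumptions made for simplicity.

```python
import math


def energy_pdf(x):
    """Maxwell energy distribution in the reduced variable x = E / (k T)."""
    return 2.0 / math.sqrt(math.pi) * math.sqrt(x) * math.exp(-x)


def integrate(f, lo, hi, steps=100000):
    """Composite midpoint rule on [lo, hi]."""
    h = (hi - lo) / steps
    return h * sum(f(lo + (i + 0.5) * h) for i in range(steps))


# The tail beyond x = 60 is of the order of exp(-60) and is neglected.
norm = integrate(energy_pdf, 0.0, 60.0)
mean_x = integrate(lambda x: x * energy_pdf(x), 0.0, 60.0)

print(f"normalization = {norm:.5f}")
print(f"<E> / (k T)   = {mean_x:.5f}")
```

The mean value of $x$ comes out close to 1.5, in agreement with $\langle E_k^m \rangle = \frac{3}{2} kT$.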

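10.3.8 The relative motion (7.1.2) of two particles with the
masses $m_1$ and $m_2$ is equivalent to the motion of a single particle
with the reduced mass $m_{\mathrm{red}} = m_1 m_2/(m_1 + m_2)$. For a
homogeneous gas $m_1 = m_2 = m$ and $m_{\mathrm{red}} = m/2$. The distribution
of molecules according to their relative velocities establishes
the fraction $dn_{u_{\mathrm{rel}}}/n_0$ of molecules, of their total number $n_0$,
having relative velocities $u_{\mathrm{rel}}$ lying within the range from
$u_{\mathrm{rel}}$ to $u_{\mathrm{rel}} + du_{\mathrm{rel}}$:

$$dn_{u_{\mathrm{rel}}} = n_0 \left( \frac{m}{4\pi kT} \right)^{3/2} e^{-\frac{m u_{\mathrm{rel}}^2}{4kT}} \, 4\pi u_{\mathrm{rel}}^2 \, du_{\mathrm{rel}}.$$

Here $dn_{u_{\mathrm{rel}}}/n_0 = f(u_{\mathrm{rel}}) \, du_{\mathrm{rel}}$, where

$$f(u_{\mathrm{rel}}) = 4\pi \left( \frac{m}{4\pi kT} \right)^{3/2} e^{-\frac{m u_{\mathrm{rel}}^2}{4kT}} u_{\mathrm{rel}}^2$$

is the relative velocity distribution of the molecules in an ideal gas.

Example. The average relative velocity $\langle u_{\mathrm{rel}} \rangle$ of the
molecules is

$$\langle u_{\mathrm{rel}} \rangle = \int_0^\infty u_{\mathrm{rel}} f(u_{\mathrm{rel}}) \, du_{\mathrm{rel}} = \sqrt{2} \sqrt{\frac{8kT}{\pi m}} = \sqrt{2} \, \langle u \rangle,$$

where $\langle u \rangle$ is the arithmetic mean velocity of the molecules
(10.3.6).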

10.4  Particle  Distribution  in  a  Potential  Force  Field 
(Boltzmann  Distribution) 

10.4.1 Gases are ordinarily located in the potential (conservative)
gravitational field of the earth (6.2.1). If there were no such
gravitational field, the atmospheric air would disperse
into space. If, on the other hand, there were no thermal
motion, the molecules of atmospheric air would drop to the
earth. Together, gravity and thermal motion lead to a steady state of
gases, in which their concentration and pressure decrease with
increasing altitude above the earth's surface.

10.4.2 If a gas (or other system of particles) is located in an
external potential (conservative) force field (3.1.6), the
distribution of the particles throughout the volume is described by
the Boltzmann distribution law. This law establishes the number
of particles $dn(x, y, z)$ that have coordinates within the ranges
from $x$ to $x + dx$, from $y$ to $y + dy$, and from $z$ to $z + dz$.
In other words, $dn(x, y, z)$ is the number of molecules contained
in the element of volume $dV = dx \, dy \, dz$. The Boltzmann
distribution law is of the form

$$dn(x, y, z) = \mathrm{const} \times e^{-\frac{E_p^m(x, y, z)}{kT}} \, dx \, dy \, dz,$$

where $E_p^m(x, y, z)$ is the potential energy of a particle in the
external force field, $k$ is Boltzmann's constant, and $T$ is the
absolute temperature. The value of the constant in the preceding
equation is determined from the normalization condition:
$$\int dn = n_0,$$

where $n_0$ is the total number of particles in unit volume.

Example. For particles of mass $m$, located in the earth's
gravitational field, $E_p^m = mgh$ (3.3.3), where $g$ is the acceleration
due to gravity (7.3.3), and $h$ is the altitude. At each altitude
there is a Maxwellian molecular velocity distribution (10.3.1).
The number of molecules contained in the volume $dV$ is

$$dn(x, y, z) = \mathrm{const} \times e^{-\frac{mgh}{kT}} \, dV.$$

The gas density $\rho = m \, dn/dV$ decreases with an increase in the
altitude according to the exponential law

$$\rho = \mathrm{const} \times e^{-\frac{mgh}{kT}}.$$

The value of the constant can be determined from the condition
$\rho = \rho_0 = \mathrm{const}$ at $h = 0$. The density of the gas or its pressure
varies according to the barometric height formula

$$\rho = \rho_0 e^{-\frac{mgh}{kT}} \quad \text{and} \quad p = p_0 e^{-\frac{mgh}{kT}}.$$
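
The barometric height formula is easy to evaluate numerically. The sketch below assumes an isothermal atmosphere at 288 K, a constant acceleration due to gravity, and a mean molar mass of air of 29 g/mol; these values and the function name are illustrative assumptions.

```python
import math

K_B = 1.380649e-23    # Boltzmann constant, J/K
N_A = 6.02214076e23   # Avogadro constant, 1/mol
G_ACC = 9.81          # acceleration due to gravity, m/s^2 (assumed constant)


def barometric_ratio(altitude, molar_mass, temperature):
    """Ratio p / p0 (equally rho / rho0) at the given altitude in metres."""
    m = molar_mass / N_A
    return math.exp(-m * G_ACC * altitude / (K_B * temperature))


M_AIR = 29.0e-3   # assumed mean molar mass of air, kg/mol
T_AIR = 288.0     # assumed uniform temperature, K

ratio_5km = barometric_ratio(5000.0, M_AIR, T_AIR)
# Altitude at which the pressure falls by a factor of e.
scale_height = K_B * T_AIR / ((M_AIR / N_A) * G_ACC)

print(f"p/p0 at 5 km  = {ratio_5km:.3f}")
print(f"scale height  = {scale_height:.0f} m")
```

Under these assumptions the pressure falls to roughly half its ground-level value at an altitude of about 5 km.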

10.4.3 The Maxwell-Boltzmann law, or distribution, describes
the distribution of the coordinates and velocities of gas
molecules subject to the action of an arbitrary potential
(conservative) force field:

$$dn = \mathrm{const} \times e^{-\frac{p_x^2 + p_y^2 + p_z^2}{2mkT}} \, dp_x \, dp_y \, dp_z \; e^{-\frac{E_p^m(x, y, z)}{kT}} \, dx \, dy \, dz,$$

where $dn$ is the number of molecules contained in six-dimensional
space within the element of volume $d\gamma = dx \, dy \, dz \, dp_x \, dp_y \, dp_z$,
$E_p^m(x, y, z)$ is the potential energy of a molecule in the
external force field at a point with the coordinates $x$, $y$ and $z$,
and $p_x$, $p_y$ and $p_z$ are the projections of the momentum of the
molecule on the coordinate axes.
The Maxwell-Boltzmann law is the product of two distribution
functions. One of them describes the distribution with respect
to coordinates, and the other with respect to momenta (or
velocities).


10.5 Mean Free Path of Molecules

10.5.1 Gas molecules have finite dimensions (8.4.1) and are
continuously colliding with one another in thermal motion.
Between two successive collisions, molecules travel at uniform
velocity in a straight line over certain distances called their
free paths $\lambda$. The mean free path $\langle \lambda \rangle$ is the average distance that
a molecule travels without a collision. The mean free path is
a characteristic of the entire set of gas molecules at given
values of $p$ and $T$.

10.5.2 In unit time each molecule undergoes a mean number
of collisions $\langle z \rangle$ (average collision frequency) equal to

$$\langle z \rangle = \pi d^2 n_0 \langle u_{\mathrm{rel}} \rangle = \sqrt{2} \, \pi d^2 n_0 \langle u \rangle.$$

Here $d$ is the effective diameter of the molecule (8.4.1), $n_0$
is the number of molecules in unit volume of the gas, $\langle u_{\mathrm{rel}} \rangle$
is their average relative velocity (10.3.8), and $\langle u \rangle$ is their
arithmetic mean velocity (10.3.6).

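10.5.3 The average distance travelled by a molecule in unit
time is numerically equal to $\langle u \rangle$. Therefore $\langle u \rangle = \langle \lambda \rangle \langle z \rangle$.
The mean free path $\langle \lambda \rangle$ is

$$\langle \lambda \rangle = \frac{\langle u \rangle}{\langle z \rangle} = \frac{1}{\sqrt{2} \, \pi d^2 n_0}.$$

At constant temperature, $n_0$ is proportional to the pressure $p$
of the gas (8.4.5). Hence, for a given gas the mean free path
is inversely proportional to the pressure:

$$p_1 \langle \lambda_1 \rangle = p_2 \langle \lambda_2 \rangle = \mathrm{const},$$

in which the subscripts 1 and 2 refer to two states of the gas.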
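
An order-of-magnitude estimate of the mean free path and the collision frequency follows directly from these formulas. The sketch below assumes nitrogen at 273.15 K and 101 325 Pa, the ideal-gas relation $n_0 = p/kT$, and an effective molecular diameter of $3.7 \times 10^{-10}$ m; the diameter is an assumed illustrative value and is not taken from the handbook.

```python
import math

K_B = 1.380649e-23    # Boltzmann constant, J/K
N_A = 6.02214076e23   # Avogadro constant, 1/mol


def number_density(pressure, temperature):
    """Molecules per unit volume of an ideal gas, n0 = p / (k T)."""
    return pressure / (K_B * temperature)


def mean_speed(temperature, molar_mass):
    """Arithmetic mean speed sqrt(8 k T / (pi m)), in m/s."""
    m = molar_mass / N_A
    return math.sqrt(8.0 * K_B * temperature / (math.pi * m))


def mean_free_path(diameter, n0):
    """<lambda> = 1 / (sqrt(2) pi d^2 n0), in m."""
    return 1.0 / (math.sqrt(2.0) * math.pi * diameter ** 2 * n0)


def collision_frequency(diameter, n0, u_mean):
    """<z> = sqrt(2) pi d^2 n0 <u>, in 1/s."""
    return math.sqrt(2.0) * math.pi * diameter ** 2 * n0 * u_mean


D_N2 = 3.7e-10      # assumed effective diameter of N2, m
M_N2 = 28.0e-3      # molar mass of N2, kg/mol
T_GAS = 273.15      # K
P_GAS = 101325.0    # Pa

n0 = number_density(P_GAS, T_GAS)
u_mean = mean_speed(T_GAS, M_N2)
lam = mean_free_path(D_N2, n0)
z = collision_frequency(D_N2, n0, u_mean)
lam_half = mean_free_path(D_N2, number_density(0.5 * P_GAS, T_GAS))

print(f"<lambda> = {lam:.2e} m, <z> = {z:.2e} 1/s")
print(f"<lambda> at half the pressure = {lam_half:.2e} m")
```

Halving the pressure at constant temperature doubles the mean free path, so the product $p \langle \lambda \rangle$ is unchanged.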

10.5.4  If  molecules  are  ejected  from  some  source  of  particles 
and  a  beam  is  formed  by  means  of  a  diaphragm,  the  free  path 
distribution  law  is  valid  for  the  molecules  of  the  beam: 

$$N = N_0 e^{-x/\langle \lambda \rangle},$$

where  N  is  the  number  of  molecules  in  the  beam  that  have 
travelled  a  distance  x  without  collisions,  and  N0  is  the  number 
of  molecules  in  the  beam  at  x  =  0,  i.e.  at  the  output  of  the 
diaphragm. 
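
The free path distribution law can be illustrated with a short sketch that computes the fraction of beam molecules surviving a given distance. The numerical value of the mean free path used here is hypothetical and serves only as an example.

```python
import math


def surviving_fraction(distance, mean_free_path):
    """Fraction N / N0 of beam molecules that travel `distance` without a collision."""
    return math.exp(-distance / mean_free_path)


def distance_for_fraction(fraction, mean_free_path):
    """Distance at which the surviving fraction drops to `fraction`."""
    return -mean_free_path * math.log(fraction)


LAMBDA = 6.0e-8   # hypothetical mean free path, m

frac_one_path = surviving_fraction(LAMBDA, LAMBDA)
half_distance = distance_for_fraction(0.5, LAMBDA)

print(f"N/N0 after one mean free path: {frac_one_path:.3f}")
print(f"distance at which N/N0 = 1/2:  {half_distance:.2e} m")
```

Only a fraction $1/e$ of the molecules travels a full mean free path without a collision, and half of them have collided after a distance of $\langle \lambda \rangle \ln 2$.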


10.6 Principle of the Equipartition of Energy.
Internal Energy of an Ideal Gas


10.6.1  The  number  of  degrees  of  freedom  of  a  body  is  the  minimum 
number  of  coordinates  (number  of  independent  coordinates) 
that  it  is  necessary  to  specify  in  order  to  completely  determine 
the  position  of  the  body  in  space. 

A particle, for instance, travelling along one of the coordinate
axes has a single degree of freedom. The same particle travelling
in a plane has two degrees of freedom. The position of a particle
travelling freely in space is determined by three degrees
of freedom: the coordinates $x$, $y$ and $z$. A rigid body (1.1.4)
has six degrees of freedom: its position in space is determined
by the three coordinates of its centre of mass (2.3.3), two angles,
$\theta$ and $\varphi$, determining the direction of a certain axis,
fixed in the body and passing through its centre of mass
(Fig. 10.4), and, in addition, angle $\psi$, determining the
direction of a second axis, fixed in the body and perpendicular
to the first axis. Changes in the three coordinates of the centre
of mass at given constant angles $\theta$, $\varphi$ and $\psi$ correspond
to translational motion of the rigid body. The coordinates of the
centre of mass are the three degrees of freedom of translational
motion. Changes in the angle $\theta$, $\varphi$ or $\psi$ at a constant
position of the centre of mass lead to rotation of the rigid body.
Hence, the corresponding degrees of freedom are said to be rotary.
Additional degrees of freedom of oscillatory, or vibratory, motion
(10.6.5) are introduced to determine the position in space of an
imperfectly rigid body, whose various parts can be displaced with
respect to one another.

10.6.2 In many problems a molecule of a monatomic gas can
be regarded as a particle (cf. Sect. 8.4.1). This is based on the
fact that practically all the mass of such an atom is concentrated
in its nucleus (42.1.1), which is of extremely small size. The
molecule of a monatomic gas has three degrees of freedom of
translational motion. Its average kinetic energy $\langle E_k^m \rangle$ is equal
to the kinetic energy of a molecule travelling at a velocity
equal to the root-mean-square velocity $u_{\mathrm{rms}}$ (10.2.4):

$$\langle E_k^m \rangle = \frac{m u_{\mathrm{rms}}^2}{2} = \frac{3}{2} kT.$$

Owing to the random nature of thermal motion of the molecules,
this energy is uniformly distributed between the three
degrees of freedom. Hence, on an average each degree of freedom
of translational motion of a monatomic molecule accounts
for the same amount of kinetic energy $\langle E_{k0}^m \rangle = \langle E_k^m \rangle/3 = kT/2$,
where $k$ is Boltzmann's constant and $T$ is the absolute
temperature.

Fig. 10.6

10.6.3 As a first approximation, a molecule of a diatomic gas
consists of two atoms, $A$ and $B$, rigidly held together by an
incompressible, undeformable bond (Fig. 10.5). In addition
to the three degrees of freedom of translational motion of its
centre of mass $C$ (2.3.3), such a molecule has two more degrees
of freedom of rotary motion about axes $O_1$-$O_1$ and $O_2$-$O_2$.
Rotation about the third axis $O'$-$O'$ makes no contribution to the
molecule's energy because the moment of inertia (4.2.1) of
the atoms with respect to this axis is negligibly small. Thus,
a diatomic molecule has five degrees of freedom. Molecules
consisting of three or more atoms have six degrees of freedom:
three translational and three rotary degrees (Fig. 10.6).

10.6.4 The principle of equipartition of energy states that
on an average each degree of freedom of a molecule accounts
for the same amount of kinetic energy, equal to $kT/2$. If a
molecule has $i$ degrees of freedom its average kinetic energy is
$\langle E_k^m \rangle = ikT/2$.

The principle of equipartition of energy leads to the conclusion
that all degrees of freedom of the molecule are equivalent:
each makes the same contribution to its average energy. This
conclusion actually has only a limited field of application, and
has been revised in quantum statistics (41.1.1).

10.6.5 In real molecules, which are not absolutely rigid,
vibratory degrees of freedom make a contribution to the energy of
the molecule in accordance with the principle of equipartition
of energy (10.6.4). Each vibratory degree of freedom accounts,
not only for kinetic energy $E_k^m$, but for potential energy $E_p^m$
as well. For harmonic vibrations (28.1.3), $\langle E_k^m \rangle = \langle E_p^m \rangle$.
According to Sect. 10.6.4

$$\langle E_{k0}^m \rangle = \langle E_{p0}^m \rangle = \frac{1}{2} kT,$$

where $\langle E_{k0}^m \rangle$ and $\langle E_{p0}^m \rangle$ are the average kinetic and potential
energies of a molecule per vibratory degree of freedom.

The average value $\langle E_0^m \rangle$ of the total energy of a molecule per
vibratory degree of freedom is

$$\langle E_0^m \rangle = \langle E_{k0}^m \rangle + \langle E_{p0}^m \rangle = 2 \langle E_{k0}^m \rangle = kT.$$

This equation is valid for harmonic vibrations of particles
(atoms, molecules or ions) in the crystal lattice sites of
solids (8.1.4).

10.6.6 The internal energy (9.1.2) of a polyatomic gas consists
of the kinetic energy of all kinds of motion of its particles.
For one mole of such a gas

$$U = \langle E_k^m \rangle N_A = \frac{i}{2} k N_A T = \frac{i}{2} RT,$$

where $N_A$ is Avogadro's number (Appendix II), $i$ is the number
of degrees of freedom of the molecule (10.6.4), and $R$ is the
universal gas constant (8.4.4).


10.7  Heat  Capacity  of  Monatomic,  Diatomic  and 
Polyatomic  Gases 


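10.7.1 The molecules of a monatomic gas have three degrees
of freedom (10.6.2) and, according to Sect. 10.6.6, the internal
energy of one mole of such a gas is $U = 3kN_A T/2$.

The molar heat capacity of such a gas at constant volume (9.5.4)
is

$$C_{V\mu} = \frac{dU}{dT} = \frac{3}{2} N_A k = \frac{3}{2} R = 12.465 \ \frac{\mathrm{J}}{\mathrm{mol \cdot K}} \approx 3 \ \frac{\mathrm{cal}}{\mathrm{mol \cdot K}}$$

(the first unit is the joule per mole-kelvin), because $R \approx 2$ cal/(mol·K).

For a polyatomic gas whose molecules have $i$ degrees of freedom

$$C_{V\mu} = \frac{iR}{2} = 4.16 \, i \ \frac{\mathrm{J}}{\mathrm{mol \cdot K}} \approx i \ \frac{\mathrm{cal}}{\mathrm{mol \cdot K}}.$$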


10.7.2 The molar heat capacity $C_{p\mu}$ at constant pressure (9.5.7
and 9.5.8) of a gas whose molecules have $i$ degrees of freedom is

$$C_{p\mu} = \frac{(i + 2) R}{2} = 4.16 \, (i + 2) \ \frac{\mathrm{J}}{\mathrm{mol \cdot K}} \approx (i + 2) \ \frac{\mathrm{cal}}{\mathrm{mol \cdot K}}.$$

The principle of equipartition of energy (10.6.4) leads to the
conclusion that the heat capacity of a gas depends upon the
number of degrees of freedom of its molecules, but does not
depend on the temperature. Experimental data disprove these
conclusions of the classical theory of heat capacity.
Investigations indicate that the heat capacity of gases increases
as the temperature is raised and decreases as it is lowered.
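
The classical values that follow from the equipartition principle can be tabulated with a few lines of code. This is a minimal sketch; the function names are illustrative, and the current recommended value of $R$ is used, so the monatomic result is 12.47 rather than the 12.465 J/(mol·K) obtained with $R = 8.31$ J/(mol·K).

```python
R_GAS = 8.314462618   # universal gas constant, J/(mol K)


def molar_cv(degrees_of_freedom):
    """Classical molar heat capacity at constant volume, i R / 2."""
    return 0.5 * degrees_of_freedom * R_GAS


def molar_cp(degrees_of_freedom):
    """Classical molar heat capacity at constant pressure, (i + 2) R / 2."""
    return 0.5 * (degrees_of_freedom + 2) * R_GAS


capacities = {}
for name, i in [("monatomic", 3), ("diatomic", 5), ("polyatomic", 6)]:
    capacities[name] = (molar_cv(i), molar_cp(i))
    cv, cp = capacities[name]
    print(f"{name:10s} i = {i}: C_V = {cv:6.2f}, C_p = {cp:6.2f} J/(mol K), "
          f"C_p/C_V = {cp / cv:.3f}")
```

In this classical picture the difference $C_{p\mu} - C_{V\mu}$ equals $R$ for any $i$, and neither heat capacity depends on temperature.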

10.7.3 A theoretical explanation of experimental data on the
dependence of heat capacity on temperature in a wide temperature
range is given in the quantum theory of heat capacity.
This theory takes into account the following results obtained
in a quantum-mechanical description of the properties of
diatomic and polyatomic molecules:

(a) Of the four parts that comprise the energy $E^m$ of the
molecule* (39.5.2), i.e.

$$E^m = E_t^m + E_e^m + E_v^m + E_r^m,$$

only $E_t^m$, the energy of translational motion of the molecule's
centre of mass (2.3.3), varies continuously (not by discrete
amounts). The values of all other kinds of internal energy
of molecules are quantized and thereby assume only a discrete
series of values (39.4.5). This refers to the following kinds of
energy: $E_e^m$, the energy of motion of the electrons of the
molecule; $E_v^m$, the energy of vibrational motion of the nuclei in the
molecule; and $E_r^m$, the energy of rotary motion of the molecule.

(b) As a first approximation, the kinds of internal motion
of the molecule can be assumed independent of one another.
Thus, at small amplitudes of vibration of the nuclei, the changes
in the moments of inertia (4.2.1) of the molecule due to
vibration can be neglected, as can the effect of the vibrational motion
in the molecule on its rotation.

(c) No changes in the energy of motion of the electrons occur
when a substance is heated by one degree at ordinary
temperatures. Adjacent energy levels of the electrons in the molecule
are separated by energy of the order of several eV, corresponding
to temperatures of several tens of thousands of degrees
(10.7.6). Therefore, the energy of motion of electrons in the
molecule can be neglected in dealing with the heat capacities
of monatomic and polyatomic gases up to very high
temperatures.

* This ignores the nuclear energy of the molecule, which has
no influence on the heat capacity of molecules.

10.7.4 As a first approximation, the vibrational motion of
the nuclei in the molecule is described as the vibration of
a harmonic oscillator (38.5.4), whose mass $m$ is equal to the
reduced mass of a system of vibrating atoms, $m = m_1 m_2/(m_1 + m_2)$,
where $m_1$ and $m_2$ are the masses of the atoms. Then the
energy of vibrational motion of the molecule equals

$$E_v^m = h\nu_0 \left( n + \frac{1}{2} \right),$$

where $n$ is the quantum number and assumes integral values:
$n = 0, 1, 2, \ldots$, $\nu_0$ is the natural frequency of vibrations
(38.5.5), and $h$ is Planck's constant (Appendix II). The energy
of a harmonic oscillator, equal to $h\nu_0/2$ at $n = 0$, is conserved
at any arbitrarily low temperatures, and even as $T \to 0$ (38.5.6).
The difference $\Delta E_v^m$ between the energies of two adjacent
vibrational energy levels, $\Delta E_v^m = h\nu_0$, is independent of the quantum
number. At room and lower temperatures the condition
$\Delta E_v^m \gg kT$ holds. No changes are observed in the energy of
vibrational motion of molecules at such temperatures. In
calculating the heat capacity of gases under such conditions,
vibrational degrees of freedom of the molecule can be neglected.
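In the general case, vibrational motions make contributions
to the internal energy $U_v$ (9.1.2) and the heat capacity $C_{V\mu v}$
(9.5.2). For one mole

$$U_v = \frac{N_A h\nu}{2} + \frac{N_A h\nu}{e^{h\nu/kT} - 1}, \qquad C_{V\mu v} = N_A k \left( \frac{h\nu}{kT} \right)^2 \frac{e^{h\nu/kT}}{\left( e^{h\nu/kT} - 1 \right)^2},$$

where $\nu = \nu_0$ is the natural frequency of vibrations, $T$ is the
absolute temperature, $k$ is Boltzmann's constant
(8.4.5) and $N_A$ is Avogadro's number (Appendix II). At high
temperatures ($T \gg h\nu/k$)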

$$U_v \approx N_A kT = RT, \qquad C_{V\mu v} \approx N_A k = R.$$

These results coincide with those of classical theory which
follow from the principle of the equipartition of energy (10.6.4).
Under these conditions the energy of the oscillator can be
assumed to vary continuously because $\Delta E_v^m \ll kT$. At
sufficiently low temperatures ($T \ll h\nu/k$)

$$U_v \approx \frac{N_A h\nu}{2} + N_A h\nu \, e^{-\frac{h\nu}{kT}} \quad \text{and} \quad C_{V\mu v} \approx N_A k \left( \frac{h\nu}{kT} \right)^2 e^{-\frac{h\nu}{kT}}.$$

As $T \to 0$, the energy $U_v \to N_A h\nu/2$. This quantity is called
the zero-point vibrational energy of the system (38.5.6). As
$T \to 0$, the heat capacity $C_{V\mu v} \to 0$ in accordance with the third
law of thermodynamics (11.8.4). The dependence $C_{V\mu v} = f(T)$ for
diatomic gases is shown in Fig. 10.7.
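
The passage from the low-temperature to the high-temperature limit can be followed numerically. The sketch below assumes the standard harmonic-oscillator result for the molar vibrational heat capacity, $C_{V\mu v} = R\,x^2 e^{x}/(e^{x} - 1)^2$ with $x = h\nu/kT$, written in an equivalent form that avoids overflow at low temperatures. The characteristic temperature $h\nu/k = 3400$ K is a hypothetical value, of the order of that of nitrogen.

```python
import math

R_GAS = 8.314462618   # universal gas constant, J/(mol K)


def vibrational_cv(temperature, theta):
    """Molar vibrational heat capacity for x = theta / T, theta = h nu / k.

    Uses R x^2 e^(-x) / (1 - e^(-x))^2, which is algebraically the same as
    R x^2 e^x / (e^x - 1)^2 but does not overflow for large x.
    """
    x = theta / temperature
    decay = math.exp(-x)
    return R_GAS * x * x * decay / (1.0 - decay) ** 2


THETA = 3400.0   # hypothetical characteristic temperature h nu / k, K

temperatures = [100.0, 300.0, 1000.0, 3400.0, 34000.0]
values = [vibrational_cv(t, THETA) for t in temperatures]

for t, c in zip(temperatures, values):
    print(f"T = {t:7.0f} K: C_v(vib) / R = {c / R_GAS:.5f}")
```

At room temperature the vibrational contribution is a small fraction of $R$, while at temperatures far above $h\nu/k$ it approaches the classical value $R$.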

Fig. 10.7

10.7.5 As a first approximation, the rotary motion of diatomic
molecules can be dealt with as the motion of a rigid dumbbell
which rotates about its centre of mass with the moment of
inertia $I = m_1 m_2 r_0^2/(m_1 + m_2) = m r_0^2$, where $m = m_1 m_2/(m_1 + m_2)$ is
the reduced mass of the molecule ($m_1$ and $m_2$ are the masses
of the two atoms), and $r_0$ is the distance between the atoms
in the molecule. The energy of such a system is

$$E_r^m = \frac{h^2}{8\pi^2 I} J (J + 1),$$

where $J$ is a quantum number that assumes the values
$J = 0, 1, 2, \ldots$ . The difference $\Delta E_r^m$ between the energies
of two adjacent rotational energy levels is

$$\Delta E_r^m = \frac{h^2}{4\pi^2 I} (J + 1).$$

The value of $\Delta E_r^m$ is less than $\Delta E_v^m$ by a factor of 800 to 1000.

At ordinary temperatures $\Delta E_r^m \ll kT$ for diatomic and certain
polyatomic gases (water vapour, methane, etc.), and the
quantization of the energy of rotary motion of the molecules can
be ignored. Under these conditions, the principle of equipartition
of energy (10.6.4) can be applied in calculating the
contribution of rotary motion to the heat capacity.

The contribution of rotary motion of diatomic molecules to
the internal energy and heat capacity per mole equals:

(a) At high temperatures ($T \gg h^2/8\pi^2 kI$)

$$U_r \approx N_A kT \left( 1 - \frac{h^2}{24\pi^2 I kT} \right) \quad \text{and} \quad C_{V\mu r} \approx N_A k = R.$$

The heat capacity $C_{V\mu r}$ at high temperatures has a value
following from the principle of equipartition of energy (10.6.4).

(b) At low temperatures ($T \ll h^2/8\pi^2 kI$),
as $T \to 0$, the heat capacity $C_{V\mu r} \to 0$ (10.7.4). The general
shape of the curve representing the dependence $C_{V\mu r} = f(T)$
is the same as that in Fig. 10.7.

As the gas cools, the reduction in the energy of its molecules
leads to a reduction in the number of molecules transferring
to higher rotational energy levels. At sufficiently low
temperatures, the rotation of the molecules in a gas practically
cannot be excited, and rotational degrees of freedom make
no contribution to the heat capacity of the gas. The heat
capacities of all gases at low temperatures become the same as for
monatomic gases whose molecules have no rotational degrees
of freedom.

10.7.6 At extremely high temperatures, of the order of tens
of thousands of degrees, the heat capacity is increased. This is
associated with the fact that the value of $kT$ under such
conditions becomes comparable, in order of magnitude, with $\Delta E_e^m$,
the change in the energy of the electrons in their transition
from lower to higher energy levels. Moreover, at high
temperatures the processes of dissociation and ionization of the
gases make contributions to their heat capacities.


10.8  Transport  Phenomena  in  Gases 

10.8.1 Transport phenomena are a group of processes that arise
from nonuniformities in the density, temperature or velocity
of ordered motion of various layers of a substance. The
equalization of these nonuniformities leads to transport phenomena.
Transport phenomena include diffusion, internal friction and
heat conduction.

10.8.2 Transport phenomena in gases and liquids consist of
ordered directional mass transport in these substances
(diffusion), momentum transport (internal friction) and internal
energy transport (heat conduction). In gases these phenomena
disturb the completely random motion of the molecules and
Maxwell's molecular velocity distribution (10.3.2). Deviations
from the Maxwell law are explained by the directional
transport of the physical characteristics of the substance in
such phenomena. In the simplest cases of one-dimensional
transport phenomena, the physical quantities that govern
them depend on only a single Cartesian coordinate.

10.8.3 Diffusion is the spontaneous interpenetration and mixing
of the particles of two adjoining gases, liquids or solids. In
chemically pure gases at constant temperature, diffusion occurs
owing to differing densities in various parts of the
volume of the gas. In a mixture of gases, diffusion is caused
by the difference in the densities of the various gases in various
parts of the volume of the gas mixture.

In a chemically homogeneous gas the phenomenon of diffusion
consists in the transport of a mass of the gas from places with
higher density to places with lower density. This phenomenon
obeys Fick's law:

$$m_{\mathrm{sec}} = -D \frac{d\rho}{dx}.$$

Here $m_{\mathrm{sec}}$ is the specific mass flow and is numerically equal
to the mass that passes by diffusion in unit time through a plane
surface of unit area located perpendicular to the direction in which the
substance is being transported, $\rho$ is the density of the gas,
and $D$ is the diffusion coefficient. The derivative $d\rho/dx$ is
numerically equal to the density gradient, i.e. the change in density
per unit length. The diffusion coefficient is numerically equal
to the specific mass flow at unit density gradient. The minus
sign in Fick's law indicates that mass transport takes place
in the direction of decreasing density.

Another form of Fick's law is

$$j = -D \frac{dn_0}{dx},$$

where $j = m_{\mathrm{sec}}/m$ is the density of the molecular flow in
diffusion, i.e. the number of molecules diffusing in unit time
through the surface indicated in the first form of Fick's law,
$n_0$ is the molecular concentration, equal to the number of
molecules per unit volume, and $m$ is the mass of one molecule,
so that $\rho = n_0 m$.

The expression obtained in the kinetic theory of gases for the
diffusion coefficient is

$$D = \frac{1}{3} \langle u \rangle \langle \lambda \rangle,$$

where $\langle u \rangle$ is the arithmetic mean velocity of thermal motion
of the molecules (10.3.6), and $\langle \lambda \rangle$ is the mean free path (10.5.1).

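The kinetic-theory expression for the diffusion coefficient can be evaluated for a concrete case. The sketch below assumes nitrogen at 273.15 K and 101 325 Pa, the ideal-gas relation $n_0 = p/kT$, and an effective molecular diameter of $3.7 \times 10^{-10}$ m (an assumed illustrative value); it gives an order-of-magnitude estimate rather than a measured coefficient.

```python
import math

K_B = 1.380649e-23    # Boltzmann constant, J/K
N_A = 6.02214076e23   # Avogadro constant, 1/mol


def mean_speed(temperature, molar_mass):
    """Arithmetic mean speed sqrt(8 k T / (pi m)), in m/s."""
    m = molar_mass / N_A
    return math.sqrt(8.0 * K_B * temperature / (math.pi * m))


def mean_free_path(pressure, temperature, diameter):
    """<lambda> = 1 / (sqrt(2) pi d^2 n0) with n0 = p / (k T), in m."""
    n0 = pressure / (K_B * temperature)
    return 1.0 / (math.sqrt(2.0) * math.pi * diameter ** 2 * n0)


def diffusion_coefficient(pressure, temperature, molar_mass, diameter):
    """D = <u> <lambda> / 3, in m^2/s."""
    return (mean_speed(temperature, molar_mass)
            * mean_free_path(pressure, temperature, diameter) / 3.0)


D_N2 = 3.7e-10      # assumed effective diameter of N2, m
M_N2 = 28.0e-3      # molar mass of N2, kg/mol
T_GAS = 273.15      # K
P_GAS = 101325.0    # Pa

d_coeff = diffusion_coefficient(P_GAS, T_GAS, M_N2, D_N2)
d_coeff_half = diffusion_coefficient(0.5 * P_GAS, T_GAS, M_N2, D_N2)

print(f"D at 1 atm   = {d_coeff:.2e} m^2/s")
print(f"D at 0.5 atm = {d_coeff_half:.2e} m^2/s")
```

Because $\langle \lambda \rangle$ is inversely proportional to the pressure, the estimated diffusion coefficient doubles when the pressure is halved at constant temperature.
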
10.8.4 Internal friction (viscosity) is the occurrence of friction
forces between layers of a gas or liquid that flow parallel to
one another at different speeds. A layer flowing faster exerts
an accelerating force on a slower layer. On the other hand,
a slow layer retards a faster flowing layer of fluid. The forces
of internal friction (viscosity) are directed tangent to the
contacting surfaces of the layers. The cause of viscosity (as
internal friction in gases and liquids is more commonly called)
is the superposition of ordered motion of layers of fluid at
various velocities $v$ and the thermal random motion of the
molecules at velocities depending upon the temperature. The
random motion of the molecules transfers them from layer $B$
flowing at the velocity $v_2$ into layer $A$ flowing at the velocity $v_1$
(Fig. 10.8). This leads to transport of momenta $mv$ of ordered
motion of the molecules. When $v_1 > v_2$, the molecules that
entered layer $A$ from layer $B$ have their velocities of ordered
motion increased by collisions with the molecules of the
new layer. This slows down the ordered motion
of the molecules in layer $A$. On the other hand, molecules
passing from faster layer $A$ to slower layer $B$ have higher
momenta $mv$, so that the intermolecular collisions in layer $B$
accelerate the motion of its molecules.

The phenomenon of viscosity is described by the Newtonian
friction law:

$$\tau = -\eta \frac{dv}{dn},$$

where $\tau$ is the friction stress, i.e. the physical quantity equal
numerically to the force of internal friction acting per unit
area of the surface of the layer, and $dv/dn$ is the velocity gradient,
i.e. the variation in the velocity of layer motion per unit length
in the direction of the inner normal $n$ to the layer surface.
The internal frictional force is opposite in direction to the
derivative of the gas velocity vector $v$ with respect to $n$.

Fig. 10.8

The quantity $\eta$ is called the coefficient of dynamic viscosity
(or just the viscosity). It is numerically equal to the friction
stress at a velocity gradient equal to unity.

The coefficient of kinematic viscosity is the quantity $\nu = \eta/\rho$,
where $\rho$ is the density of the substance.

The coefficient of dynamic viscosity is calculated by the equation

$$\eta = \frac{1}{3} \langle u \rangle \langle \lambda \rangle \rho,$$

where $\langle u \rangle$ is the arithmetic mean velocity of thermal motion
of the molecules (10.3.6), and $\langle \lambda \rangle$ is their mean free path
(10.5.1). The coefficient of dynamic viscosity is independent
of the pressure (or density) of the gas since $\langle \lambda \rangle \propto 1/\rho$. This
can be explained by the fact that in an isothermal increase
in the density of a gas, for example, by two times, the number
of carriers of momentum is doubled, but each molecule (or atom)
travels only one half the distance without a collision and carries
only one half as much momentum. Hence, all in all, momentum
transport remains unchanged.
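
The independence of $\eta$ from the density can be checked numerically. The following sketch is only an illustration. It assumes the standard kinetic-theory expressions $\langle u \rangle = \sqrt{8kT/(\pi m)}$ and $\langle \lambda \rangle = 1/(\sqrt{2}\,\pi d^2 n)$ for the quantities referred to in Sects. 10.3.6 and 10.5.1; the molecular mass and effective diameter are rough, nitrogen-like values chosen for the example, and all function names are hypothetical.

```python
import math

K_B = 1.380649e-23  # Boltzmann constant, J/K


def mean_speed(T, m):
    """Arithmetic mean thermal speed <u> = sqrt(8 k T / (pi m)) (assumed form)."""
    return math.sqrt(8.0 * K_B * T / (math.pi * m))


def mean_free_path(n, d):
    """Mean free path <lambda> = 1 / (sqrt(2) pi d^2 n) (assumed form)."""
    return 1.0 / (math.sqrt(2.0) * math.pi * d**2 * n)


def dynamic_viscosity(n, T, m, d):
    """eta = (1/3) <u> <lambda> rho, with rho = n m."""
    rho = n * m
    return mean_speed(T, m) * mean_free_path(n, d) * rho / 3.0


M_MOLECULE = 4.65e-26  # kg, illustrative nitrogen-like molecular mass
D_MOLECULE = 3.7e-10   # m, illustrative effective diameter

# Same temperature, number density doubled: eta should not change.
eta_1 = dynamic_viscosity(2.5e25, 300.0, M_MOLECULE, D_MOLECULE)
eta_2 = dynamic_viscosity(5.0e25, 300.0, M_MOLECULE, D_MOLECULE)
print(eta_1, eta_2)
```

Doubling the number density doubles $\rho$ and halves $\langle \lambda \rangle$, so the two printed values coincide to rounding error.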
10.8.5 The third transport phenomenon, heat conduction, occurs
under  the  condition  that  a  difference  in  temperature  exists 
in  a  body  in  a  certain  direction.  For  instance,  the  two  opposing 
walls  of  a  vessel  holding  a  gas  can  have  different  temperatures 
that  are  maintained  by  external  sources.  Then  the  gas  molecules 
at  various  locations  throughout  the  volume  have  different 
average  kinetic  energies  (10.2.4).  Under  such  conditions,  the 
random  thermal  motion  of  the  molecules  leads  to  directional 
energy  transport  (or  transfer)  in  the  form  of  heat  (9.2.6).  Mol¬ 
ecules  passing  from  the  heated  parts  of  the  volume  of  gas  to 
the  colder  parts  yield  some  of  their  average  kinetic  energy 
to  the  surrounding  molecules  in  the  process  of  molecular  colli¬ 
sions.  Slow  molecules,  on  the  contrary,  increase  their  average 
kinetic  energy,  in  passing  from  the  less  heated  to  the  more  heated 
parts  of  the  gas,  owing  to  collisions  with  molecules  having 
higher  velocities. 

In one-dimensional heat conduction, when the temperature
of the gas depends only on a single coordinate, $T = T(x)$,
energy transfer in the form of heat proceeds along the OX axis
according to the Fourier law of heat conduction (Fourier heat
equation):

$$q_{sec} = -K \frac{dT}{dx},$$

where $q_{sec}$ is the specific rate of heat flow, a physical quantity
numerically equal to the energy transmitted in the form of heat
in unit time through a plane surface of unit area located
perpendicular to the direction of energy transport. The quantity $K$
is called the thermal conductivity. It is numerically equal to the
specific rate of heat flow at the temperature gradient $dT/dx$
(the change in temperature over unit length) equal to unity.
The minus sign in the Fourier law indicates that in heat
conduction energy is transported in the direction of decreasing
temperatures.
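
As a numerical illustration of the Fourier law, the sketch below evaluates $q_{sec}$ for a gas layer between two walls. It assumes a steady state with a linear temperature profile and a constant $K$; the value of $K$, the wall temperatures, the layer thickness, the area and the time are made-up example numbers, and the function name is hypothetical.

```python
def heat_flux(K, T_hot, T_cold, thickness):
    """Specific rate of heat flow q_sec = -K dT/dx for a linear profile.

    The OX axis points from the hot wall (x = 0) to the cold wall
    (x = thickness), so dT/dx is negative and q_sec is positive.
    """
    dT_dx = (T_cold - T_hot) / thickness
    return -K * dT_dx


K_GAS = 0.025  # W/(m K), assumed illustrative thermal conductivity

q_sec = heat_flux(K_GAS, T_hot=320.0, T_cold=300.0, thickness=0.01)

# Energy passed through an area of 0.5 m^2 during 60 s: dQ = q_sec dS dt.
heat_passed = q_sec * 0.5 * 60.0
print(q_sec, heat_passed)
```

The positive sign of `q_sec` reflects transport along OX, i.e. toward the lower temperature.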

According to the kinetic theory of gases, the thermal
conductivity is equal to

$$K = \frac{1}{3} \langle u \rangle \langle \lambda \rangle c_V \rho,$$

where $c_V$ is the specific heat at constant volume (9.5.4).
The other quantities are defined in Sect. 10.8.4. It follows
from the last equation that the thermal conductivity is
independent of the gas density. The reasons are similar to those
given in Sect. 10.8.4 to explain why $\eta$ is independent of the
density.

10.8.6  The  laws  of  transport  phenomena,  given  in  the  subsec¬ 
tions  10.8.3,  10.8.4  and  10.8.5,  can  be  written  in  another  form: 

$$dM = -D \frac{d\rho}{dx}\, dS\, dt, \qquad dF = -\eta \frac{dv}{dn}\, dS \qquad \text{and} \qquad dQ = -K \frac{dT}{dx}\, dS\, dt.$$


Here  dM  is  the  mass  transported  in  diffusion  during  the  time 
dt  through  the  element  of  area  dS,  located  perpendicular  to 
the  direction  of  diffusion;  dQ  is  the  amount  of  energy  transported 
in  the  form  of  heat  during  the  time  dt  through  the  same  element 
of  area  dS,  located  perpendicular  to  the  OX  axis;  and  dF  is 
the  force  of  internal  friction  (viscous  force)  exerted  on  an  ele¬ 
ment  of  surface  of  the  layer  having  the  area  dS.  The  other 
quantities  are  defined  in  the  subsections  mentioned  above. 


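TABLE 10.1

| Phenomenon | Transported physical quantity | Transport equation | Formula for the transport coefficient |
|---|---|---|---|
| Diffusion | Mass | $dM = -D \frac{d\rho}{dx}\, dS\, dt$ | $D = \frac{1}{3} \langle u \rangle \langle \lambda \rangle$ |
| Internal friction (viscosity) | Momentum | $dF = -\eta \frac{dv}{dn}\, dS$ | $\eta = \frac{1}{3} \langle u \rangle \langle \lambda \rangle \rho$ |
| Heat conduction | Energy in the form of heat | $dQ = -K \frac{dT}{dx}\, dS\, dt$ | $K = \frac{1}{3} \langle u \rangle \langle \lambda \rangle c_V \rho$ |

10.8.7 Simple relations exist between the transport coefficients:

$$\eta = \rho D \qquad \text{and} \qquad \frac{K}{\eta c_V} = 1.$$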

If any one of the transport coefficients is known, the others can
be found from these formulas (for known values of $\rho$ and $c_V$).
When the coefficients are known, such important characteristics
of a gas as the mean free path (10.5.1) of its molecules and
their effective diameter (8.4.1) can be determined.
Information on transport phenomena is listed in Table 10.1.
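
The use of these relations can be sketched as follows. The example starts from an assumed value of the viscosity and recovers the other two coefficients and the mean free path from $\eta = \rho D$, $K = \eta c_V$ and $\eta = \frac{1}{3} \langle u \rangle \langle \lambda \rangle \rho$. The input numbers are rough, nitrogen-like values chosen only for illustration, and the function names are hypothetical; the relations themselves are those of the elementary kinetic theory given above.

```python
def diffusion_from_viscosity(eta, rho):
    """D from the relation eta = rho D."""
    return eta / rho


def conductivity_from_viscosity(eta, c_v):
    """K from the relation K / (eta c_V) = 1."""
    return eta * c_v


def mean_free_path_from_viscosity(eta, rho, u_mean):
    """<lambda> from eta = (1/3) <u> <lambda> rho."""
    return 3.0 * eta / (rho * u_mean)


ETA = 1.2e-5    # Pa s, assumed viscosity
RHO = 1.16      # kg/m^3, assumed density
C_V = 743.0     # J/(kg K), assumed specific heat at constant volume
U_MEAN = 476.0  # m/s, assumed arithmetic mean thermal speed

D = diffusion_from_viscosity(ETA, RHO)
K = conductivity_from_viscosity(ETA, C_V)
lam = mean_free_path_from_viscosity(ETA, RHO, U_MEAN)
print(D, K, lam)
```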


10.9  Properties  of  Rarified  Gases 

10.9.1 A gas is said to be rarified (a dilute gas) if its density
is so low that the mean free path $\langle \lambda \rangle$ of the molecules is
comparable to the linear dimensions $d$ of the vessel confining the
gas. Such a state of a gas is also called a vacuum. The degrees
of vacuum of gases are classified as ultrahigh ($\langle \lambda \rangle \gg d$), high
($\langle \lambda \rangle > d$), medium ($\langle \lambda \rangle \le d$) and low ($\langle \lambda \rangle \ll d$). The
properties of rarified gases differ from those of nonrarified gases only
for the first three of these degrees of vacuum. This is evident
from Table 10.2, which lists certain characteristics for the
various degrees.

10.9.2 In the state of a high vacuum (10.9.1) a reduction in the
density of a rarified gas leads to a corresponding decrease in
the number of particles without changing $\langle \lambda \rangle$. This,
consequently, reduces the number of carriers of momentum or
internal energy in viscosity and heat conduction. The transport
coefficients in these phenomena are directly proportional to the
gas density (cf. Sects. 10.8.4 and 10.8.5). Internal friction is
essentially absent in highly rarified gases. Instead, only
external friction exists between the moving gas and the walls of
the vessel. This is due to the fact that the molecules change
their momentum only in interacting with the walls. Under
these conditions, as a first approximation, the friction stress is
proportional to the density of the gas and the velocity of its motion
(cf. Sect. 10.8.4). The specific rate of heat flow in highly rarified
gases is proportional to the temperature difference and to the
gas density (cf. Sect. 10.8.5).

TABLE 10.2

| Characteristic | Low vacuum | Medium vacuum | High vacuum | Ultrahigh vacuum |
|---|---|---|---|---|
| Pressure, mmHg | 760 to 1 | 1 to $10^{-3}$ | $10^{-3}$ to $10^{-7}$ | $10^{-8}$ and less |
| Number of molecules in unit volume, m$^{-3}$ | $10^{25}$ to $10^{22}$ | $10^{22}$ to $10^{19}$ | $10^{19}$ to $10^{13}$ | $10^{13}$ and less |
| Dependence of coefficients $K$ and $\eta$ on the pressure | None | Pressure dependence is determined by the parameter $\langle \lambda \rangle / d$ | Directly proportional to pressure | Heat conduction and viscosity are practically absent |

10.9.3 A steady state of a rarified gas, contained in two vessels
connected by a narrow tube, is possible under the condition of
equality of the counterflows of particles passing between the
vessels: $n_1 \langle u_1 \rangle = n_2 \langle u_2 \rangle$, where $n_1$ and $n_2$ are the numbers of
molecules per cm$^3$ in the two vessels, and $\langle u_1 \rangle$ and $\langle u_2 \rangle$ are
their arithmetic mean velocities (10.3.6).

If $T_1$ and $T_2$ are the gas temperatures in the two vessels, the
preceding condition for a steady state can be rewritten in the
form of an equation that expresses the Knudsen effect:

$$\frac{p_1}{p_2} = \sqrt{\frac{T_1}{T_2}},$$

where $p_1$ and $p_2$ are the pressures of the rarified gas in the two
vessels.
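
The step from the equality of the counterflows to the Knudsen relation can be checked numerically. The sketch below assumes the ideal-gas relation $p = nkT$ and the standard expression $\langle u \rangle = \sqrt{8kT/(\pi m)}$ for the mean speed of Sect. 10.3.6; the temperatures, the pressure and the helium-like molecular mass are illustrative values, and the function names are hypothetical.

```python
import math

K_B = 1.380649e-23  # Boltzmann constant, J/K


def knudsen_pressure_ratio(T1, T2):
    """Steady-state ratio p1 / p2 = sqrt(T1 / T2) for a rarified gas."""
    return math.sqrt(T1 / T2)


def particle_flow(p, T, m):
    """Quantity n <u>, with n = p / (k T) and <u> = sqrt(8 k T / (pi m))."""
    n = p / (K_B * T)
    u_mean = math.sqrt(8.0 * K_B * T / (math.pi * m))
    return n * u_mean


T1, T2 = 400.0, 100.0  # K, assumed vessel temperatures
P2 = 1.0e-3            # Pa, assumed pressure in the second vessel
M_MOLECULE = 6.65e-27  # kg, illustrative helium-like molecular mass

ratio = knudsen_pressure_ratio(T1, T2)
P1 = P2 * ratio

# With p1 / p2 = sqrt(T1 / T2) the two counterflows n <u> are equal.
flow_1 = particle_flow(P1, T1, M_MOLECULE)
flow_2 = particle_flow(P2, T2, M_MOLECULE)
print(ratio, flow_1, flow_2)
```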
CHAPTER 11 SECOND LAW OF THERMODYNAMICS

11.1  Cycles.  The  Carnot  Cycle 

11.1.1 A cycle is a succession of thermodynamic processes
(8.3.7) as a result of which the system returns to its initial state.
In pV, pT and other thermodynamic diagrams, equilibrium
cycles (8.3.7) are represented in the form of closed curves,
because two identical states, the beginning and end of the cycle,
correspond to the same point of the diagram.

A thermodynamic system that accomplishes a cycle and exchanges
energy with other bodies is called the working medium, or
substance. This medium is usually a gas.

11.1.2 The arbitrary equilibrium cycle $C_1 a C_2 b C_1$ (Fig. 11.1),
using an ideal gas, can be divided into the process of expansion
of the gas from state $C_1$ to state $C_2$ (curve $C_1 a C_2$) and the process
of compressing the gas from state $C_2$ to state $C_1$ (curve $C_2 b C_1$).
In expansion the gas does positive work $W_1$, measured by the
area under the curve $C_1 a C_2$ (the area of the figure $V_1 C_1 a C_2 V_2$;
see Sect. 9.4.2). The gas is compressed by external forces that
do positive work $W_2' = -W_2$, measured by the area under
the curve $C_1 b C_2$ (the area of the figure $V_1 C_1 b C_2 V_2$). Since
$W_1 > W_2'$ (Fig. 11.1), the gas does the positive work
$W = W_1 + W_2 = W_1 - W_2'$ in the cycle. This work is measured by
the hatched area, bounded by the curve $C_1 a C_2 b C_1$ of the
process.

11.1.3 A direct cycle is one in which the system does positive
work $W = \oint p\, dV > 0$. In a pV diagram a direct cycle is
represented by a closed curve along which the working medium
passes in the clockwise direction (Fig. 11.1). An example of a
direct cycle is that accomplished by the working medium in a
heat engine. In such an engine the working medium (11.1.1)
receives energy in the form of heat (9.2.1) from external sources
and gives a part of it up in the form of work (9.2.1) (see Sect.
11.1.5).

Fig. 11.1

11.1.4 A reverse cycle is one in which the system does negative
work $W = \oint p\, dV < 0$. In a pV diagram a reverse cycle is
represented by a closed curve along which the working medium
passes in the counterclockwise direction. An example of a
reverse cycle is that accomplished by the working medium in a
refrigerator. In such a device the working medium receives
energy in the form of work and transmits energy in the form of
heat from a cold body to a hotter one (see Sect. 11.1.5).
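
The connection between the direction in which a closed curve is traversed and the sign of $\oint p\, dV$ can be illustrated with a short calculation. The sketch below approximates the integral by the trapezoidal rule over a closed polygon in the pV plane; the rectangular cycle and its numbers are arbitrary example values, and the function name is hypothetical.

```python
def cycle_work(path):
    """Trapezoidal estimate of the closed-path integral of p dV.

    `path` is a list of (V, p) vertices; the last vertex is joined
    back to the first one to close the cycle.
    """
    work = 0.0
    for (v_a, p_a), (v_b, p_b) in zip(path, path[1:] + path[:1]):
        work += 0.5 * (p_a + p_b) * (v_b - v_a)
    return work


# Rectangle traversed clockwise in the pV diagram (V horizontal, p vertical):
# expansion at the higher pressure, compression at the lower pressure.
clockwise = [(1.0, 2.0), (2.0, 2.0), (2.0, 1.0), (1.0, 1.0)]

w_direct = cycle_work(clockwise)         # direct cycle
w_reverse = cycle_work(clockwise[::-1])  # same curve, counterclockwise
print(w_direct, w_reverse)
```

The magnitude of both results equals the area enclosed by the curve; only the sign depends on the direction of traversal.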

11.1.5 Since the total change in the internal energy of a gas
in a cycle is equal to zero (9.1.3), the first law of
thermodynamics (9.3.1) for such a process is of the form

$$Q = \Delta U + W = W,$$

where $Q$ is the total quantity of heat delivered to the gas in the
cycle, and $W$ is the work done by the gas in this cycle.

In a direct cycle (11.1.3) $Q > 0$ and $W > 0$, and the gas does
work owing to the heat delivered to it. In a reverse cycle (11.1.4)
the work done on the gas is $W' = -W$ (where $W' > 0$) and a
quantity of heat equivalent to this work is removed from the gas.

Fig. 11.2

11.1.6 The Carnot cycle is the one shown in Fig. 11.2. The
direct Carnot cycle consists of four consecutive processes:
isothermal expansion 1-1' at the temperature $T_1$ (where $T_{1'} = T_1$),
adiabatic expansion 1'-2, isothermal compression 2-2' at
the temperature $T_2$ (where $T_{2'} = T_2$), and adiabatic
compression 2'-1.

11.1.7 In practice a direct Carnot cycle could be carried out by
means of a gas enclosed in a vessel with a movable piston. In
the process 1-1' the gas is in thermal contact with a heat
source (heater) at the constant temperature $T_1$*. From the heater
the gas receives a certain quantity of heat $Q_1$ (where $Q_1 > 0$).
It is assumed that this does not change the temperature of the
heater. This is possible, strictly speaking, only if the heater
has an infinite heat capacity. In the process 1'-2 the gas is
thermally isolated and its expansion is adiabatic** (9.5.10).
In the process 2-2' the gas is brought into thermal contact
with a cooler (heat sink) having the constant temperature $T_2$
(where $T_2 < T_1$). Here the gas is isothermally compressed
and gives off the quantity of heat $-Q_2$ to the cooler (assuming
that $Q_2$ is the quantity of heat received by the gas from the
cooler). The heat capacity of the cooler is assumed to be infinitely
high. In the state 2' the gas is again thermally insulated and
is compressed adiabatically to its initial state 1.

11.1.8 The work done by the gas in an equilibrium (8.3.7)
direct Carnot cycle is

$$W = Q = Q_1 + Q_2 = Q_1 - |Q_2|.$$

It is evident from the equation that $W < Q_1$, i.e. the work done
by the working medium (11.1.1) in the Carnot cycle is less
than the energy received from the heater by the amount of
energy given up to the cooler in the form of heat. This is valid
for an arbitrary cycle: the work $W$ done in the cycle is always
less than the sum $Q_{del}$ of all the quantities of heat delivered to
the working medium from heaters.

The operation of heat engines is based on the direct Carnot
cycle.

11.1.9 The thermal efficiency $\eta$ is the ratio of the work $W$ done
by the working medium in a direct cycle to the sum $Q_{del}$ of all
quantities of heat delivered in this cycle to the working medium
by heaters:

$$\eta = \frac{W}{Q_{del}}.$$

The quantity $\eta$ characterizes the economy of operation of a heat
engine.


*  For  instance,  a  large  tank  filled  with  water. 

**  For  instance,  the  cylinder  with  the  gas  is  covered  with 
a  thick  layer  of  felt. 


11.1.10 The thermal efficiency of a direct equilibrium Carnot
cycle carried out with an ideal gas is

$$\eta_C = \frac{Q_1 + Q_2}{Q_1} = \frac{T_1 - T_2}{T_1} = 1 - \frac{T_2}{T_1}.$$

The value of $\eta_C$ depends only on the ratio of the temperatures
of the cooler ($T_2$) and the heater ($T_1$).
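
The equality of the two expressions for $\eta_C$ can be verified by computing the heats $Q_1$ and $Q_2$ directly for an ideal gas. The sketch below assumes the standard results for an ideal gas: the heat received in an isothermal process is $nRT \ln(V_{end}/V_{start})$, and along an adiabat $TV^{\gamma - 1}$ is constant. The amount of gas, the temperatures, the volumes and the adiabatic exponent are illustrative values, and the function name is hypothetical.

```python
import math

R = 8.314462618  # universal gas constant, J/(mol K)


def carnot_heats(n_mol, T1, T2, V1, V1p, gamma):
    """Heats Q1 and Q2 received by an ideal gas in a direct Carnot cycle.

    States: 1 -> 1' isothermal expansion at T1, 1' -> 2 adiabatic expansion,
    2 -> 2' isothermal compression at T2, 2' -> 1 adiabatic compression.
    """
    # On an adiabat T V^(gamma - 1) = const, hence V2 / V1' = V2' / V1.
    ratio = (T1 / T2) ** (1.0 / (gamma - 1.0))
    V2 = V1p * ratio
    V2p = V1 * ratio
    Q1 = n_mol * R * T1 * math.log(V1p / V1)  # received from the heater, > 0
    Q2 = n_mol * R * T2 * math.log(V2p / V2)  # received from the cooler, < 0
    return Q1, Q2


T_HEATER, T_COOLER = 500.0, 300.0  # K, assumed temperatures
Q1, Q2 = carnot_heats(1.0, T_HEATER, T_COOLER, 1.0e-3, 2.0e-3, 5.0 / 3.0)

eta_from_heats = (Q1 + Q2) / Q1
eta_from_temperatures = (T_HEATER - T_COOLER) / T_HEATER
print(eta_from_heats, eta_from_temperatures)
```

Changing the volumes or the adiabatic exponent changes $Q_1$ and $Q_2$ but not their ratio, so the efficiency depends only on the two temperatures.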

11.1.11 The reverse Carnot cycle is shown in Fig. 11.3. In
isothermal compression, carried out in the process 1'-1, the
quantity of heat $Q_1$ is removed from the gas at the temperature $T_1$,
which remains constant. In the process of isothermal expansion
2'-2 at the temperature $T_2 < T_1$, the quantity of heat $Q_2$ is
delivered to the gas. In the reverse Carnot cycle $Q_1 < 0$,
$Q_2 > 0$ and the work $W$ done by the gas in one cycle is
negative: $W = (Q_1 + Q_2) < 0$. This conclusion is valid for any
reverse cycle. If the working medium carries out a reverse cycle,
energy can be transferred in the form of heat (9.2.1) from a cold
body to a hotter one at the expense of the corresponding work
done by external forces. This principle is used in all
refrigerating devices. The less the work $W' = -W$ done by external
forces to extract the quantity of heat $Q_2$ from a cold body, the
higher the economical efficiency (coefficient of performance) of
a refrigerator:

$$\frac{Q_2}{W'} = \frac{Q_2}{-(Q_1 + Q_2)} = \frac{1 - \eta}{\eta},$$

where $\eta$ is the thermal efficiency of the direct cycle between the
same temperatures $T_1$ and $T_2$ (11.1.3 and 11.1.4).

Fig. 11.3
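
For a reverse Carnot cycle the ratio $Q_2/W'$ equals $T_2/(T_1 - T_2)$, which can be compared with the thermal efficiency $(T_1 - T_2)/T_1$ of the direct Carnot cycle between the same temperatures. The sketch below is an illustration with assumed temperatures and an assumed quantity of heat; the function names are hypothetical.

```python
def carnot_efficiency(T1, T2):
    """Thermal efficiency of the direct Carnot cycle, (T1 - T2) / T1."""
    return (T1 - T2) / T1


def refrigerator_cop(T1, T2):
    """Coefficient of performance Q2 / W' of the reverse Carnot cycle."""
    return T2 / (T1 - T2)


def work_required(Q2, T1, T2):
    """Work W' of external forces needed to extract the heat Q2."""
    return Q2 / refrigerator_cop(T1, T2)


T_HOT, T_COLD = 300.0, 270.0  # K, assumed temperatures of the two bodies

eta = carnot_efficiency(T_HOT, T_COLD)
cop = refrigerator_cop(T_HOT, T_COLD)
work = work_required(900.0, T_HOT, T_COLD)  # J, for Q2 = 900 J
print(eta, cop, work)
```

The closer the two temperatures, the smaller the efficiency of the direct cycle and the larger the coefficient of performance of the reverse one.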
11.2 Reversible and Irreversible Processes

11.2.1  A  thermodynamic  process  is  said  to  be  reversible  (rever¬ 
sible  process)  if  the  thermodynamic  system  (8.3.1),  in  carrying 
out  the  process  first  in  the  direct  and  then  in  the  exactly  re¬ 
versed  order,  returns,  together  with  all  the  external  bodies  with 
which  it  interacts,  to  its  initial  state.  In  other  words,  in  a  rever¬ 
sible  process  the  thermodynamic  system  returns  to  its  initial 
state  without  leaving  any  changes  in  the  surrounding  medium. 
A  necessary  and  sufficient  condition  for  the  reversibility  of  a 
thermodynamic  process  is  the  equilibrium  (8.3.3)  of  all  the 
consecutive  states  in  the  process.  Here  all  corresponding  states 
of  the  thermodynamic  system  in  the  direct  and  reverse  pro¬ 
cesses  are  absolutely  identical. 

Example 1. The mechanical motion of a body in a vacuum in the
complete  absence  of  friction  is  a  reversible  process.  Assume, 
for  instance,  that  a  body  under  these  conditions  is  thrown  with  a 
certain  initial  velocity  into  a  gravitational  field  (6.2.1)  at  a 
definite  angle  to  the  horizon.  After  describing  a  parabolic  path 
the  body  drops  to  the  earth  at  a  certain  place.  Now,  if  the  body 
is  thrown  from  this  place  at  the  same  angle  and  the  same  initial 
velocity,  but  in  the  opposite  direction,  it  describes  the  same 
path  in  the  opposite  direction  and  drops  to  the  earth  at  the 
initial  point.  Any  intermediate  states  of  the  moving  body  in 
the  forward  and  reverse  motions  are  entirely  identical.  The 
reversibility of mechanical motions indicates their symmetry
with respect to the interchange of the future and the past,
i.e. with respect to the reversal of the sign of the time. The
reversibility of mechanical motions follows from the differential
equations  of  motion  (2.4.4).  When  the  sign  of  the  time  is 
changed,  this  changes  the  sign  of  the  velocity  as  well,  but  the 
acceleration,  in  the  motion  equations,  retains  its  sign. 
Example  2.  Another  reversible  process  is  the  undamped  vibra¬ 
tions  of  a  body  suspended  in  a  vacuum  on  a  perfectly  elastic 
spring  (28.2.3).  The  body-spring  system  is  a  conservative  one 
(3.1.7).  Its  mechanical  vibrations  do  not  change  the  energy  of 
thermal  random  motion  of  its  particles.  Only  changes  in  the 
configuration  and  velocity  of  the  system  lead  to  a  change  of  its 
state. But these changes are completely repeated after each
period  T  (28.1.2)  of  vibration.  Hence,  the  conditions  for  the 
reversibility  (11.2.1)  of  the  process  are  complied  with. 

11.2.2 Any process that does not comply with the conditions
for reversibility is said to be irreversible (irreversible process).

Example 1. The direct process of braking a moving body by
means  of  friction  forces  is  an  irreversible  process.  If  these  forces 
are  the  only  ones  acting  on  the  body,  its  velocity  is  reduced 
and  it  finally  stops.  The  energy  of  mechanical  motion  of  the 
body  as  a  whole  is  reduced  and  is  expended  in  increasing  the 
energy  of  random  motion  of  particles  of  the  body  and  the  sur¬ 
rounding  medium.  The  internal  energy  (9.1.2)  of  the  body  and 
the  medium  is  increased;  they  are  heated  by  the  action  of  the 
friction  forces.  The  direct  process  dealt  with  here  proceeds  spon¬ 
taneously;  it  is  carried  out  without  any  processes  taking  place 
in  surrounding  bodies.  For  the  reverse  process  to  occur,  return¬ 
ing  the  system  to  its  initial  state,  it  would  be  necessary  for 
the  braked  and  stopped  body  to  start  moving  again  by  cooling 
it  and  the  surrounding  medium.  As  is  evident  from  experiments, 
the  thermal  random  motion  of  particles  of  a  body  cannot  spon¬ 
taneously  lead  to  the  initiation  of  ordered  motion  of  all  the 
particles  of  the  body  as  a  whole.  To  accomplish  such  motion  it 
is  necessary  to  provide  for  an  additional  compensating  process  in 
which  the  body  and  surrounding  medium  are  cooled  to  the 
initial temperature. In this process, the quantity of heat $Q$ is
given up to the cooler and the work $W' = Q$ is performed on
the body.

Thus,  the  consecutive  performance  of  such  direct  and  reverse 
processes  returns  the  body-medium  system  to  its  initial  state, 
but  the  states  of  external  bodies  are  changed.  Hence,  all  pro¬ 
cesses  accompanied  by  friction  are  irreversible  ones. 

Example  2.  The  direct  process  of  heat  exchange  (9.2.4)  between 
two  bodies  having  different  temperatures  is  a  spontaneous  one. 
The  reverse  process,  heating  one  body  by  cooling  another  that 
initially  has  the  same  temperature  as  the  first  body,  cannot 
proceed  spontaneously.  To  accomplish  this  process,  a  refrigerat¬ 
ing  device  (11.1.11)  is  required.  The  process  of  heat  exchange 
at  a  finite  temperature  difference  is  an  irreversible  one. 

11.3  Second  Law  of  Thermodynamics 

11.3.1  The  reference  in  the  preceding  section  to  experiments 
that  prove  the  irreversibility  of  the  processes  of  heat  exchange 
and  motion  with  friction  (11.2.2)  was  no  chance  one.  The  first 
law of thermodynamics (9.3.1) cannot describe thermodynamic
processes comprehensively. An essential limitation of the first
law  is  that  it  cannot  be  used  to  predict  in  which  direction  a 
thermodynamic  process  will  proceed.  Any  process  that  does 
not  violate  the  law  of  conservation  of  energy  is  feasible  from  the 
point  of  view  of  the  first  law  of  thermodynamics.  Among  others, 
the  spontaneous  transfer  of  energy  in  the  form  of  heat  from  a 
cooler  body  to  a  hotter  body  is  also  possible.  Also  feasible  is  a 
process  whose  only  result  would  be  the  obtaining  of  a  certain 
quantity  of  heat  from  a  body  and  the  conversion  of  this  heat 
into  an  equivalent  amount  of  work.  A  periodic-action  device, 
based  on  the  first  law  of  thermodynamics,  which  performs  work 
by  cooling  a  single  source  of  heat  (for  instance,  the  internal 
energy  of  large  bodies  of  water),  is  called  a  perpetual  motion 
machine  of  the  second  kind. 

11.3.2  The  second  law  of  thermodynamics  is  the  experimentally 
established  statement  of  the  impossibility  of  building  a  perpe¬ 
tual  motion  machine  of  the  second  kind  (11.3.1).  The  two  most 
common  statements  of  the  second  law,  equivalent  to  each 
other,  are: 

(a)  no  process  is  possible  whose  sole  result  is  the  conversion  of 
all  the  heat  received  from  a  certain  body  into  an  equivalent 
amount  of  work; 

(b)  no  process  is  possible  whose  sole  result  is  the  transfer  of 
energy  in  the  form  of  heat  from  a  cold  body  to  a  hotter  one. 

11.3.3  It  follows  from  the  second  law  of  thermodynamics  that 
work  and  heat  are  not  equivalent  forms  of  transferring  energy. 
The  conversion  of  orderly  motion  of  a  body  as  a  whole  into  the 
random  motion  of  its  particles  is  an  irreversible  process,  pro¬ 
ceeding  without  compensating  processes  (11.2.2).  The  conver¬ 
sion  of  random  motion  of  the  particles  of  a  body  into  ordered 
motion  of  the  body  as  a  whole  requires  the  provision  of  some 
simultaneous  compensating  process*. 

Example 1. In the isothermal expansion of an ideal gas work
is done that is fully equivalent to the quantity of heat delivered
to the gas (9.5.9). The heat received by the gas is completely
converted into an equivalent amount of work. But the gas
does not return to its initial state. It expands and its specific
volume is increased. The conversion of heat into work* is not
the sole result of the isothermal expansion of an ideal gas.


* Such a process is sometimes, not quite correctly, called the
“conversion of heat into work”.

Example 2. In a heat engine based on the direct Carnot cycle
(11.1.8), work is done owing to the heat delivered from the
heater. But a part of the quantity of heat obtained is given up to
the cooler (11.1.7). Hence, the work done in a cycle is not
equivalent to the whole quantity of heat delivered to the working
medium.

Example  3.  In  refrigerating  devices  operating  on  the  reverse 
Carnot  cycle  (11.1.11),  a  certain  quantity  of  heat  is  transferred 
from  a  cold  body  to  a  hotter  one.  But  external  forces  do  work  to 
accomplish  this  and,  consequently,  a  compensating  process  is 
being  carried  out. 

11.3.4 The Carnot theorem states that the thermal efficiency
(11.1.9) of the reversible Carnot cycle does not depend upon
the composition of the working medium and is always
expressed by the formula (11.1.10):

$$\eta_C = \frac{Q_1 + Q_2}{Q_1} = \frac{T_1 - T_2}{T_1}.$$

The thermal efficiency $\eta_{C\,ir}$ of the irreversible Carnot cycle is
always less than the thermal efficiency $\eta_{C\,rev}$ of the reversible
Carnot cycle, carried out at the same temperatures $T_1$ and $T_2$:
$\eta_{C\,ir} < \eta_{C\,rev}$.

The thermal efficiency of any reversible cycle cannot be higher
than that, $\eta_C$, of a reversible Carnot cycle being accomplished
by means of a heater and cooler having the temperatures $T_1$
and $T_2$:

$$\eta_{rev} \le \eta_C = \frac{T_1 - T_2}{T_1}.$$

11.3.5 The Carnot theorem (11.3.4) is used to establish the
thermodynamic temperature scale. It follows from the first
equation in the preceding subsection that $T_2/T_1 = -Q_2/Q_1$ or, since
$Q_2 < 0$,

$$\frac{T_2}{T_1} = \frac{|Q_2|}{Q_1}.$$

In order to compare the temperatures $T_1$ and $T_2$ of two bodies
it is necessary to carry out a reversible Carnot cycle in which
these bodies are the heater and cooler. Then, according to the
ratio of the numerical values of the quantities of heat delivered
to and received from the bodies, the ratio of their temperatures
is determined. The result of such a comparison of temperatures
does not depend upon the chemical composition of the working
medium (11.3.4). Therefore, the thermodynamic temperature
scale is independent of the properties of the
temperature-indicating substance (8.3.4) and, in this respect, has great
generality. However, since all real thermodynamic processes are
irreversible, the comparison of temperatures by the
above-mentioned procedure is unfeasible; it has only a theoretical
significance.


11.4  Entropy  and  Free  Energy 


11.4.1  The  ratio  of  the  quantity  of  heat  Q,  received  by  a  body 
in  an  isothermal  process,  to  the  temperature  T  of  the  body 
giving  up  the  heat  is  called  the  reduced  quantity  of  heat  Q*,  i.e. 
Q*  =  Q/T.  In  heating  a  body  (Q  >  0),  Q*  is  positive;  in  cool¬ 
ing  ( Q  <  0),  Q*  is  negative. 

The reduced quantity of heat delivered to a body in an
infinitely small portion of an arbitrary process is equal to $\delta Q/T$,
where $T$ is the temperature of the corresponding heater*. The
reduced quantity of heat $Q^*_{1 \to 2}$ for the arbitrary portion 1 → 2 of
the process $C_1 C_2$ is

$$Q^*_{1 \to 2} = \int_1^2 \frac{\delta Q}{T}.$$

11.4.2 The reduced quantity of heat $Q^*_{rev}$ delivered to a body
in any reversible cycle equals zero:

$$Q^*_{rev} = \oint \left( \frac{\delta Q}{T} \right)_{rev} = 0.$$

Here $T$ is the temperature at which an element of the quantity
of heat is delivered to the body. It follows from the preceding
equation that the function $(\delta Q/T)_{rev}$ is a complete
differential of a certain function $S$ [in contrast to $\delta Q$,
which is not a complete differential (9.4.3)]:

$$dS = \left( \frac{\delta Q}{T} \right)_{rev}.$$

* In the case of a reversible process, $T$ coincides with the
temperature of the body that is carrying out the process.

The single-valued state function (9.1.3) $S$, whose complete
differential is determined by the preceding equation, is called the
entropy of a body. It is evident from this equation that $dS$ and
$\delta Q$ have the same sign. Consequently, from the way in which
the entropy varies it is possible to determine in which direction
heat exchange (9.2.4) takes place. In heating a body ($\delta Q > 0$)
its entropy increases ($dS > 0$). If a body is cooled ($\delta Q < 0$),
its entropy decreases ($dS < 0$).

Example. The complete differential of the entropy of an ideal
gas is expressed by the equation

$$dS = \left( \frac{\delta Q}{T} \right)_{rev} = \frac{M}{\mu} C_{V\mu} \frac{dT}{T} + \frac{M}{\mu} R \frac{dV}{V},$$

where $M$ is the mass of the gas, $\mu$ is its molar mass (8.4.3),
$C_{V\mu}$ is its molar heat capacity at constant volume (9.5.4), $R$
is the universal gas constant (8.4.4), $T$ is its temperature,
and $V$ is its volume. This result is obtained on the basis of the
first law of thermodynamics (9.3.1) for $\delta Q$, with the
Mendeleev-Clapeyron equation (8.4.4) taken into account.

The change $\Delta S_{1 \to 2}$ in the entropy of an ideal gas going over
from state 1 to state 2 does not depend upon the kind of
process 1 → 2 used in this transition:

$$\Delta S_{1 \to 2} = S_2 - S_1 = \frac{M}{\mu} \left( C_{V\mu} \ln \frac{T_2}{T_1} + R \ln \frac{V_2}{V_1} \right).$$
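
The independence of $\Delta S_{1 \to 2}$ from the kind of process can be illustrated numerically by integrating $dS$ along two different paths between the same states and comparing the results with the closed formula. In the sketch below the amount of gas is given directly in moles (i.e. as $M/\mu$); the monatomic value $C_{V\mu} = \frac{3}{2} R$, the end states and the two paths are assumptions made for the example, and the function names are hypothetical.

```python
import math

R = 8.314462618  # universal gas constant, J/(mol K)


def entropy_change(n_mol, c_v_molar, T1, V1, T2, V2):
    """Closed formula: n (C_V ln(T2/T1) + R ln(V2/V1)), with n = M / mu."""
    return n_mol * (c_v_molar * math.log(T2 / T1) + R * math.log(V2 / V1))


def integrate_dS(n_mol, c_v_molar, path, steps=5000):
    """Midpoint-rule integral of dS = n (C_V dT/T + R dV/V) along a path.

    `path(s)` returns the state (T, V) for the parameter s in [0, 1].
    """
    total = 0.0
    for i in range(steps):
        T_a, V_a = path(i / steps)
        T_b, V_b = path((i + 1) / steps)
        T_mid, V_mid = 0.5 * (T_a + T_b), 0.5 * (V_a + V_b)
        total += n_mol * (c_v_molar * (T_b - T_a) / T_mid
                          + R * (V_b - V_a) / V_mid)
    return total


N_MOL, C_V = 1.0, 1.5 * R  # assumed: one mole of a monatomic ideal gas
T_1, V_1, T_2, V_2 = 300.0, 1.0e-3, 450.0, 3.0e-3  # assumed end states


def straight(s):
    """Straight line between the two states in the (T, V) plane."""
    return T_1 + s * (T_2 - T_1), V_1 + s * (V_2 - V_1)


def curved(s):
    """A different, curved path between the same two states."""
    return T_1 + s**2 * (T_2 - T_1), V_1 + s**3 * (V_2 - V_1)


dS_formula = entropy_change(N_MOL, C_V, T_1, V_1, T_2, V_2)
dS_straight = integrate_dS(N_MOL, C_V, straight)
dS_curved = integrate_dS(N_MOL, C_V, curved)
print(dS_formula, dS_straight, dS_curved)
```

The three values agree to within the accuracy of the numerical integration, as expected for a state function.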

11.4.3  Certain  of  the  most  important  properties  of  the  entropy 
of  closed  systems  (2.2.4)  are: 

(a)  The  entropy  of  a  closed  system  in  a  reversible  Carnot  cycle 
(11.1.6)  does  not  change: 

$$\Delta S_{rev} = 0 \quad \text{and} \quad S = \text{const}.$$

(b) The entropy of a closed system increases in an irreversible
Carnot cycle:

$$\Delta S_{ir} > 0.$$

(c) In any processes that occur in a closed system its entropy
never decreases:

$$\Delta S \ge 0.$$

In an element of change in the state of a closed system, the
entropy does not decrease: $dS \ge 0$.

The  equality  sign  pertains  to  reversible  processes  and  the  greater- 
than  sign  to  irreversible  ones.  Item  (c)  is  one  of  the  statements 
of  the  second  law  of  thermodynamics. 

11.4.4 The following relation is valid for an arbitrary process
occurring in a thermodynamic system (8.3.1):

$$\delta Q \le T\, dS,$$

where $T$ is the temperature of the body that transfers the energy
$\delta Q$ to the thermodynamic system in the process of an infinitely
small change in the state of the system. Making use of the
equation of the first law of thermodynamics (9.3.1) for $\delta Q$, the
preceding inequality can be rewritten in the form

$$T\, dS \ge dU + \delta W,$$

combining the first and second laws.

11.4.5 For a reversible process

\[ \delta W = -(dU - T\,dS) \]

or

\[ \delta W = -d(U - TS) - S\,dT = -dF - S\,dT, \]

where

\[ F = U - TS \]

is called the free energy. The free, or available, energy is the
difference of two state functions (8.3.8) and is therefore also a
state function of the thermodynamic system.

If the system carries out a reversible isothermal process, dT = 0
and δW_isot = −dF. When the system passes from state 1
to state 2 in a reversible isothermal process,

\[ W_{\text{isot}} = F_1 - F_2. \]

The decrease in free energy is the measure of the work done by a
system (or body) in a reversible isothermal process.

11.4.6 It follows from the equation U = F + TS that the
internal energy of a body (or system) is equal to the sum of the
free energy F and the bound energy TS. The bound, or unavailable,
energy is the part of the internal energy of a body (or
system) that cannot be transferred in the form of work in an
isothermal process. In this sense, this part of the internal energy
is “depreciated”. The greater the entropy of the body (or system),
the greater the bound energy. Therefore, the entropy of a body
(or system) serves as a measure of the “depreciation” of its
energy.
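
The relation between isothermal work and free energy (11.4.5) can be illustrated for the simplest case. The sketch below assumes an ideal gas, whose internal energy does not change at constant temperature; the function names and the numerical data are hypothetical.

```python
import math

R = 8.314  # universal gas constant, J/(mol*K)


def isothermal_work(n_moles, temperature, v1, v2):
    """Work (J) done by an ideal gas in reversible isothermal expansion V1 -> V2."""
    return n_moles * R * temperature * math.log(v2 / v1)


def free_energy_decrease(n_moles, temperature, v1, v2):
    """F1 - F2 (J) for the same process, computed from F = U - TS."""
    delta_u = 0.0  # internal energy of an ideal gas is unchanged at constant T
    delta_s = n_moles * R * math.log(v2 / v1)  # entropy change at constant T
    delta_f = delta_u - temperature * delta_s
    return -delta_f


# Hypothetical data: 2 mol expanding from 10 L to 25 L at 300 K.
work = isothermal_work(2.0, 300.0, 0.010, 0.025)
drop = free_energy_decrease(2.0, 300.0, 0.010, 0.025)
print(f"W_isot = {work:.1f} J, F1 - F2 = {drop:.1f} J")
```

In this special case the whole decrease in F comes from the growth of the bound energy TS, since U stays constant.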


11.5  Statistical  Interpretation  of  the  Second  Law 
of  Thermodynamics 

11.5.1  The  statement  of  the  second  law  of  thermodynamics  on 
the  impossibility  of  a  decrease  in  the  entropy  in  an  isolated 
system  (11.4.3)  can  be  interpreted  statistically,  on  the  basis 
of  the  molecular-kinetic  theory  of  the  structure  of  matter,  by 
means  of  Boltzmann’s  relation: 

\[ S = k \ln P + \text{const}, \]

where  S  is  the  entropy  of  the  system,  k  is  Boltzmann’s  constant 
(8.4.5),  and  P  is  the  thermodynamic  probability  of  the  state. 

11.5.2 The thermodynamic probability P of a state of a body (or
system) is equal to the number of all possible kinds of coordinate
and velocity distributions of its particles corresponding to
the given thermodynamic state (8.3.3). By definition, P is a
whole number not less than unity (P ≥ 1). The Boltzmann relation
(11.5.1) leads to the following statistical interpretation of
the second law of thermodynamics: the thermodynamic probability
of a state of a closed system cannot decrease, regardless
of the processes that occur in it. In any process that proceeds
in a closed system and brings it from state 1 to state 2, the
change ΔP in the thermodynamic probability is either positive
or zero: ΔP = P_2 − P_1 ≥ 0.

In a reversible process ΔP = 0, i.e. the thermodynamic probability
P is constant. If the process is irreversible, ΔP > 0
and P increases. This means that an irreversible process transfers
the system from a less probable state to a more probable one
and, in the limit, to an equilibrium state (8.3.3).

11.5.3 Being a statistical law, the second law of thermodynamics
describes the laws of random motion of a great number of
particles making up a closed system. In systems that consist of
a relatively small number of particles, fluctuations (11.6.1) are
observed. They are deviations from the second law of thermodynamics.
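
A toy count makes the statistical picture concrete. The sketch below is a deliberately simplified model, not the handbook's definition: it counts only the ways of distributing N labelled particles between the two halves of a vessel (velocities are ignored), and all function names are hypothetical. It shows that nearly even distributions dominate as N grows, while for small N sizeable deviations remain likely.

```python
import math

K_B = 1.380649e-23  # Boltzmann's constant, J/K


def multiplicity(n_total, n_left):
    """Number of ways to place n_left of n_total labelled particles in the left half."""
    return math.comb(n_total, n_left)


def entropy_difference(p2, p1):
    """S2 - S1 = k ln(P2 / P1); the additive constant cancels."""
    return K_B * (math.log(p2) - math.log(p1))


def fraction_near_even(n_total):
    """Share of all 2**N arrangements with the left-half count within N/10 of N/2."""
    favourable = sum(
        math.comb(n_total, k)
        for k in range(n_total + 1)
        if 5 * abs(2 * k - n_total) <= n_total
    )
    return favourable / 2 ** n_total


for n in (10, 100, 1000):
    print(n, fraction_near_even(n))
```

For N = 10 roughly a third of the arrangements lie outside the near-even band, whereas for N = 1000 practically none do, which is why fluctuations matter only for small systems.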

11.5.4  The  second  law  of  thermodynamics,  established  for 
closed  systems  on  the  earth,  cannot  be  applied  to  the  whole  infi¬ 
nite  universe.  Such  an  application  leads  to  the  conclusion  that 
the  temperatures  of  all  bodies  in  the  universe  equalize.  This  is 
incorrect  from  both  the  philosophical  and  physical  points  of 
view.  According  to  this  erroneous  conclusion,  all  forms  of 
motion,  except  random  thermal  motion,  will  cease  and  the 
universe  will  then  reach  a  state  of  so-called  “heat  death”.  Actu¬ 
ally,  owing  to  the  infinity  of  the  universe,  fluctuations  (11.6.1) 
are  inevitable  in  certain  of  its  parts.  These  violate  thermal 
equilibrium.  The  duration  and  magnitude  of  the  fluctuations 
can  be  extremely  great.  It  has  been  proven  that  there  can  be 
no  equilibrium  state,  corresponding  to  “heat  death”,  for  the 
infinite  universe. 

11.6  Fluctuations 

11.6.1  Substantial  deviations  of  certain  physical  quantities, 
characterizing  a  system,  from  their  average  values  may  occur 
in  systems  consisting  of  a  comparatively  small  number  of  par¬ 
ticles.  Such  deviations  are  called  fluctuations  of  physical  quan¬ 
tities.  For  instance,  in  highly  rarified  gases  the  density  at  var¬ 
ious  places  in  the  volume  of  the  gas  may  differ  from  the  aver¬ 
age  density  corresponding  to  the  equilibrium  state  for  definite 
p  and  T  values.  Chance  deviations  of  the  temperature  T,  pres¬ 
sure  p  and  other  physical  quantities  may  occur  in  exactly  the 
same  way. 

11.6.2 If M is the true value of a physical quantity, and ⟨M⟩
is its average value, then neither the quantity ΔM = M − ⟨M⟩
nor its average value ⟨ΔM⟩ = ⟨M − ⟨M⟩⟩ can serve as a measure
of the fluctuation of the quantity M. The quantity ΔM is not
constant with time, and the quantity

\[ \langle \Delta M \rangle = \langle M \rangle - \langle M \rangle = 0.^{*} \]

* Used here is the statement that the mean value of a constant
quantity (here ⟨M⟩) coincides with the quantity itself.


This  last  equation  follows  from  the  fact  that  the  deviation  of 
the  quantity  M  from  (M)  occurs  in  both  directions,  towards 
values  larger  than  the  average  and  towards  values  smaller  than 
the  average,  with  equal  frequency. 

11.6.3 A measure of the fluctuations of the physical quantity
M is the mean square of the difference ΔM. It is called the
mean square fluctuation:

\[ \langle (\Delta M)^2 \rangle = \langle (M - \langle M \rangle)^2 \rangle = \langle M^2 \rangle - \langle M \rangle^2.^{*} \]

The mean square fluctuation is essentially positive or zero:

\[ \langle (\Delta M)^2 \rangle \ge 0. \]

The absolute fluctuation is the quantity √⟨(ΔM)²⟩, which also
characterizes the deviation of M from ⟨M⟩. A small absolute
fluctuation signifies that large deviations of M from ⟨M⟩ occur
very rarely.

The relative fluctuation δ_M is the ratio of the absolute fluctuation
to the average value ⟨M⟩ of a physical quantity:

\[ \delta_M = \frac{\sqrt{\langle (\Delta M)^2 \rangle}}{\langle M \rangle}. \]

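The relative fluctuations in the concentration of particles (or
density) of a gas, and in its pressure and temperature, decrease
with an increase in the number N of gas molecules contained in
the vessel:

\[ \delta_\rho = \frac{\sqrt{\langle (\Delta \rho)^2 \rangle}}{\langle \rho \rangle} \approx \frac{1}{\sqrt{N}}, \]

\[ \delta_T = \frac{\sqrt{\langle (\Delta T)^2 \rangle}}{\langle T \rangle} \approx \frac{1}{\sqrt{N}}, \]

\[ \delta_p = \frac{\sqrt{\langle (\Delta p)^2 \rangle}}{\langle p \rangle} \approx \frac{1}{\sqrt{N}}. \]

When N = N_A (Avogadro’s number, see Appendix II), δ_ρ, δ_p
and δ_T have values of the order of 10⁻¹².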


* This last equation, following from the rules of algebraic
operations with average values, underlines the fact that the average
value ⟨M²⟩ of the square of a quantity should not be confused
with the square of the average value ⟨M⟩².

If a system consists of N independent particles, the relative
fluctuation of an additive state function (9.1.3) of the system
is inversely proportional to the square root of N:

\[ \delta_M \propto \frac{1}{\sqrt{N}}. \]
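
The inverse-square-root law can be verified exactly for a simple additive quantity. The sketch below is an illustration under an assumed model, not part of the handbook: M is the number of particles found in one half of a vessel when each of N independent particles is in either half with probability 1/2. The averages ⟨M⟩ and ⟨M²⟩ are computed by direct summation and combined as in 11.6.3.

```python
import math


def relative_fluctuation(n_particles):
    """Relative fluctuation of the particle count in one half of a vessel."""
    weights = [math.comb(n_particles, k) for k in range(n_particles + 1)]
    total = 2 ** n_particles  # number of equally likely arrangements
    mean = sum(k * w for k, w in enumerate(weights)) / total
    mean_sq = sum(k * k * w for k, w in enumerate(weights)) / total
    variance = mean_sq - mean ** 2  # <M^2> - <M>^2
    return math.sqrt(variance) / mean


for n in (100, 400, 1600):
    print(n, relative_fluctuation(n), 1.0 / math.sqrt(n))
```

For this model the relative fluctuation equals 1/√N exactly, so quadrupling N halves it.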


11.6.4 The following are examples of fluctuations of physical
quantities.

Example 1. In measuring the temperature by means of a gas
thermometer filled with an ideal gas (8.4.1), the thermometer
readings are not constant owing to fluctuations of temperature.
The variations in temperature Δt measured by the thermometer
cannot be less than the absolute fluctuation of the thermometer
indication, equal to √⟨(ΔT)²⟩, i.e. Δt ≥ √⟨(ΔT)²⟩.

The absolute fluctuation can be determined by the formula in
11.6.3:

\[ \sqrt{\langle (\Delta T)^2 \rangle} \approx \frac{\langle T \rangle}{\sqrt{N}}. \]

Thus

\[ \Delta t \ge \sqrt{\langle (\Delta T)^2 \rangle} \approx \frac{\langle T \rangle}{\sqrt{N}}. \]

If the gas thermometer contains 10⁻⁸ mole of gas, i.e.
N = 6.02 × 10¹⁵, then the minimum changes in temperature Δt
that can be detected by the instrument are of the order of
magnitude Δt ≈ 10⁻⁸ ⟨T⟩. This value indicates the limit of
sensitivity of the gas thermometer.

Real variations in temperature, commonly encountered in
experiments, are incommensurably larger than Δt.
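
The order of magnitude in Example 1 follows directly from the estimate δ_T ≈ 1/√N. The sketch below simply evaluates that estimate; the function name is hypothetical and the rounded value of Avogadro's number is an assumption of the example.

```python
import math

N_A = 6.022e23  # Avogadro's number, 1/mol (rounded)


def temperature_resolution(moles):
    """Estimate 1/sqrt(N) of the relative temperature fluctuation of a gas sample."""
    n_molecules = moles * N_A
    return 1.0 / math.sqrt(n_molecules)


for moles in (1.0, 1e-8):
    n_molecules = moles * N_A
    print(f"{moles:g} mol: N = {n_molecules:.3g}, "
          f"delta_T ~ {temperature_resolution(moles):.1e}")
```

At room temperature the smaller sample therefore resolves temperature changes of a few microkelvin at best, far below ordinary experimental variations.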

Example 2. Electrical fluctuations in circuits restrict the limits
of sensitivity of radio receiving equipment. Among others,
fluctuations in the number of electrons emitted by the incandescent
cathode lead to fluctuations in the current supplied to the
vacuum tube. This is called shot, or Schottky, noise. A measure
of the shot noise is the mean square fluctuation of the current:

\[ \langle (\Delta I)^2 \rangle = \frac{e I_0}{t}, \]

where e is the charge of the electron, and I_0 is the average current
over the time t during which it is measured. Here t ≫ τ,
where τ is the time of flight of the electron in the tube.
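
As a rough numerical illustration of Example 2, the sketch below assumes the shot-noise relation in the form ⟨(ΔI)²⟩ = e·I_0/t, which follows if the electrons are emitted independently of one another. The function name and the sample current and averaging time are hypothetical.

```python
import math

E_CHARGE = 1.602176634e-19  # charge of the electron, C


def shot_noise_rms(mean_current, averaging_time):
    """Root-mean-square current fluctuation (A), assuming <(dI)^2> = e * I0 / t."""
    return math.sqrt(E_CHARGE * mean_current / averaging_time)


# Hypothetical data: 1 mA mean current averaged over 1 microsecond.
rms = shot_noise_rms(1e-3, 1e-6)
print(f"rms fluctuation = {rms:.2e} A, relative = {rms / 1e-3:.1e}")
```

Shortening the averaging time t raises the fluctuation, which is why shot noise limits the sensitivity of fast, low-current measurements.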


11.7  Brownian  Movement 


11.7.1  The  Brownian  movement  is  the  continuous  haphazard  zig¬ 
zag  motion  of  small  particles  suspended  in  a  gas  or  liquid  and 
can  be  observed  in  a  microscope.  The  Brownian  movement  is 
caused  by  fluctuations  (11.6.1)  in  the  pressure  exerted  by  the 
molecules of gas or liquid on the surfaces of the minute suspended
particles. As a result of pressure fluctuations the Brownian
particles are subject on all sides to unbalanced forces, which
cause the complex motions that are observed.

11.7.2  Under  constant  external  conditions,  no  changes  are 
observed  in  the  motions  of  the  Brownian  particles  and  they 
continue  indefinitely.  This  is  testimony  to  the  continuous  na¬ 
ture  of  random  thermal  motion  of  the  molecules  to  which  the 
displacements  of  the  Brownian  particles  are  due.  The  velocity 
v  and  energy  E  of  motion  of  these  particles  depend  upon  their 
size,  but  not  on  their  chemical  composition.  The  values  of  v 
and  E  increase  with  the  temperature  and  with  a  reduction  in 
the  viscosity  of  the  gas  or  liquid. 

11.7.3  It  has  been  experimentally  established  that  Brownian 
particles  can  move  upwards,  rising  in  the  fluid.  This  occurs 
when  the  molecules  of  the  gas  (or  liquid)  transmit  an  uncom¬ 
pensated  momentum  directed  upwards  to  the  particle.  This 
increases  the  potential  energy  of  the  particle  at  the  expense  of 
the  kinetic  energy  of  the  molecules  surrounding  it,  thereby 
leading  to  localized  cooling  of  the  gas  or  liquid.  This  means  that 
the  mechanical  energy  of  the  Brownian  particle  has  increased 
by  cooling  a  single  source  of  heat,  the  liquid  or  gas,  thereby 
contradicting  the  second  law  of  thermodynamics  (11.3.2).  In 
this  manner,  Brownian  movement  is  proof  of  the  limited  appli¬ 
cation  of  the  second  law  of  thermodynamics  and  of  its  statisti¬ 
cal  nature  (11.5.1). 

11.7.4 The motions of Brownian particles are completely chaotic.
Therefore, the average displacement ⟨x⟩ of a particle along
an arbitrary direction equals zero. The mean square displacement
⟨x²⟩ is proportional to the time t of observation of the
particle and is expressed by Einstein’s equation:

\[ \langle x^2 \rangle = 2Dt, \]

where D is the diffusion coefficient of the Brownian particles.
For spherical particles of radius r

\[ D = \frac{RT}{6\pi \eta r N_A}. \]

Here T is the absolute temperature, R is the universal gas constant
(8.4.4), η is the coefficient of viscosity of the liquid or gas
(10.8.4), and N_A is Avogadro’s number (Appendix II).
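
To give a feeling for the magnitudes involved, the sketch below evaluates Einstein's equation with D = RT/(6πηrN_A) for spherical particles. The particle radius, the temperature and the viscosity of water are assumed illustrative values, and the function names are hypothetical.

```python
import math

R = 8.314462618      # universal gas constant, J/(mol*K)
N_A = 6.02214076e23  # Avogadro's number, 1/mol


def diffusion_coefficient(temperature, viscosity, radius):
    """Diffusion coefficient (m^2/s) of a spherical Brownian particle."""
    return R * temperature / (6.0 * math.pi * viscosity * radius * N_A)


def rms_displacement(diffusion, time):
    """Root-mean-square displacement (m) along one direction, from <x^2> = 2 D t."""
    return math.sqrt(2.0 * diffusion * time)


# Assumed data: particle of radius 0.5 micrometre in water (about 1.0e-3 Pa*s) at 293 K.
d_coeff = diffusion_coefficient(293.0, 1.0e-3, 0.5e-6)
print(f"D = {d_coeff:.2e} m^2/s")
print(f"rms displacement in 1 s = {rms_displacement(d_coeff, 1.0):.2e} m")
```

Because the displacement grows as the square root of the time, observing for 100 s increases it only tenfold.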

11.8  Third  Law  of  Thermodynamics 

11.8.1 As a single-valued state function of a system, entropy is
introduced by means of a differential relation (11.4.2). Hence,
the entropy can be determined only up to an arbitrary constant,
which cannot be found from the first and second laws of
thermodynamics. It is consequently impossible to determine the
absolute value of the entropy.

11.8.2 An experimental investigation of the properties of
substances at ultralow temperatures led to the establishment of
the third law of thermodynamics. This law, also called the Nernst
heat theorem, states that the change in entropy equals zero in
any isothermal process conducted at absolute zero:

\[ \Delta S_{T \to 0} = 0 \quad\text{and}\quad S = S_0 = \text{const}, \]

regardless of the changes in any other state variables (for example,
volume, pressure, and intensity of the external force field).
The third law does not enable the absolute value of the entropy
to be determined. But the fact that the entropy is constant as
T → 0 permits this constant value to be employed as the reference
point for entropy values. This enables the change in entropy
to be determined in processes being investigated.

11.8.3 Planck extended Nernst’s heat theorem by stating that
the entropy of a system equals zero at absolute zero temperature.
If E_0, E_1, ..., E_n is a sequence of energy levels (38.2.5)
of a system, then at absolute zero temperature the system in
equilibrium is in a quite definite lowest state with energy exactly
equal to E_0. Here the thermodynamic probability P of the
state (11.5.2) is equal to unity (P = 1) and, according to
Boltzmann’s relation (11.5.1),

\[ S_0 = k \ln P = 0. \]

11.8.4 The heat capacity C_V at constant volume (9.5.4) for
all bodies vanishes at T = 0 K. As a matter of fact, if the
temperature of a system is sufficiently low, so that the average
kinetic energy kT of a particle (10.6.4) is substantially less
than the difference ΔE between the first and the lowest energy
levels (ΔE ≫ kT), then the thermal excitation of the system is
insufficient to transfer it from the state with the energy E_0 to
one with the energy E_1. Therefore, at ultralow temperatures the
system should be in a state having the least energy E_0. The
internal energy U_0 of the system (9.1.2) is equal to E_0, i.e.
U_0 = E_0. Hence the heat capacity of the system at constant
volume

\[ C_V = \left( \frac{\partial U}{\partial T} \right)_V = \left( \frac{dE_0}{dT} \right) \to 0 \quad \text{as } T \to 0. \]
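
The freezing-out of thermal excitation can be illustrated with the simplest conceivable model. The sketch below assumes a hypothetical system with only two non-degenerate levels separated by ΔE, for which the heat capacity per particle is k·x²·e^(−x)/(1 + e^(−x))² with x = ΔE/kT. This model and the function name are assumptions made for illustration and are not taken from the handbook.

```python
import math

K_B = 1.380649e-23  # Boltzmann's constant, J/K


def two_level_heat_capacity(delta_e, temperature):
    """Heat capacity (J/K) per particle of a two-level system with gap delta_e."""
    x = delta_e / (K_B * temperature)
    boltzmann_factor = math.exp(-x)  # underflows harmlessly to 0.0 for very large x
    return K_B * x ** 2 * boltzmann_factor / (1.0 + boltzmann_factor) ** 2


# Assumed gap: delta_e = k * (100 K), so x = 100 K / T.
gap = K_B * 100.0
for temperature in (100.0, 10.0, 1.0):
    ratio = two_level_heat_capacity(gap, temperature) / K_B
    print(f"T = {temperature:6.1f} K: C / k = {ratio:.3e}")
```

Once kT falls well below ΔE the heat capacity drops off exponentially, in line with the limit C_V → 0 as T → 0.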


The  coefficient  of  cubic  expansion  (40.2.3)  also  vanishes  at 
absolute  zero  temperature. 

It follows from the third law that a process in which a body
could be cooled to absolute zero temperature is impossible
(principle of the unattainability of absolute zero temperature,
sometimes called the Nernst principle).


CHAPTER  12  REAL  GASES  AND  VAPOURS 

12.1  Forces  of  Intermolecular  Interaction 

12.1.1  The  properties  of  gases  that  are  not  highly  rarified  differ 
from  those  of  ideal  gases,  which  comply  with  the  Mendeleev- 
Clapeyron  equation  (8.4.4).  Experiments  indicate  that  the 
specific  heat  capacities  (9.5.2),  coefficients  of  viscosity  (10.8.4) 
and  other  quantities  have  different  values  for  real  gases  than 
for  ideal  gases. 

12.1.2  A  real  gas  is  one  in  which  forces  of  molecular  interaction 
exist  between  the  molecules. 

A vapour is a real gas in states close to that in which it
condenses.

12.1.3 The forces of intermolecular interaction decrease rapidly
with an increase in the distances between the molecules
(short-range forces). At intermolecular distances exceeding 10⁻⁹ m,
these forces can be neglected.

Forces of interaction between molecules can be subdivided
into forces of attraction and forces of repulsion. Both types act
simultaneously. Otherwise liquids and solids could not have a
definite volume: their constituent particles would either fly
apart in all directions or would stick together, forming the
minimum possible volume.

12.1.4 The forces of mutual attraction and repulsion depend
differently on the distance r between the molecules. At distances
commensurable with the linear dimensions of atoms and small
inorganic molecules (10⁻¹⁰ m), repulsive forces F_1 predominate;
at distances r of the order of 10⁻⁹ m forces of mutual
attraction F_2 prevail. If r is the radius vector drawn from the
point where molecule B is located to the point where molecule A
is located, and molecule B acts on molecule A with the forces
F_1 and F_2, then

\[ \mathbf{F}_1 = F_{1r}\,\frac{\mathbf{r}}{r} \quad\text{and}\quad \mathbf{F}_2 = F_{2r}\,\frac{\mathbf{r}}{r}. \]

Projections F_1r and F_2r of forces F_1 and F_2 on the direction of r
depend upon r in the following way:

\[ F_{1r} = \frac{b}{r^{13}} \quad\text{and}\quad F_{2r} = -\frac{a}{r^{7}}, \]

where a and b are factors that depend on the structure of the
molecules and the type of intermolecular interaction. The curves
in Fig. 12.1 show the dependence of F_1r and F_2r on r. Forces
of repulsion are conventionally accepted as positive and
attractive forces as negative (Fig. 12.1).

The resultant force is

\[ \mathbf{F} = \mathbf{F}_1 + \mathbf{F}_2 = F_r\,\frac{\mathbf{r}}{r}, \]

and F_r = F_1r + F_2r. The dependence of F_r on r is also shown
in Fig. 12.1.

12.1.5 At r = r_0, forces F_1 and F_2 counterbalance each other
and F = 0. In magnitude, F_2 > F_1 at r > r_0 and F_2 < F_1 at
r < r_0. Thus r_0 is the equilibrium distance between the
molecules, at which they would be located with respect to each
other if there were no thermal motion.
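
The equilibrium distance can be found explicitly if the power-law projections of 12.1.4 are assumed, F_r = b/r¹³ − a/r⁷, which vanishes at r_0 = (b/a)^(1/6). The sketch below uses hypothetical coefficients in arbitrary units, chosen only so that r_0 comes out as a round number; the function names are illustrative.

```python
def force_projection(r, a, b):
    """Resultant projection F_r = b / r**13 - a / r**7 (repulsion is positive)."""
    return b / r ** 13 - a / r ** 7


def equilibrium_distance(a, b):
    """Distance r0 at which F_r = 0, i.e. r0 = (b / a) ** (1/6)."""
    return (b / a) ** (1.0 / 6.0)


# Hypothetical coefficients in arbitrary units, chosen so that r0 = 2.
a, b = 2.0, 128.0
r0 = equilibrium_distance(a, b)
print(f"r0 = {r0:.6f}")
print("F_r at 0.9 r0:", force_projection(0.9 * r0, a, b))  # positive: net repulsion
print("F_r at 1.1 r0:", force_projection(1.1 * r0, a, b))  # negative: net attraction
```

The change of sign on either side of r_0 is what makes the equilibrium stable.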

12.1.6 The element of work δW done by the resultant force F,
when the distance between the molecules is increased by the
amount dr, is equal to the decrease in the mutual potential
energy E_p of the two molecules (3.3.1):

\[ \delta W = (\mathbf{F} \cdot d\mathbf{r}) = F_r\,dr = -dE_p. \]

Integrating with respect to r from r to ∞:

\[ \int_{E_p}^{E_{p\infty}} dE_p = -\int_r^{\infty} F_r\,dr \quad\text{and}\quad E_p - E_{p\infty} = \int_r^{\infty} F_r\,dr. \]

At r = ∞ the molecules do not interact and E_p∞ = 0. Hence,

\[ E_p = \int_r^{\infty} F_r\,dr. \]

This integral can be calculated graphically from the known
dependence of F_r on r (Fig. 12.1). It is proportional to the area
bounded by the curve F_r = F_r(r), the r axis and the value of r
(where r = const) for which E_p is being calculated. At r > r_0,
the energy E_p < 0 because F_r < 0; at r = r_0, the energy E_p
reaches its minimum value: E_p = E_p min. This follows from
the equation

\[ \left( \frac{dE_p}{dr} \right)_{r = r_0} = -F_r(r_0) = 0. \]


A system consisting of the interacting molecules, in a state of
stable equilibrium (r = r_0), possesses minimum potential energy.
At r < r_0, the energy E_p begins to increase, becomes positive
and then rises drastically due to the rapid increase in the
repulsive forces as r drops (Fig. 12.2).
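
As a worked illustration, if the projections of the forces are taken in the power-law form of 12.1.4 (repulsion b/r¹³, attraction −a/r⁷), the integral of 12.1.6 can be evaluated in closed form. This is a sketch under that assumption rather than a general result:

```latex
\begin{aligned}
E_p(r) &= \int_r^{\infty}\left(\frac{b}{r'^{13}} - \frac{a}{r'^{7}}\right)dr'
        = \frac{b}{12\,r^{12}} - \frac{a}{6\,r^{6}},\\
F_r(r_0) &= 0 \;\Rightarrow\; r_0 = \left(\frac{b}{a}\right)^{1/6},\\
E_{p\,\min} &= E_p(r_0) = \frac{a^{2}}{12\,b} - \frac{a^{2}}{6\,b} = -\frac{a^{2}}{12\,b} < 0 .
\end{aligned}
```

In this form E_p changes sign at r = r_0/2^(1/6), about 0.89 r_0, and grows as 1/r¹² at still smaller distances, which reproduces the steep rise described above.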

12.1.7 The value E_p min, the minimum potential energy
of interaction of molecules, is the criterion for various states
of aggregation of matter.

If |E_p min| ≪ kT, the substance is in a gaseous state. When
|E_p min| ≫ kT, it is in the solid state. The condition
|E_p min| ≈ kT corresponds to the liquid state. Here kT is
twice the average energy per degree of freedom of thermal
motion of the molecules (10.6.4).

12.1.8 Van der Waals forces are weak* forces of attraction that
act between molecules at distances of the order of 10⁻⁹ m (12.1.4).
They are the reason for the internal-pressure correction
in the Van der Waals equation of state for a real gas (12.2.4).
Distinction is made between three types of Van der Waals forces
of attraction.

* Weak in the sense of their small magnitude compared to the
forces of attraction that provide for the formation of stable
molecules (12.1.9).

(a) Orientational forces of attraction F_or are due to the fact
that asymmetric polar molecules (18.1.4) have constant dipole
moments p_e (18.1.4). The result of interaction of these dipole
moments is the attraction force F_or. This force is opposed by
the thermal motion of the molecules. Calculations indicate
that

\[ F_{\text{or}} \propto \frac{p_e^{4}}{kT}\,\frac{1}{r^{7}}, \]

where r is the distance between the molecules, k is Boltzmann’s
constant (8.4.5), and T is the absolute temperature.

(b) Induction forces of attraction F_in are exerted when a neutral
molecule of a real gas is in an electric field set up by another
molecule, both molecules having high polarizability (18.1.3).
If the molecules are sufficiently close together, the action of the
electric field of the second molecule develops an induced dipole
moment p_e = ε_0βE (18.1.4) in the first molecule, where β is
the polarizability of the molecule (18.1.3), and E is the strength
of the electric field set up by the second molecule. To within
constant factors

\[ F_{\text{in}} \propto \beta p_e^{2}\,\frac{1}{r^{7}}. \]


(c) The vibration of electrons in one molecule (or atom) can
excite vibration in another molecule (or atom). The vibrations
of the electrons in the two molecules are in phase with each
other, leading to attraction between the molecules (or atoms).
The resulting resonance forces of attraction are called dispersion
forces. They play the main role in the interaction of nonpolar
molecules (18.1.3). The force of dispersion attraction is

\[ F_{\text{d}} \propto \frac{e^{4} h \nu_0}{k^{2}}\,\frac{1}{r^{7}}, \]

where e is the charge of the electron, h is Planck’s constant,
ν_0 = (1/2π)√(k/m) is the frequency of vibration of the atoms
(28.1.2), k is the coefficient of the quasi-elastic force (40.3.7),
and m is the mass of the atom.

Dispersion forces owe their name to the analogy between the
origin of these forces and dispersion phenomena occurring
when electromagnetic waves propagate through matter (34.4.1).

12.1.9 The attraction force F_r between two molecules, shown
in Fig. 12.1, is the resultant of all the kinds of attraction
enumerated in Sect. 12.1.8. The potential energy of Van der Waals
attraction amounts to (0.4 to 4) × 10³ J/mol.

In addition to the electromagnetic interaction, a special kind
of quantum interaction exists between molecules at distances
r < 10⁻¹⁰ m. This interaction leads either to the development
of repulsive forces between the molecules (12.1.4), or to strong
attraction of adjacent atoms (or groups of atoms) and the
formation between them of ionic and covalent chemical bonds
(39.4.4 and 39.4.5). The result of such bonds is the formation
of stable molecules. The potential energy of chemical bonds
exceeds the energy of Van der Waals attraction and amounts to
(0.4 to 4) × 10⁴ J/mol.

12.1.10 Being internal forces (2.2.4), the forces of interaction
in a system of two molecules cannot change the total energy E
of the system, which is the sum of the kinetic energy E_k of the
molecules and their mutual potential energy E_p. Hence

\[ dE = dE_k + dE_p = 0 \quad\text{or}\quad dE_k = -dE_p = F_r\,dr. \]

Use is made here of the equation in Sect. 12.1.6.

When the molecules are brought closer together (dr < 0), to the
distance r_0 (Fig. 12.1), the potential energy E_p decreases and
the kinetic energy E_k accordingly increases. This occurs at the
expense of the positive work done by the resultant force of
mutual attraction between the molecules (12.1.4) (F_r < 0 when
r > r_0). As the molecules are brought still closer, they do work
in overcoming the resultant force of mutual repulsion (F_r > 0
when r < r_0). This reduces the kinetic energy of the molecules.
At the instant they are closest together (r = r_1 in Fig. 12.2)
all of the kinetic energy of the molecules has been completely
expended in doing work to overcome the repulsive forces:
E_k = 0 and the total energy E is equal to the potential energy
E_p, i.e. E = E_p (Fig. 12.2).

If all the state variables of real gases are maintained constant
with the exception of the temperature, the distance r_1 decreases
when the gas is heated. This decrease is very small, however,
even at high temperatures. This is due to the extremely steep
increase in the repulsive forces F_1 as r is decreased (Fig. 12.1).
Thus the distance r_1 is the effective diameter d of the molecule
(8.4.1). The finite dimensions of molecules of real gases are
explained by the action of repulsive forces between the
molecules.

12.2  Van  der  Waals  Equation  of  State 

12.2.1  A  Van  der  Waals  gas  is  a  model  of  a  real  gas  in  which 
the  molecules  are  dealt  with  as  perfectly  rigid  spheres  of  diam¬ 
eter  d  (12.1.10),  with  forces  of  mutual  attraction  acting  between 
them.  The  finite  size  of  the  spheres  signifies  that  the  repulsive 
forces  between  the  molecules  of  a  real  gas  are  also  taken  into 
account. 

12.2.2 The molecules of a real gas, each having a volume of
v = πd³/6, do not move around as freely in a vessel as the
“point” molecules of an ideal gas. Therefore, in the
Mendeleev-Clapeyron equation (8.4.4) pV_μ = RT, the total volume
V_μ of a vessel holding one mole of the gas should be replaced
by the “free” volume

\[ V_\mu' = V_\mu - b, \]

where b is Van der Waals’s correction for the intrinsic volume
of the molecules. Correction b is equal to four times the volume
of all the molecules contained in one mole of the gas:

\[ b = 4 N_A v, \]

where N_A is Avogadro’s number (Appendix II), and v is the
volume of a single molecule.
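
The relation b = 4N_A·v with v = πd³/6 links the correction b to the effective molecular diameter d. The sketch below inverts it to estimate d from b. The value of b used for carbon dioxide is a commonly tabulated approximate figure and is an assumption of this example, as are the function names.

```python
import math

N_A = 6.02214076e23  # Avogadro's number, 1/mol


def molecule_volume(diameter):
    """Volume (m^3) of a spherical molecule of the given effective diameter."""
    return math.pi * diameter ** 3 / 6.0


def vdw_b(diameter):
    """Van der Waals correction b (m^3/mol) for molecules of the given diameter."""
    return 4.0 * N_A * molecule_volume(diameter)


def diameter_from_b(b):
    """Effective diameter (m) recovered from b by inverting b = 4 N_A pi d^3 / 6."""
    return (6.0 * b / (4.0 * math.pi * N_A)) ** (1.0 / 3.0)


B_CO2 = 4.267e-5  # m^3/mol, approximate tabulated value (assumption)
d_co2 = diameter_from_b(B_CO2)
print(f"effective diameter of CO2 = {d_co2:.2e} m")
```

The result is a few times 10⁻¹⁰ m, the same order as the distances at which repulsion is said to predominate in 12.1.4.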

12.2.3 The forces of mutual attraction between the molecules
are taken into account for a Van der Waals gas by introducing
a correction for the pressure in the Mendeleev-Clapeyron
equation (8.4.4). Owing to the short-range nature of the attraction
forces (12.1.3), each molecule interacts only with particles that
are located at distances r ≤ R_m, where R_m is the range of
molecular action, with a magnitude of the order of 10⁻⁹ m. A sphere
of radius R_m is called the sphere of molecular action.

The forces of attraction of a molecule within the bulk of the
gas to other molecules are mutually counterbalanced and have
no effect on the motion of the given molecule. If the molecule
is within a layer of gas that borders on a wall of the vessel, it is
subject to an uncompensated force of attraction directed inward,
into the bulk of the gas. Consequently, in its collision
with a wall, such a molecule imparts less momentum to the
wall (10.2.1), and the pressure p exerted by a real gas on the
walls is reduced in comparison to the pressure p_id of an ideal
gas having the same density at the same temperature,

\[ p = p_{\text{id}} - p^{*}, \]

or

\[ p_{\text{id}} = p + p^{*}, \]

where p* is the Van der Waals correction due to the forces of
mutual attraction and is called the internal pressure. The internal
pressure p* is inversely proportional to the square of the
volume V_μ of the vessel occupied by one mole of the gas:

\[ p^{*} = \frac{a}{V_\mu^{2}}, \]

where a is the Van der Waals coefficient and depends upon the
chemical nature of the gas.

12.2.4 The Van der Waals equation describes the state of a real
gas and differs from the Mendeleev-Clapeyron equation (8.4.4)
in that the corrections b and p* have been introduced (12.2.2
and 12.2.3). For a single mole of gas it is of the form

\[ \left( p + \frac{a}{V_\mu^{2}} \right)(V_\mu - b) = RT. \]
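
The effect of the two corrections is easy to see numerically. The sketch below solves the one-mole Van der Waals equation for the pressure and compares it with the ideal-gas value. The coefficients a and b for carbon dioxide are commonly tabulated approximate values and are assumptions of this example, as are the function names and the chosen states.

```python
R = 8.314  # universal gas constant, J/(mol*K)

A_CO2 = 0.364     # Pa*m^6/mol^2, approximate tabulated value (assumption)
B_CO2 = 4.267e-5  # m^3/mol, approximate tabulated value (assumption)


def vdw_pressure(v_molar, temperature, a, b):
    """Pressure (Pa) of one mole of a Van der Waals gas occupying v_molar (m^3)."""
    return R * temperature / (v_molar - b) - a / v_molar ** 2


def ideal_pressure(v_molar, temperature):
    """Pressure (Pa) of one mole of an ideal gas, from p * V = R * T."""
    return R * temperature / v_molar


for v_molar in (1.0e-3, 1.0):  # a compressed state and a rarefied state
    p_real = vdw_pressure(v_molar, 350.0, A_CO2, B_CO2)
    p_ideal = ideal_pressure(v_molar, 350.0)
    print(f"V = {v_molar:g} m^3/mol: p_vdw = {p_real:.4g} Pa, p_id = {p_ideal:.4g} Pa")
```

At one litre per mole the two pressures differ by several per cent, while at one cubic metre per mole they practically coincide.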


12.2.5 The Van der Waals equation for an arbitrary mass M of
real gas, having the molar mass μ (8.4.3), is

\[ \left( p + \frac{M^{2}}{\mu^{2}}\,\frac{a}{V^{2}} \right)\left( V - \frac{M}{\mu}\,b \right) = \frac{M}{\mu}\,RT, \]

where V is the volume occupied by the gas.

This equation is valid for gases that are not too highly compressed.
In highly rarefied gases V_μ ≫ b, p* ≪ p and the Van der
Waals equation does not differ from the Mendeleev-Clapeyron
equation.

12.3 Isothermals of Real Gases. Phase Transitions

12.3.1 The curve representing the dependence of the molar
volume of a gas on the pressure at constant temperature is called
the isothermal, or isotherm, of a real gas. Isothermals of carbon
dioxide gas are shown in Fig. 12.3. At temperatures T below
T_c = 304 K, all the isothermals have horizontal steps along
which, in addition to the constant temperature, the pressure is
also constant while the molar volume varies. The difference
V_D − V_B of the molar volumes over the horizontal steps
decreases as the temperature is raised (Fig. 12.3). At T = T_c this
difference vanishes. The temperature T = T_c, corresponding to
the condition V_D − V_B = 0, is called the critical temperature.
The isothermal of a real gas at T = T_c is called the critical
isothermal. On this isothermal points D and B merge into
point C, which is called the critical point. The state variables
(8.3.2) of the gas at the critical point are the critical variables
V_μc, p_c and T_c.

Critical point C is an inflection point on the critical isothermal.
Consequently, the tangent to the isothermal at this point is
parallel to the OV_μ axis.
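
For a Van der Waals gas, the condition that the critical point is an inflection point with a horizontal tangent fixes the critical variables in terms of a and b. The closed-form expressions used below (V_μc = 3b, p_c = a/(27b²), T_c = 8a/(27Rb)) are the standard consequences of setting the first and second derivatives of p with respect to V_μ to zero; they are not stated in this section and are brought in here as an assumption. The coefficients for carbon dioxide are approximate tabulated values, and the function names are hypothetical.

```python
R = 8.314  # universal gas constant, J/(mol*K)

A_CO2 = 0.364     # Pa*m^6/mol^2, approximate tabulated value (assumption)
B_CO2 = 4.267e-5  # m^3/mol, approximate tabulated value (assumption)


def critical_parameters(a, b):
    """Critical molar volume, pressure and temperature of a Van der Waals gas."""
    v_c = 3.0 * b
    p_c = a / (27.0 * b ** 2)
    t_c = 8.0 * a / (27.0 * R * b)
    return v_c, p_c, t_c


def dp_dv(v_molar, temperature, a, b):
    """First derivative of the Van der Waals pressure with respect to molar volume."""
    return -R * temperature / (v_molar - b) ** 2 + 2.0 * a / v_molar ** 3


def d2p_dv2(v_molar, temperature, a, b):
    """Second derivative of the Van der Waals pressure with respect to molar volume."""
    return 2.0 * R * temperature / (v_molar - b) ** 3 - 6.0 * a / v_molar ** 4


v_c, p_c, t_c = critical_parameters(A_CO2, B_CO2)
print(f"V_c = {v_c:.3e} m^3/mol, p_c = {p_c:.3e} Pa, T_c = {t_c:.1f} K")
```

With these coefficients the estimated critical temperature of carbon dioxide comes out near 304 K, and both derivatives vanish at the computed point, as required for a horizontal inflection.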

12.3.2 Any subcritical isothermal ($T < T_c$) is a curve representing
the continuous transition of the substance from the gaseous to the
liquid state. It consists of three portions: TD, DB and BA, each of
which represents different states of the substance. Over the portion
TD the substance is in the gaseous state; portion DB corresponds to
the transition of the substance from the gaseous to the liquid state.
Over the portion BA of the isothermal, the substance is a liquid.
This portion is almost vertical owing to the low compressibility of
liquids. Points D and B of the horizontal portion of the isothermal
correspond to the beginning and end of condensation in isothermal
compression of a real gas. Conversely, in isothermal expansion of
the liquid, points B and D are the beginning and end of the
boiling process. Point B corresponds to the state of a boiling
liquid, point D to dry saturated vapour. The mixture of the boiling
liquid and dry saturated vapour that exists at any point M of
portion BD is called wet vapour (Fig. 12.3).

12.3.3 In thermodynamics a phase is the totality of all the parts
of a system that have the same chemical composition and are
in the same state. Wet vapour, for instance, is a two-phase system
consisting of boiling liquid and dry saturated vapour.

Shown in Fig. 12.4 are two boundary curves BC and DC which
connect points B and D (Fig. 12.3) at various temperatures.
Curves BC and DC converge at critical point C. Boiling curve
BC separates the single-phase liquid region I from the two-phase
wet vapour region II. Curve BC represents the beginning of the
first-order phase transition of the substance from the liquid to the
gaseous state. The boundary curve DC separates the two-phase
region II from the single-phase region III in which the substance
is in the gaseous state.

12.3.4 The two-phase region II cannot exist at pressures above
the critical one $p_c$ (12.3.1) at which the substance can be in
either of two states, gaseous or liquid. No pressure can convert
gas at a temperature above the critical $T_c$ into the liquid state
by isothermal compression. The critical temperatures of many
gases are very low: $T_c \approx 5$ K for helium and 33 K for hydrogen.
This makes the liquefaction of such gases quite difficult.

12.3.5 Besides the difference in the molar volumes of boiling
liquid and dry saturated vapour, the specific heat of vaporization
(13.6.4) also vanishes when a substance is in the critical
state, as does the surface tension (13.4.4). The difference between
the liquid and gaseous states of a substance completely
disappears when it is in the critical state.

12.3.6 The Van der Waals equation (12.2.4) is a cubic equation
with respect to the molar volume $V_\mu$, with coefficients that
depend upon the pressure, temperature and the chemical nature
of the gas. This equation has either one or three real roots,
depending on the numerical values of $p$ and $T$. Shown in
Fig. 12.5 are the isothermals of a real gas that complies with
the Van der Waals equation. The isothermals correspond to the
different temperatures $T_1 < T_2 < T < T_3 < T_c < T_5 < T_6$.
All the subcritical isothermals have a hatched portion where
each pressure corresponds to three different states, which are
represented by the three points, B, E and G, of the isothermal.
The wave-shaped portion BDEFG of the isothermal in Fig. 12.5
more accurately represents the transition of the substance from
the gaseous to the liquid state than the horizontal portions of
experimentally plotted isothermals (dashed straight lines BG).
The portion BD of the isothermal corresponds to a superheated
liquid, which can be obtained if the beginning of boiling is
delayed at point B. The portion GF of the isothermal represents
the state of supersaturated vapour, occurring upon slow isothermal
compression in the absence of condensation centres. If such
centres (dust specks or ions) are introduced into the supersaturated
vapour, extremely rapid condensation of the vapour takes
place. Over the portion DEF, the molar volume increases
(decreases) as the pressure is increased (decreased). Such states of
matter are impossible. The horizontal steps BG cut the portions
BDEFG of the isothermals in such a manner that the hatched areas
BDEB and EFGE in Fig. 12.5 are equal to each other (Maxwell's
equal area rule).
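
The cubic form referred to in this section can be written out explicitly. The sketch below assumes that the Van der Waals equation (12.2.4) has its standard form $(p + a/V_\mu^2)(V_\mu - b) = RT$ for one mole; the symbols $V_{\mu B}$, $V_{\mu G}$ for the molar volumes at points B and G and $p_s$ for the pressure along the step BG are introduced here only for illustration. Multiplying the equation through by $V_\mu^2$ and collecting powers of $V_\mu$ gives the cubic, and the equal area rule can be stated as an integral condition on the same isothermal:

```latex
% Van der Waals equation rearranged as a cubic in the molar volume
p V_\mu^3 - (pb + RT)\, V_\mu^2 + a V_\mu - ab = 0

% Maxwell's equal area rule for the step BG at pressure p_s (areas BDEB and EFGE equal)
\int_{V_{\mu B}}^{V_{\mu G}} p(V_\mu, T)\, dV_\mu = p_s \left( V_{\mu G} - V_{\mu B} \right)
```

For given $p$ and $T$ the three real roots of the cubic, when they exist, are the molar volumes at points B, E and G.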

12.3.7 The values of the critical variables $p_c$, $V_{\mu c}$ and $T_c$ (12.3.1)
can be expressed in terms of the coefficients $a$ and $b$ in the Van
der Waals equation (12.2.4) and the universal gas constant $R$
(8.4.4):

$$p_c = \frac{1}{27}\,\frac{a}{b^2}, \qquad V_{\mu c} = 3b \qquad \text{and} \qquad T_c = \frac{8}{27}\,\frac{a}{bR}.$$
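
The relations of Sect. 12.3.7 are easy to check numerically. The following Python sketch evaluates them for carbon dioxide. The values of $a$ and $b$ used here are typical tabulated Van der Waals constants and are an assumption, not data from this handbook; the function name `critical_point` is likewise only illustrative.

```python
R = 8.314  # universal gas constant, J/(mol*K)


def critical_point(a, b):
    """Return (p_c, V_mu_c, T_c) of a Van der Waals gas from its constants a and b."""
    p_c = a / (27.0 * b ** 2)       # critical pressure, Pa
    v_c = 3.0 * b                   # critical molar volume, m^3/mol
    t_c = 8.0 * a / (27.0 * b * R)  # critical temperature, K
    return p_c, v_c, t_c


# Assumed Van der Waals constants for CO2 (typical tabulated values)
A_CO2 = 0.364    # Pa*m^6/mol^2
B_CO2 = 4.27e-5  # m^3/mol

p_c, v_c, t_c = critical_point(A_CO2, B_CO2)
print(f"p_c = {p_c:.3e} Pa, V_mu_c = {v_c:.3e} m^3/mol, T_c = {t_c:.1f} K")

# The combination p_c * V_mu_c / (R * T_c) equals 3/8 for any a and b
print(p_c * v_c / (R * t_c))
```

With these assumed constants the computed critical temperature comes out close to 304 K, the experimental critical temperature of carbon dioxide.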


12.4  Superfluidity  of  Helium 

12.4.1 Helium is a substance which, after passing from the gaseous
to the liquid state at standard atmospheric pressure and a
temperature of 4.22 K, remains in the liquid state no matter how
close to absolute zero (11.8.4) it is cooled.

12.4.2 At the temperature 2.19 K a second-order phase transition
occurs: liquid helium I, existing at temperatures $T > 2.19$ K,
undergoes the transition to liquid helium II, which exists at
$T < 2.19$ K.

A second-order phase transition is a transformation of matter
that is not associated with the evolution or absorption of heat
as in a first-order transition. In second-order phase transitions
the heat capacity, coefficient of thermal expansion and certain
other characteristics of substances vary with discontinuous
jumps. Some examples of second-order phase transitions are
the transition of iron at the Curie point (26.5.2) from a
ferromagnetic substance into a paramagnetic one (26.3.5) and the
transition of certain metals and alloys at extremely low
temperatures to the superconductive state (41.6.1).

12.4.3 Superfluidity, a phenomenon observed in liquid helium II,
is the practically complete absence of viscosity when such helium
flows through narrow capillary tubes (13.5.5). The coefficient of
viscosity (10.8.4) of helium II is less than $10^{-12}$ Pa·s, while
that of helium I close to the temperature 4.22 K is of the order
of $10^{-6}$ Pa·s.

12.4.4  According  to  the  two-fluid  theory,  liquid  helium  II  is  a 
mixture  of  a  superfluid  component  and  a  normal  component. 
The  superfluid  component  moves  without  friction  and  does  not 
participate  in  the  transfer  of  energy  in  the  form  of  heat  (9.2.1). 
The  normal  component  moves  with  friction  and  does  participate 
in  energy  transfer. 

12.4.5 The theory of superfluidity is based on quantum mechanics
(38.1.2). In the first place, quantum mechanics was required
to explain why helium is the only nonfreezing liquid at ultralow
temperatures and standard pressure. The zero-point vibrations
(38.5.6) of the light-weight atoms of helium are sufficiently
intense that they do not allow the weak forces of attraction
between the helium atoms to form a crystalline structure at
ordinary pressures.

The present-day theory of superfluidity is based on a study of
the energy spectrum of helium at ultralow temperatures. Under
these conditions the continuous energy spectrum can be regarded
as the totality of quasi-particles with the energies $h\nu_i$, where $h$
is Planck's constant and $\nu_i$ is the phonon frequency (41.7.5)
corresponding to these quasi-particles. The quasi-particles
cannot be identified with the real atoms of helium. They represent
the motion of the whole system of helium atoms, and the
energy values of the quasi-particles describe the energy spectrum
of the whole quantum system, i.e. the liquid helium as a whole.
At low temperatures the excited state of the helium constitutes
sound waves that are quasi-particles in the normal part of
helium II. Associated with the quasi-particles are the store of
internal energy and the presence of friction in the liquid helium.
But states are possible in helium II that correspond to its
superfluid part, in which quasi-particles are not advantageous
from the energy point of view and do not arise. As a result of
the strong interaction between the particles of the superfluid
part of helium II a bound body is formed in which no thermal
excitations are generated: the superfluid part of helium II has
no store of internal energy and is not subject to viscosity. At
absolute zero temperature, there should actually not be any
normal part in helium II; all of the helium should be superfluid.
As the helium is heated, the number of phonons increases as
does the share of the normal part of helium II. But until the
temperature reaches 2.19 K, a superfluid part of helium II,
with its characteristic properties, still remains. At the
temperature 2.19 K, helium II is converted into helium I and all the
special properties of helium II disappear.


CHAPTER  13  LIQUIDS 

13.1  Certain  Properties  of  Liquids 

13.1.1 Liquids are bodies that have a definite volume, but assume
the shape of the vessel that holds them. On the thermal motion
in liquids, see Sect. 8.1.5. The nature of the thermal motion
in liquids determines the similarity of their properties with
those of both solids and gases. Like solids, liquids are almost
incompressible. This property is associated with the strong
intermolecular interaction of the particles in liquids. When
liquids are compressed, the distances between the molecules are
reduced and the repulsive forces that impede compression
increase drastically (12.1.4). Liquids have relatively high densities
and, like solids, resist not only compression, but tension as
well (40.3.6). This is manifested in the fact that the Van der
Waals isothermals enter the region of negative pressures (the
isothermal at $T = T_1$ in Fig. 12.5). The resemblance of the
properties of liquids and real gases at high temperatures and
low densities is observed, for instance, in that the surface
tension of liquids (13.4.4) and the specific heat of vaporization
(13.6.4) are reduced as the temperature is raised. Besides, the
densities of dry saturated vapour (12.3.2) and of boiling liquid
(12.3.2) approach each other as the temperature is increased.

13.1.2 The similarity between liquids and solids is confirmed
by X-ray diffraction analysis (33.4.1). At temperatures close
to that of solidification, the arrangement of the particles in
liquids is similar to the ordered arrangement of particles that is
typical of solidified liquids. The mutual arrangement of
neighbouring particles in liquids resembles the ordered arrangement
of neighbouring particles in crystals. In liquids, however, this
ordered arrangement is observed only within small volumes. At
distances $r > (3 \text{ to } 4)\, d$ from a certain selected "central"
molecule, order is no longer evident (here $d$ is the effective diameter
of the molecule, see Sect. 8.4.1). Such order in the arrangement
of the particles is called short-range order in liquids.

13.1.3 X-ray patterns of liquids do not differ from those of
polycrystalline bodies (33.4.5) consisting of extremely fine
crystals (having linear dimensions of the order of $10^{-9}$ m) that
are randomly oriented with respect to one another in so-called
cybotactic regions. Within the boundaries of these regions the
particle distribution is ordered, but the kind of order changes
from one cybotactic region to another. In the course of time,
intensive thermal motion at temperatures that are not too low
rapidly alters the arrangement and structure of the cybotactic
regions.


13.2  Frenkel's  Hole  Theory  of  the  Liquid  State 

13.2.1 The most important parameter determining the structure
and physical properties of liquids is the specific volume. In
melting a crystalline body the specific volume increases only
slightly, by approximately 10 per cent. Such an increase in the
specific volume takes place in a solid subject to negative
pressure equal to the ultimate strength (40.3.7) of the solid. This
enables a liquid to be dealt with as a body whose integrity has
been violated at various places. In melting crystalline bodies,
their particles acquire higher mobility. This accounts for the vital
property of fluidity of liquids, as well as the violation of
long-range order in crystals and the emergence of short-range order in
liquids (13.1.2). Moreover, owing to the high mobility of the
particles, microscopic gaps or microvoids, which shall be
called holes, appear in liquids. The thermal motion in liquids
causes the holes to disappear at random at certain places and to
appear simultaneously at others. This is equivalent to chaotic
movement of the holes.

13.2.2 The hole theory of the structure of liquids is inapplicable
to liquids subject to high external pressures of the order of
thousands of atmospheres, at which the compressibility of
liquids is comparable to that of solids. At high temperatures,
close to the critical one*, the structure and properties of a
liquid are close to those of a gas for which the concept of holes
is meaningless. Hence, the hole theory is inapplicable to
liquids under these conditions.

13.2.3 It follows from the nature of thermal motion in liquids
(8.1.5) that the molecules vibrate about a certain equilibrium
position for the length of time $\tau$, after which this equilibrium
position is displaced with a jump over a distance of an order of
magnitude equal to the average distance $\langle d \rangle$ between adjacent
molecules:

$$\langle d \rangle = \frac{1}{\sqrt[3]{n_0}} = \sqrt[3]{\frac{\mu}{N_A \rho}},$$

where $n_0$ is the number of molecules in unit volume, $N_A$ is
Avogadro's number (Appendix II), $\rho$ is the density of the liquid,
and $\mu$ is its molar mass. For water, for example, $\rho = 10^3$ kg/m$^3$,
$\mu = 0.018$ kg/mol and $\langle d \rangle \approx 3 \times 10^{-10}$ m.
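
The estimate for water in Sect. 13.2.3 can be reproduced with a few lines of Python. This is only a sketch: it assumes the relation $\langle d \rangle = n_0^{-1/3}$ with $n_0 = \rho N_A / \mu$, and the function name is chosen here for illustration.

```python
N_A = 6.022e23  # Avogadro's number, 1/mol


def mean_intermolecular_distance(density, molar_mass):
    """Average distance <d> between adjacent molecules, in metres.

    density    -- density of the liquid, kg/m^3
    molar_mass -- molar mass of the liquid, kg/mol
    """
    n0 = density * N_A / molar_mass  # number of molecules per unit volume, 1/m^3
    return n0 ** (-1.0 / 3.0)


# Values for water quoted in Sect. 13.2.3
d_water = mean_intermolecular_distance(1.0e3, 0.018)
print(f"<d> for water = {d_water:.2e} m")
```

The result is close to $3 \times 10^{-10}$ m, in agreement with the value quoted for water.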

13.2.4 The relaxation time is the average time $\langle \tau \rangle$ that a
molecule of the liquid remains in a temporarily settled state close to
a certain equilibrium position. As the temperature is raised
$\langle \tau \rangle$ decreases rapidly (13.2.5). This explains the high mobility
of molecules in a liquid at high temperatures and the low
viscosity of liquids under these conditions.

13.2.5 The expenditure of a certain amount of activation energy
$E$ is required for a molecule to pass from one equilibrium
position to another (13.2.4). Such a passage is regarded as
overcoming a potential barrier of height $E$ (38.7.1) because to
accomplish the passage the potential energy of the molecule should
increase by the amount $E$. Only after this can it pass over to a
new equilibrium position. The whole process turns out to be
possible because high energy is concentrated on some of the
molecules as a result of collisions in thermal motion. It is
transferred to them by other molecules. The dependence of the
relaxation time (13.2.4) on $E$ and the absolute temperature is
of the form

$$\langle \tau \rangle = \tau_0\, e^{E/kT},$$

where $k$ is Boltzmann's constant (8.4.5) and $\tau_0$ is the average
period of vibrations of the molecule about its equilibrium
position.

* Substances with an exceptionally low critical temperature are
not considered here.

13.2.6 When a liquid is subject to the action of an external
force for the length of time $t > \langle \tau \rangle$ the particles of the liquid
are displaced mainly in the direction of this force. This reveals
the fluidity of the liquid. But if $t < \langle \tau \rangle$ a particle has no
opportunity to change its equilibrium position during the time of
force action. In this case the liquid manifests elastic properties,
resisting changes in both volume and shape.

During the time $\langle \tau \rangle$ a particle of liquid is displaced, on an
average, over the distance $\langle d \rangle$, so that the average velocity $\langle v \rangle$
of displacement of the molecules is determined by the equation

$$\langle v \rangle = \frac{\langle d \rangle}{\langle \tau \rangle} \quad \text{or} \quad \langle v \rangle = \frac{\langle d \rangle}{\tau_0}\, e^{-E/kT}.$$

The average velocity of motion of molecules in liquids is quite
considerable, as a rule, but an order of magnitude less than the
average velocities of molecules of the vapour of the same
substance at the same temperatures.
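
The temperature dependence described in Sects. 13.2.5 and 13.2.6 can be illustrated numerically. The Python sketch below assumes the exponential law $\langle \tau \rangle = \tau_0 e^{E/kT}$ and $\langle v \rangle = \langle d \rangle / \langle \tau \rangle$. The values of $\tau_0$ and $E$ are hypothetical, chosen only to give a plausible order of magnitude; they are not taken from the handbook. The value of $\langle d \rangle$ is the one quoted for water in Sect. 13.2.3.

```python
import math

K_B = 1.380649e-23  # Boltzmann's constant, J/K


def relaxation_time(tau0, energy, temperature):
    """Relaxation time <tau> = tau0 * exp(E / kT), in seconds."""
    return tau0 * math.exp(energy / (K_B * temperature))


def mean_velocity(d, tau0, energy, temperature):
    """Average displacement velocity <v> = <d> / <tau>, in m/s."""
    return d / relaxation_time(tau0, energy, temperature)


# Hypothetical illustrative parameters (assumptions, not handbook data)
TAU0 = 1.0e-13    # s, assumed period of molecular vibrations
E_ACT = 1.9e-20   # J, assumed activation energy
D_MEAN = 3.0e-10  # m, <d> for water from Sect. 13.2.3

for T in (280.0, 300.0, 350.0):
    tau = relaxation_time(TAU0, E_ACT, T)
    v = mean_velocity(D_MEAN, TAU0, E_ACT, T)
    print(f"T = {T:.0f} K: <tau> = {tau:.2e} s, <v> = {v:.1f} m/s")
```

Whatever values are assumed for $\tau_0$ and $E$, the relaxation time falls and the average velocity rises as the temperature is increased, which is the behaviour stated in Sect. 13.2.4.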


13.3  Diffusion  and  Viscosity  Phenomena  in  Liquids 

13.3.1 If conditions required for transport phenomena (10.8.1)
arise, diffusion, heat conduction and viscosity are observed in
liquids. The difference between transport phenomena in liquids
and similar phenomena in gases shows up in the values of the
transport coefficients and in their dependence on the properties
of the liquids.

13.3.2 For a chemically homogeneous liquid the diffusion
coefficient $D$ (10.8.3) is calculated by the equation

$$D = \frac{1}{6}\,\frac{\langle d \rangle^2}{\tau_0}\, e^{-E/kT}.$$

The quantities are defined in Sects. 13.2.3, 13.2.4 and 13.2.5.
The diffusion coefficient grows rapidly with temperature. This
is due, for the most part, to the sharp drop in the relaxation
time $\langle \tau \rangle$ (13.2.4). Besides, the quantity $\langle d \rangle$ (13.2.3) increases
somewhat with the temperature $T$.
13.3.3 When the temperature approaches the critical value
(12.3.1), the average velocity $\langle v \rangle$ of the particles of the liquid
approaches the average velocity of molecules of a real gas and
the values of the diffusion coefficient $D$ for liquids become close
to those for gases.

At temperatures much lower than the critical one, the diffusion
coefficients for liquids are extremely small compared to those
for the corresponding vapours or gases at ordinary pressures.
For example, $D \approx 1.5 \times 10^{-9}$ m$^2$/s for water at $T = 300$ K,
while for the vapour of water (steam) in air at the same
temperature and at standard atmospheric pressure $D \approx 2 \times 10^{-5}$ m$^2$/s.
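
The diffusion coefficient of water quoted in Sect. 13.3.3 allows a rough estimate of the relaxation time. Combining the equation of Sect. 13.3.2 with the exponential law of Sect. 13.2.5 gives $D = \langle d \rangle^2 / (6 \langle \tau \rangle)$, and hence $\langle \tau \rangle = \langle d \rangle^2 / (6D)$. The Python sketch below applies this relation to the handbook values for water; the function name is illustrative only.

```python
def relaxation_time_from_diffusion(d, diffusion_coefficient):
    """Estimate <tau> = <d>^2 / (6 D), in seconds.

    d                     -- average intermolecular distance <d>, m
    diffusion_coefficient -- diffusion coefficient D of the liquid, m^2/s
    """
    return d ** 2 / (6.0 * diffusion_coefficient)


# Handbook values for water: <d> from Sect. 13.2.3, D at 300 K from Sect. 13.3.3
tau_water = relaxation_time_from_diffusion(3.0e-10, 1.5e-9)
print(f"<tau> for water at 300 K = {tau_water:.1e} s")
```

The estimate is of the order of $10^{-11}$ s, which suggests that a water molecule changes its equilibrium position a very large number of times each second.
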
13.3.4 At temperatures close to the critical one*, the thermal
motion in liquids acquires features that differ from those
described in Sect. 8.1.5 and approach those of thermal motion in
gases. Under these conditions the internal friction (viscosity) in
liquids is of the same nature as in gases (10.8.4).

At temperatures close to the melting point, the viscosity of
liquids cannot be accounted for in the same way as for gases.
The mechanism by means of which viscosity arises is a complex
one. The coefficient of dynamic viscosity $\eta$ (10.8.4) for liquids
may be related to the mobility $u_0$ of the molecule. This is defined
as the velocity $u$ gained by a molecule subject to an external
force $F$ equal to unity: $u_0 = u/F$. The relation between $\eta$ and
$u_0$ is an inversely proportional one: $\eta \propto u_0^{-1}$. In its turn
$u_0 \propto D/kT$, where $D$ is the diffusion coefficient, $T$ is the absolute
temperature, and $k$ is Boltzmann's constant (8.4.5). Consequently,
$\eta \propto T/D$ or $\eta \propto T \exp(E/kT)$, where $E$ is the activation
energy (13.2.5). As the temperature is raised the viscosity of
liquids drops rapidly, especially in the region of low temperatures.
At high pressures the viscosity of liquids increases rapidly
with pressure. This is due to the increase in the activation
energy (13.2.5) and the corresponding increase in the relaxation
time (13.2.4).
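
The proportionality $\eta \propto T \exp(E/kT)$ fixes only the ratio of viscosities at two temperatures, not their absolute values. The Python sketch below computes this ratio. The activation energy used is a hypothetical value chosen for illustration and is not taken from the handbook; the function name is likewise an assumption.

```python
import math

K_B = 1.380649e-23  # Boltzmann's constant, J/K


def viscosity_ratio(t1, t2, energy):
    """Return eta(t2) / eta(t1) for the law eta ~ T * exp(E / kT)."""
    return (t2 / t1) * math.exp(energy / K_B * (1.0 / t2 - 1.0 / t1))


E_ACT = 1.9e-20  # J, hypothetical activation energy (assumption)

ratio = viscosity_ratio(300.0, 350.0, E_ACT)
print(f"eta(350 K) / eta(300 K) = {ratio:.2f}")
```

For this assumed activation energy the viscosity falls by roughly 40 per cent between 300 K and 350 K. The exponential factor outweighs the linear growth of the factor $T$ as long as $E$ is large compared with $kT$.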


13.4  Surface  Tension  of  Liquids 

13.4.1 A molecule of a liquid in the surface layer is subject to
uncompensated, inwardly directed forces of attraction exerted
by the rest of the liquid. As a result, the surface layer exerts a
high internal pressure on the whole liquid. It is of the order of
tens of thousands of atmospheres.

* See the footnote to Sect. 13.2.2.

13.4.2  Particles  of  the  surface  layer  of  a  liquid  have  a  higher 
potential  energy  than  those  inside  the  liquid.  The  reason  for 
this  is  that  in  order  to  emerge  from  inside  the  liquid  to  its 
surface  the  molecules  must  do  work  to  overcome  the  inwardly 
directed  forces  of  internal  pressure  (13.4.1).  This  work  increases 
the  potential  energy  of  the  molecules  that  reach  the  surface. 

The work $W$ that must be performed to increase the area of the
surface layer of a liquid isothermally is equal to

$$W = \langle F_s - F_v \rangle N,$$

where $\langle F_s - F_v \rangle$ is the average difference between the free
energy (11.4.5) per molecule at the surface $F_s$ and within the
bulk $F_v$ of the liquid, and $N$ is the number of molecules in the
surface layer of the liquid.

13.4.3 The condition of minimum potential energy of a liquid,
required to maintain its stable equilibrium, is complied with
when the liquid has its minimum possible free surface area. The
state of stable equilibrium of a liquid incompressible body
corresponds to the minimum ratio of its surface area to its volume.
Therefore, small drops of liquid suspended in air are of spherical
shape. A liquid tends to reduce the area of its free surface.
Consequently, the surface layer resembles a stretched elastic
film in which tensile forces act (see also Sect. 13.4.5).

13.4.4 The work done in the isothermal formation of unit
surface area is called the surface tension $\sigma$ of the given liquid at
the boundary with another phase (12.3.3):

$$\sigma = \frac{W}{A} = \langle F_s - F_v \rangle \frac{N}{A} = \langle F_s - F_v \rangle\, n_1,$$

where $n_1 = N/A$ is the number of molecules per unit area of
the surface layer. The surface tension is also calculated by the
equation

$$\sigma = \frac{\Delta F}{\Delta A},$$

where $\Delta F$ is the change in the free energy of the surface layer,
and $\Delta A$ is the change in the surface area.

The surface tension $\sigma$ depends upon the chemical composition
of the liquid and its temperature. It decreases when the
temperature is raised and vanishes at the critical temperature
(12.3.1). The surface tension can be reduced by adding
surface-active substances to the liquid. This is due to the fact that such
substances are adsorbed on the surface layer of the liquid and
reduce the free energy of this layer (11.4.5).

13.4.5 There is an essential difference between the surface
layer of a liquid and an elastic film (13.4.3). The surface
tension of liquids does not depend upon the surface area and tends
to reduce this area to zero. The tension of an elastic film is
proportional to its deformation and equals zero at a definite finite
area of the film's surface.

This  peculiarity  in  the  properties  of  liquid  films  is  associated 
with  the  change  in  the  number  of  molecules  in  the  surface  layer 
when  these  films  are  stretched  (or  compressed).  But  the  average 
distances  between  the  molecules  and  the  forces  of  intermolecular 
interaction  determined  by  these  distances  do  not  change.  This 
is why the magnitude of the surface tension is independent of
the area of the free surface of the liquid.

13.4.6 If the surface of a liquid is bounded by a wetted
perimeter (13.5.1), then the quantity $\sigma$ is equal to the force acting
on unit length of the wetted perimeter, perpendicular to this
perimeter. This force lies in a plane tangent to the free surface
of the liquid.


13.5  Wetting  and  Capillary  Phenomena 

13.5.1 The free surface of a liquid, curved adjacent to the wall
of the vessel holding the liquid, is called the meniscus. The line
along which the meniscus contacts the wall is called the wetted
perimeter. The meniscus is characterized by the contact angle $\theta$
between the wetted surface of the wall and the meniscus at their
point of contact. When $\theta < \pi/2$ (Fig. 13.1a), the liquid is said
to wet the wall (to be a wetting liquid); when $\theta > \pi/2$ (Fig. 13.1b)
the liquid does not wet the wall and is a nonwetting liquid.
Wetting (nonwetting) is said to be ideal when $\theta = 0$ ($\theta = \pi$).
Here the meniscus is of spherical shape and is concave or
convex. The absence of both wetting and nonwetting corresponds to
the condition $\theta = \pi/2$, in which case the liquid has a plane
free surface.

13.5.2 The formation of a meniscus is due to the interaction of
the molecules with one another and with the particles of the
rigid body (wall). Molecule A of the surface layer, located close
to the wall of the vessel and having a sphere of molecular action
(12.2.3) of radius $R_m$ (range of molecular action) (Fig. 13.2), is
subject to the resultant force of attraction exerted on it by all
the other molecules of the liquid ($\mathbf{F}_1$) and by all the particles
of the wall ($\mathbf{F}_2$). Force $\mathbf{F}_2$ is perpendicular to the wall. This
follows from considerations of symmetry. The direction of force
$\mathbf{F}_1$ depends upon the shape of the meniscus and the position of
molecule A with respect to the wall. For example, if the meniscus
is flat (13.5.1) and molecule A is right at the wall, force $\mathbf{F}_1$
is directed at an angle of 45° to the wall (Fig. 13.3). Molecule A
can be in equilibrium only when the resultant force
$\mathbf{F} = \mathbf{F}_1 + \mathbf{F}_2$ is perpendicular to the surface of the liquid.* Otherwise,
molecule A would move along this surface.

[Fig. 13.1 (a), (b)]

13.5.3 The shape of the meniscus is determined by the three
possible directions of force $\mathbf{F}$:

(a) force $\mathbf{F}$ is parallel to the surface of the wall; the liquid has
a plane surface and $\theta = \pi/2$ (Fig. 13.4a);

(b) force $\mathbf{F}$ is directed towards the wall; the forces of attraction
exerted by the wall on molecule A exceed the forces of attraction
exerted on the molecule by all the other molecules of the
liquid. The liquid has a concave meniscus with $\theta < \pi/2$, i.e.
the liquid wets the wall (Fig. 13.4b);

(c) force $\mathbf{F}$ is directed towards the liquid; the forces of attraction
exerted by the molecules of the liquid on molecule A
prevail over those exerted on it by the particles of the wall. The
meniscus is convex with $\theta > \pi/2$, i.e. the liquid does not wet
the wall (Fig. 13.4c).

* Here we ignore the gravity force acting on molecule A because
it is negligibly small compared to forces $\mathbf{F}_1$ and $\mathbf{F}_2$.

[Fig. 13.2] [Fig. 13.3]

13.5.4 The curved shape of the surface layer is the cause of the
pressure $\Delta p$, additional to the external pressure, acting on the
liquid. This additional pressure is due to the surface tension.
This is similar to the way in which a stretched elastic envelope
(like a rubber balloon) exerts pressure on the gas it contains.
The additional pressure exerted on a liquid with a surface layer
of arbitrary shape is calculated by the equation

$$\Delta p = \sigma \left( \frac{1}{R_1} + \frac{1}{R_2} \right),$$

where $\sigma$ is the surface tension (13.4.4), and $R_1$ and $R_2$ are the
radii of curvature of any two mutually perpendicular normal
sections* of the surface of the liquid at the point where
molecule A is located. The radius of curvature $R_1$ (or $R_2$) is taken
positive if its centre in the corresponding section is inside the
liquid. Otherwise, the radius of curvature is taken to be negative.
Thus, $\Delta p > 0$ when the meniscus is convex and $\Delta p < 0$
when it is concave. In the case of a plane surface
$R_1 = R_2 = \infty$, and there is no additional pressure ($\Delta p = 0$). The
preceding equation is called the Laplace formula. For a spherical
surface $R_1 = R_2 = R$ and $\Delta p = 2\sigma/R$. Such a surplus pressure
exists, for instance, within a bubble of gas of radius $R$, located
inside a liquid near the surface.

The surplus pressure inside a soap bubble of radius $R$ is due to
the action of both superficial layers of the thin spherical film
of soap (inside and outside the bubble). Thus $\Delta p = 4\sigma/R$.
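
The cases listed in Sect. 13.5.4 can be compared numerically. The Python sketch below evaluates the additional pressure $\Delta p = \sigma (1/R_1 + 1/R_2)$ for a plane surface, a gas bubble in a liquid and a soap bubble. The surface tension values are typical room-temperature figures and are assumptions, not handbook data; the function names are illustrative.

```python
import math


def laplace_pressure(sigma, r1, r2):
    """Additional pressure under a curved surface, Pa.

    sigma  -- surface tension, N/m
    r1, r2 -- radii of curvature of two perpendicular normal sections, m
              (positive if the centre of curvature lies inside the liquid)
    """
    return sigma * (1.0 / r1 + 1.0 / r2)


def soap_bubble_pressure(sigma, radius):
    """Surplus pressure inside a soap bubble: two surface layers, 4*sigma/R."""
    return 2.0 * laplace_pressure(sigma, radius, radius)


SIGMA_WATER = 0.073  # N/m, assumed value for water near room temperature
SIGMA_SOAP = 0.025   # N/m, assumed value for a soap solution

# Plane surface: both radii infinite, no additional pressure
print(laplace_pressure(SIGMA_WATER, math.inf, math.inf))

# Gas bubble of radius 1 mm in water: 2*sigma/R
print(laplace_pressure(SIGMA_WATER, 1.0e-3, 1.0e-3))

# Soap bubble of radius 1 cm: 4*sigma/R
print(soap_bubble_pressure(SIGMA_SOAP, 1.0e-2))
```

Treating a plane surface as the limit of infinite radii keeps a single function for all three cases, since `1.0 / math.inf` evaluates to zero in Python.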

13.5.5 The level of a liquid in narrow cylindrical vessels
(capillary tubes) of radius $r$ differs from the level of the liquid in a
larger vessel into which the capillary tube is placed vertically.
The level in the capillary tube is higher (lower) than in the
larger vessel by the amount $h$ if the liquid wets (does not wet) the
walls of the vessel. Here

$$h = \frac{2\sigma \cos\theta}{r g \rho},$$

where $\theta$ is the contact angle (13.5.1), $\rho$ is the density of the
liquid, and $g$ is the acceleration due to gravity (7.3.3). This is
called Jurin's rule.
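
Jurin's rule can be illustrated with two contrasting liquids. The Python sketch below evaluates $h = 2\sigma \cos\theta / (r g \rho)$ for water and for mercury in a glass capillary. All material constants (surface tensions, contact angles and densities) are typical textbook figures and are assumptions, not values from this handbook; the function name is illustrative.

```python
import math

G = 9.81  # acceleration due to gravity, m/s^2


def capillary_rise(sigma, theta_deg, radius, density):
    """Height h of the liquid column in a capillary tube, m.

    Positive h means the level rises (wetting liquid),
    negative h means the level is depressed (nonwetting liquid).
    """
    theta = math.radians(theta_deg)
    return 2.0 * sigma * math.cos(theta) / (radius * G * density)


RADIUS = 0.5e-3  # m, assumed capillary radius

# Water in glass: assumed ideal wetting (theta = 0)
h_water = capillary_rise(0.073, 0.0, RADIUS, 1.0e3)

# Mercury in glass: assumed contact angle of 140 degrees (nonwetting)
h_mercury = capillary_rise(0.47, 140.0, RADIUS, 13.6e3)

print(f"water:   h = {h_water * 1e3:.1f} mm")
print(f"mercury: h = {h_mercury * 1e3:.1f} mm")
```

Under these assumptions the water level rises by about 3 cm, while the mercury level is depressed by about 1 cm. The sign of $h$ follows directly from the sign of $\cos\theta$.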

If the liquid is in a narrow slit of constant width $\delta$ between two
parallel plates, the meniscus is of cylindrical shape having the radius
$\delta/2$ and the level in the slit is above (for a wetting liquid) or below
(for a nonwetting liquid) the level in the vessel in which the plates are
immersed by the amount

$$h = \frac{2\sigma \cos\theta}{\delta g \rho}.$$

* A normal section of the surface through point A is the curve obtained as
the intersection of the surface and a plane passed through the normal to the
surface at this point. Radii $R_1$ and $R_2$ are called the principal radii of
curvature of the surface layer.
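
Jurin's rule and its analogue for a slit lend themselves to a short numerical sketch. The function names, the default value of $g$, and the material constants below are illustrative assumptions rather than handbook data.

```python
import math

G = 9.81  # m/s^2, assumed value of the acceleration due to gravity


def capillary_rise(sigma, theta, r, rho, g=G):
    """Jurin's rule: h = 2*sigma*cos(theta) / (r*g*rho) for a tube of radius r."""
    return 2.0 * sigma * math.cos(theta) / (r * g * rho)


def slit_rise(sigma, theta, delta, rho, g=G):
    """Level difference for a slit of width delta between parallel plates."""
    return 2.0 * sigma * math.cos(theta) / (delta * g * rho)


if __name__ == "__main__":
    # Water in a tube of radius 0.5 mm with complete wetting (theta = 0).
    print(capillary_rise(0.073, 0.0, 0.5e-3, 1000.0))
    # Mercury in glass (theta near 140 degrees): the level is depressed, h < 0.
    print(capillary_rise(0.47, math.radians(140.0), 0.5e-3, 13600.0))
```

A negative result corresponds to a nonwetting liquid, whose level in the tube lies below that in the larger vessel.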


13.5.6 The pressure of saturated vapour (12.3.2) above the curved surface of a
liquid depends upon the shape of the meniscus. If the meniscus is concave
(convex), the pressure of the saturated vapour is less (greater) than that
above a plane surface by the amount $\Delta p_{ms}$:

$$\Delta p_{ms} = \frac{\rho}{\rho_l} \Delta p,$$

where $\rho$ is the density of the saturated vapour, $\rho_l$ is the density
of the liquid, and $\Delta p$ is the additional pressure due to the curvature
of the surface (13.5.4).


13.6  Vaporization  and  Boiling  of  Liquids 

13.6.1 Vaporization, or evaporation, is the process of vapour formation at the
free surface of a liquid. It takes place at any temperature and increases as
the temperature is raised. There are molecules in the surface layer of a
liquid that have high velocities and kinetic energies of thermal motion. The
escape of such molecules from the surface into the space above the liquid
explains evaporation, together with the consequent reduction of the store of
internal energy and cooling of the liquid. A measure of the process of
vaporization is the rate of evaporation $u$, which is the amount of liquid
converted into vapour in unit time. The rate $u$ depends upon the external
pressure and the motion of the gaseous phase above the free surface of the
liquid. Thus

$$u = \frac{cA}{p_0} (p_v - p),$$

where $c$ is a constant, $A$ is the area of the free surface of the liquid,
$p_v$ is the pressure of saturated vapour, $p$ is the pressure of the vapour
above the free surface of the liquid being vaporized, and $p_0$ is the
external barometric pressure.

13.6.2 Boiling is the process of intense vaporization of a liquid, not only
from its free surface, but throughout the whole volume of the liquid within
the bubbles of vapour that are produced in boiling. The bubbles of vapour in a
boiling liquid rapidly increase in size, rise to the surface and burst. This
causes the turbulence in a boiling liquid. The pressure $p$ within a bubble of
vapour located within the liquid is the sum of the external pressure $p_0$,
the hydrostatic pressure $p_l$ of the overlying layers of liquid, and the
additional pressure $\Delta p$ due to the surface tension (13.5.4). Thus

$$p = p_0 + p_l + \Delta p,$$

and

$$p_l = \rho g h \quad \text{and} \quad \Delta p = \frac{2\sigma}{r},$$

where $r$ is the radius of the vapour bubble, $h$ is the distance from its
centre to the surface of the liquid, and $\rho$ and $\sigma$ are the density
and surface tension of the liquid.

The boiling of a liquid begins at the temperature at which the pressure $p_v$
of the saturated vapour inside a bubble is at least as high as the pressure
$p$ (13.6.2):

$$p_v \geq p_0 + \rho g h + \frac{2\sigma}{r}.$$

If this condition is not satisfied, the bubble collapses and the vapour it
contains is condensed.
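
The boiling condition can be made concrete with a small sketch that adds up the three pressure terms. The function names and the numbers used (approximate properties of water near its boiling point) are illustrative assumptions.

```python
def bubble_pressure(p0, rho, g, h, sigma, r):
    """Pressure (Pa) inside a vapour bubble: p = p0 + rho*g*h + 2*sigma/r."""
    return p0 + rho * g * h + 2.0 * sigma / r


def can_boil(p_v, p0, rho, g, h, sigma, r):
    """True if the saturated vapour pressure p_v is at least the bubble pressure."""
    return p_v >= bubble_pressure(p0, rho, g, h, sigma, r)


if __name__ == "__main__":
    p0, rho, g, h, sigma = 101325.0, 958.0, 9.81, 0.1, 0.059  # illustrative
    for r in (1e-6, 1e-4, 1e-3):
        # The surface-tension term 2*sigma/r dominates for very small bubbles.
        print(r, bubble_pressure(p0, rho, g, h, sigma, r))
```

With these numbers the surface-tension term exceeds atmospheric pressure for a bubble of radius one micrometre and is negligible for a radius of one millimetre.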

13.6.3 With small radii $r$ of the vapour bubbles, the pressure $p_v$ must be
quite high and it is necessary to heat the liquid to a high temperature before
it starts boiling. If the liquid contains centres of vaporization (dust
specks, bubbles of dissolved gases, etc.), boiling begins at a much lower
temperature. This is due to the fact that the vapour bubbles that appear on
the centres of vaporization are of such a size that the effect of the third
term in the inequality of Sect. 13.6.2 can be neglected. Moreover,
$\rho g h \ll p_0$ as a rule, so that the approximate condition for the
beginning of boiling can be written in the form

$$p_v \geq p_0.$$

The temperature of a liquid at which the pressure of its saturated vapour is
equal to the external pressure is called the boiling point of the liquid.

13.6.4 When a liquid boils at constant pressure, its temperature remains
constant. The quantity of heat transferred to a boiling liquid is completely
expended in converting the molecules of the liquid into vapour. The heat
$r_b$ required to vaporize unit mass of a liquid heated to the boiling point
is called the heat of vaporization. The value of $r_b$ decreases as the
boiling point is raised and vanishes at the critical temperature (12.3.1).

The  change  in  the  internal  energy  of  a  liquid  (9.1.1)  as  a  unit 
of  its  mass  is  converted  into  vapour  at  the  boiling  point  is 
called  the  internal  heat  of  vaporization. 

13.6.5 The boiling of liquids and the condensation of vapour are examples of
first-order phase transitions (cf. Sect. 12.4.2). Typical of such phase
transitions are the simultaneously constant values of pressure and
temperature, together with a changing ratio of the masses of the two phases
(12.3.3). For a first-order phase transition to occur, the heat $r_b$
(specific latent heat of vaporization) must be supplied to or taken away from
the system. The heat $r_b$ per unit mass is calculated by the
Clausius-Clapeyron equation:

$$r_b = (v_2 - v_1) T \frac{dp}{dT},$$

where $v_1$ and $v_2$ are the specific volumes of the substance in the initial
and final phases, $T$ and $p$ are the temperature and pressure of the phase
transition.
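
A brief numerical sketch of the Clausius-Clapeyron equation, and of the same relation solved for $dT/dp$, is given here. The function names are hypothetical, and the property values for water at atmospheric pressure are approximate textbook figures assumed for illustration.

```python
def latent_heat(v1, v2, temperature, dp_dT):
    """Clausius-Clapeyron equation: r_b = (v2 - v1) * T * dp/dT, in J/kg."""
    return (v2 - v1) * temperature * dp_dT


def boiling_point_slope(v_liquid, v_vapour, temperature, r_b):
    """Same relation solved for dT/dp = (v_v - v_l) * T / r_b, in K/Pa."""
    return (v_vapour - v_liquid) * temperature / r_b


if __name__ == "__main__":
    # Approximate values for water boiling at atmospheric pressure (assumed).
    v_l, v_v = 1.04e-3, 1.673   # specific volumes, m^3/kg
    T, r_b = 373.15, 2.257e6    # K, J/kg
    slope = boiling_point_slope(v_l, v_v, T, r_b)
    print(slope)                # K per Pa
    print(slope * 1000.0)       # K per kPa
```

Under these assumed values the boiling point of water shifts by roughly a quarter of a kelvin per kilopascal near atmospheric pressure.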

13.6.6 It follows from the Clausius-Clapeyron equation for the boiling of
liquids that

$$\frac{dT}{dp} = \frac{(v_v - v_l) T}{r_b},$$

where $v_l$ and $v_v$ are the specific volumes of the liquid and vapour at the
boiling point $T$. Since $v_v > v_l$ and $r_b > 0$, $dT/dp > 0$, i.e. the
boiling point increases with pressure. For instance, at the pressure
$p = 1.25 \times 10^7$ Pa water can be heated without boiling to a temperature
at which lead can be melted in the water. The melting point of lead is 600 K.

CHAPTER 14 ELECTRIC CHARGES.
COULOMB'S LAW

14.1  Introduction 

14.1.1  Electrostatics  is  the  branch  of  the  study  of  electricity 
that  deals  with  the  interaction  and  properties  of  a  system  of 
electric  charges  that  are  fixed  with  respect  to  the  chosen  inertial 
frame  of  reference  (2.1.2). 

There are two kinds of electric charges, positive and negative. Bodies having
unlike charges attract one another; those having like charges repel one
another.

14.1.2 The electric charge of any system of bodies consists of a whole number
of elementary charges, each equal to $1.6 \times 10^{-19}$ C (where C is the
abbreviation of coulomb, the unit of charge). The stable particle with the
minimum rest mass (5.6.1) and a negative elementary charge is the electron.
The rest mass of the electron is $9.1 \times 10^{-31}$ kg. The stable particle
with the minimum rest mass and a positive elementary charge is the positron
(43.5.1).* Its rest mass is the same as that of the electron. In addition,
there is a stable particle with a positive elementary charge called the
proton. The rest mass of the proton is $1.67 \times 10^{-27}$ kg. Electrons
and protons are found in the atoms of all the chemical elements.

14.1.3 The law of conservation of electric charge states: the algebraic sum of
the electric charges of bodies or particles that constitute an electrically
isolated system remains constant regardless of any processes that occur in the
system.

New electrically charged particles may be formed in the system being
considered. These may be electrons formed by the ionization of atoms or
molecules (22.4.1), ions formed by the phenomenon of electrolytic
dissociation, etc. If the system is electrically isolated, however, then the
algebraic sum of the charges of all the newly formed particles of the system
always equals zero. The law of the conservation of electric charge is one of
the fundamental conservation laws, like the laws of conservation of momentum
(2.7.1) and of energy (3.4.2).

* Here we do not consider the instability of the positron, common in our
world, that is associated with the annihilation of an electron-positron pair
(43.5.1).

14.1.4 When two electrically neutral bodies are brought into contact, charges
pass from one body to the other as a result of friction. Each of the two
bodies then no longer has equal numbers of positive and negative charges, and
the bodies become oppositely charged.

When a body is electrified by induction, the charges in it are no longer
uniformly distributed. The charges are redistributed so that one part of the
body has an excess of positive charges and the other part an excess of
negative charges. If these two parts of the body are separated in some way,
they will be oppositely charged.


14.2  Coulomb's  Law 

14.2.1 Coulomb experimentally established that the force of interaction
$F_{12}$ between two small charged spheres, with charges equal to $q_1$ and
$q_2$ (14.1.1), is proportional to the product $q_1 q_2$ and inversely
proportional to the square of the distance $r$ between the spheres:

$$F_{12} = k_1 \frac{q_1 q_2}{r^2},$$

where $k_1$ is the proportionality factor ($k_1 > 0$).

14.2.2 The forces acting on the charges are central forces, directed along the
straight line joining the centres of the charges. In the interaction of like
charges the product $q_1 q_2 > 0$ and $F_{12} > 0$. This corresponds to mutual
repulsion. For unlike charges $q_1 q_2 < 0$ and $F_{12} < 0$, which
corresponds to mutual attraction. The vector form of the Coulomb law is

$$\mathbf{F}_{12} = k_1 \frac{q_1 q_2}{r^2} \frac{\mathbf{r}_{12}}{r},$$

where $\mathbf{F}_{12}$ is the force exerted on charge $q_1$ by charge $q_2$,
$\mathbf{r}_{12}$ is the radius vector joining charge $q_2$ to $q_1$, and
$r = |\mathbf{r}_{12}|$ (Fig. 14.1a). The force $\mathbf{F}_{21}$ exerted on
charge $q_2$ by charge $q_1$ equals

$$\mathbf{F}_{21} = k_1 \frac{q_1 q_2}{r^2} \frac{\mathbf{r}_{21}}{r},$$

where $\mathbf{r}_{21} = -\mathbf{r}_{12}$ is the radius vector joining charge
$q_1$ to $q_2$ (Fig. 14.1b). Forces $\mathbf{F}_{12}$ and $\mathbf{F}_{21}$
are said to be Coulomb forces.
14.2.3 Coulomb's law is valid for the interaction of point electric charges
that are fixed in the given inertial frame of reference. Point charges are
charged bodies whose linear dimensions $d$ are very much smaller than the
distance $r$ between them ($d \ll r$).

Coulomb's law states: the force of electrostatic interaction between two point
electric charges is proportional to the product of the magnitudes of the
charges, inversely proportional to the square of the distance between them and
acts along the straight line joining them.

In the form given in Sect. 14.2.2 Coulomb's law is also valid for the
interaction of charged spheres of radii $R_1$ and $R_2$ if the charges $q_1$
and $q_2$ of the spheres are uniformly distributed over their surfaces or
throughout their volumes. Radii $R_1$ and $R_2$ may be commensurable with the
distance $r$ between the centres of the spheres.

14.2.4 A Coulomb force (14.2.2) depends upon the properties of the medium in
which the interacting charges are located. This dependence can be taken into
account if we treat the factor $k_1$ in the formulas of Sects. 14.2.1 and
14.2.2 as the ratio of two factors:

$$k_1 = \frac{k}{\varepsilon_r},$$

where $k$ is a factor depending only on the choice of the system of units of
measurement and $\varepsilon_r$ is the dimensionless relative permittivity, or
dielectric constant, of the medium, which characterizes its electric
properties. It is assumed here that the medium is boundless, homogeneous and
isotropic, i.e. that its properties are the same throughout its volume and are
independent of the direction. For a vacuum, $\varepsilon_r$ is considered to
equal unity.

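14.2.5 For a medium with the relative dielectric constant $\varepsilon_r$,
Coulomb's law takes the form

$$F = k \frac{q_1 q_2}{\varepsilon_r r^2}.$$

For charges $q_1$ and $q_2$ located in vacuum ($\varepsilon_r = 1$),

$$F_0 = k \frac{q_1 q_2}{r^2}.$$

From this follows the meaning of the relative dielectric constant:

$$\varepsilon_r = F_0 / F.$$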

14.2.6 The concept of $\varepsilon_r$, introduced in Sects. 14.2.4 and 14.2.5,
has a meaning only under the conditions stipulated in Sect. 14.2.4. An
$\varepsilon_r$-fold reduction of $F$ is due to a phenomenon known as
electrostriction, i.e. the deformation of a dielectric (18.1.1) from the
effect of charged bodies. Liquid and gaseous dielectrics are directly adjacent
to the charged bodies and, when deformed, exert an additional mechanical
action on them, reducing the Coulomb force (14.2.2). In solid dielectrics,
charges $q_1$ and $q_2$ on the bodies may be located within certain cavities.
This makes the calculation of forces $F_{12}$ and $F_{21}$ (14.2.2)
considerably more difficult. These forces depend on the shapes of the cavities
in which the charges $q_1$ and $q_2$ are located. In this case the
interpretation of $\varepsilon_r$ given in Sect. 14.2.5 cannot be correct.

14.2.7 In SI units (Appendix I) the factor $k$ in the formulas of Sect. 14.2.5
is taken equal to

$$k = \frac{1}{4\pi\varepsilon_0},$$

and Coulomb's law is written in the form

$$F = \frac{1}{4\pi\varepsilon_0} \frac{q_1 q_2}{\varepsilon_r r^2}.$$

This form of Coulomb's law and of all laws following from it is said to be
rationalized.

The quantity $\varepsilon_0$ is called the permittivity of free space or, in
the SI, the electric constant. It is equal to
$8.85 \times 10^{-12}$ C$^2$/(N·m$^2$) (Appendix II).

The product $\varepsilon_0 \varepsilon_r$ is sometimes called the absolute
dielectric constant, or absolute permittivity of the medium.

In the cgse system of units (Appendix I), Coulomb's law is written in the
following nonrationalized form:

$$F = \frac{q_1 q_2}{\varepsilon_r r^2}.$$
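
The vector form of Coulomb's law in SI units can be sketched in a few lines of Python. The function name, the tuple representation of positions and the sample charges are assumptions made for illustration; the medium is taken to be boundless, homogeneous and isotropic, as stipulated in Sect. 14.2.4.

```python
import math

EPS0 = 8.85e-12  # C^2/(N*m^2), electric constant as quoted in the text


def coulomb_force(q1, q2, pos1, pos2, eps_r=1.0):
    """Force (N) exerted on charge q1 at pos1 by charge q2 at pos2, SI units.

    Implements F12 = q1*q2 / (4*pi*eps0*eps_r*r^2) * (r12/r), where r12 points
    from q2 to q1. Positions are (x, y, z) tuples in metres.
    """
    r12 = tuple(a - b for a, b in zip(pos1, pos2))
    r = math.sqrt(sum(c * c for c in r12))
    scale = q1 * q2 / (4.0 * math.pi * EPS0 * eps_r * r ** 3)
    return tuple(scale * c for c in r12)


if __name__ == "__main__":
    a, b = (0.1, 0.0, 0.0), (0.0, 0.0, 0.0)
    print(coulomb_force(1e-6, 1e-6, a, b))               # like charges: repulsion
    print(coulomb_force(1e-6, -1e-6, a, b))              # unlike charges: attraction
    print(coulomb_force(1e-6, 1e-6, a, b, eps_r=81.0))   # reduced in a dielectric
```

Swapping the roles of the two charges reverses the sign of every component, in line with $\mathbf{r}_{21} = -\mathbf{r}_{12}$.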


CHAPTER  15  ELECTRIC  FIELD  STRENGTH 
AND  DISPLACEMENT 

15.1  Electric  Field.  Field  Strength 

15.1.1 A force field is a form of matter that is studied in physics along with
other forms of matter commonly known as substances. The most basic feature of
force fields is that various kinds of interaction are accomplished by means of
them. A gravitational field (6.2.1), for instance, accomplishes gravitational
interaction between bodies of various masses located in the field. All force
fields have vital properties that characterize their material nature.
Primarily this concerns their energy. There are no impassable barriers between
a field and material substances; one can be converted into the other (43.5.5).

A physical field cannot be defined as space in which certain forces act.
Space, like time, is a form of the existence of matter. The fact that a field
exists in space by no means implies that the field can be identified with
space, because the form of existence of matter cannot be confused with the
matter itself.

15.1.2 Coulomb interaction (14.2.2) between fixed electrically charged
particles or bodies is accomplished through the electrostatic field set up by
the charges. An electrostatic field is a steady-state electric field, i.e. one
that does not change in the course of time, and is set up by fixed electric
charges. This is one of the forms of an electromagnetic field by means of
which interaction is accomplished between electrically charged particles (or
bodies) that travel, in the general case, in some arbitrary way with respect
to the chosen frame of reference.

A typical property of an arbitrary electric field, distinguishing it from all
other physical fields, is the action it exerts on both travelling and fixed
electric charges (charged particles and bodies).

15.1.3 The quantitative force characteristic of the action exerted by an
electric field on charged particles and bodies is the vector $\mathbf{E}$ of
the electric field strength (formerly called the electric intensity). The
electric field strength at a given point is the ratio of the force
$\mathbf{F}$, exerted by the field on a fixed point (14.2.3) electric test
charge placed at the point of the field being considered, to the magnitude
$q_0$ of the charge:

$$\mathbf{E} = \frac{\mathbf{F}}{q_0}.$$

It is assumed that the test charge $q_0$ is so small that its presence does
not lead to a redistribution in space of the charges that set up the field
being investigated. In other words, the test charge does not distort the
field.

An electric field is said to be uniform (a uniform electric field) if the
electric field vector $\mathbf{E}$ has a constant magnitude and direction at
any point of the field.

15.1.4 The strength of the electrostatic field of point charge $q$ at a point
located at the distance $r$ from the charge is

$$\mathbf{E} = \frac{1}{4\pi\varepsilon_0} \frac{q}{\varepsilon_r r^2} \frac{\mathbf{r}}{r} \quad \text{(in SI units)},$$

$$\mathbf{E} = \frac{q}{\varepsilon_r r^2} \frac{\mathbf{r}}{r} \quad \text{(in cgse units)},$$

where $\mathbf{r}$ is the radius vector joining charge $q$ and the point at
which the field strength is to be calculated. Vectors $\mathbf{E}$ are
directed at all points of the field radially away from charge $q$ when
$q > 0$, and directed radially toward the charge when $q < 0$. The projection
$E_r$ of the field strength on the direction of radius vector $\mathbf{r}$ is

$$E_r = \frac{1}{4\pi\varepsilon_0} \frac{q}{\varepsilon_r r^2} \quad \text{(in SI units)},$$

$$E_r = \frac{q}{\varepsilon_r r^2} \quad \text{(in cgse units)}.$$

These formulas are also used to calculate the strength of the field set up by
a hollow sphere, having the charge $q$ on its surface, at the distance
$r > R$ from the centre of the sphere, whose radius equals $R$. Inside the
hollow sphere $E_r = 0$.
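
The SI formulas of this section can be illustrated by the following sketch. The function names and sample values are assumptions; the hollow sphere is assumed to carry its charge uniformly, and the point charge sits at the origin.

```python
import math

EPS0 = 8.85e-12  # C^2/(N*m^2), electric constant (14.2.7)


def point_charge_field(q, r_vec, eps_r=1.0):
    """Field strength vector (V/m) at radius vector r_vec from point charge q."""
    r = math.sqrt(sum(c * c for c in r_vec))
    scale = q / (4.0 * math.pi * EPS0 * eps_r * r ** 3)
    return tuple(scale * c for c in r_vec)


def shell_field_radial(q, r, shell_radius, eps_r=1.0):
    """Radial projection E_r for a hollow sphere with charge q on its surface."""
    if r < shell_radius:
        return 0.0  # no field inside the hollow sphere
    return q / (4.0 * math.pi * EPS0 * eps_r * r ** 2)


if __name__ == "__main__":
    print(point_charge_field(1e-9, (0.0, 0.0, 1.0)))    # directed away from q > 0
    print(point_charge_field(-1e-9, (0.0, 0.0, 1.0)))   # directed toward q < 0
    print(shell_field_radial(1e-9, 0.05, 0.1))          # inside the sphere
    print(shell_field_radial(1e-9, 1.0, 0.1))           # outside the sphere
```

Outside the sphere the radial projection coincides with that of a point charge placed at its centre.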

15.1.5 The force $\mathbf{F}$ exerted by an electric field on an arbitrary
charge $q$ located at a given point in the field is

$$\mathbf{F} = q\mathbf{E},$$

where $\mathbf{E}$ is the field strength at the point where charge $q$ is
located. This is the strength of the field as distorted by charge $q$, i.e. of
a field that differs from the one set up before charge $q$ was introduced.

15.1.6 Electrostatic fields are represented graphically by employing lines of
force.

Lines of force are imaginary curves drawn in such a way that their direction
at any point (i.e. the direction of the tangent to the line at the point)
coincides with that of the electric field vector. Lines of force are assumed
to have the same direction as the corresponding electric field vectors. Lines
of force do not intersect because at each point of the field vector
$\mathbf{E}$ has only a single direction.

Lines of force are not identical with the paths of motion of light charged
particles in an electrostatic field. The tangent to the path of a particle at
each point gives the direction of its velocity, whereas the tangent to a line
of force gives the direction of the force acting on a charged particle and,
consequently, of the acceleration of the particle.

15.2  Principle  of  Superposition  of  Electric  Fields 

15.2.1 The basic problem of electrostatics can be formulated as follows: find
the magnitude and direction of the electric field vector $\mathbf{E}$ for each
point of a field from the given space distribution and magnitudes of the
electric charges that are the source of the field.



15.2.2 When the field is set up by a system of fixed charges
$q_1, q_2, \ldots, q_n$, the resultant force $\mathbf{F}$ exerted on a test
charge $q_0$ (15.1.3) at any point of the field being considered is equal to
the vector sum of the forces $\mathbf{F}_i$ exerted on charge $q_0$ by the
fields set up by each of the charges $q_i$:

$$\mathbf{F} = \sum_{i=1}^{n} \mathbf{F}_i.$$

According to Sect. 15.1.3, $\mathbf{F} = q_0 \mathbf{E}$ and
$\mathbf{F}_i = q_0 \mathbf{E}_i$, where $\mathbf{E}$ is the strength of the
resultant field, and $\mathbf{E}_i$ is the strength of the field set up by
charge $q_i$. Following from the preceding formulas is the principle of
independent action of electric fields, or the principle of superposition of
electric fields:

$$\mathbf{E} = \sum_{i=1}^{n} \mathbf{E}_i.$$

The strength of an electric field of a system of charges is equal to the
vector sum of the strengths of the fields set up by each charge separately.
The strength of the resultant field is found by superposition of the strengths
of the fields of the separate charges.

For charges distributed continuously in space (15.2.3), the principle of
superposition of fields is of the form

$$\mathbf{E} = \int d\mathbf{E},$$

where integration extends over all the continuously distributed sources of the
fields, each setting up an electric field with the strength $d\mathbf{E}$.

Example. A system of fixed point charges $q_1, q_2, \ldots, q_n$ sets up an
electrostatic field whose strength $\mathbf{E}$ equals

$$\mathbf{E} = \frac{1}{4\pi\varepsilon_0} \sum_{i=1}^{n} \frac{q_i}{\varepsilon_r r_i^3} \mathbf{r}_i \quad \text{(in SI units)},$$

$$\mathbf{E} = \sum_{i=1}^{n} \frac{q_i}{\varepsilon_r r_i^3} \mathbf{r}_i \quad \text{(in cgse units)},$$

where $\mathbf{r}_i$ is the radius vector from point charge $q_i$ to the point
being investigated in the field. Any charged body on which the charge $Q$ is
discretely distributed (15.2.3) can be divided into extremely small parts,
each having a point charge. Therefore, the preceding formula is of general
significance for calculating electrostatic fields in a homogeneous, isotropic
medium that fills the whole field.
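
The superposition formula for point charges translates directly into a loop over the charges. The sketch below uses SI units; the function name and the representation of each charge as a pair (value, position) are illustrative assumptions.

```python
import math

EPS0 = 8.85e-12  # C^2/(N*m^2), electric constant (14.2.7)


def resultant_field(charges, point, eps_r=1.0):
    """Field strength (V/m) at `point` due to point charges, by superposition.

    `charges` is an iterable of (q_i, (x, y, z)) pairs; r_i is the radius
    vector from charge q_i to the point being investigated.
    """
    k = 1.0 / (4.0 * math.pi * EPS0 * eps_r)
    ex = ey = ez = 0.0
    for q, pos in charges:
        rx, ry, rz = point[0] - pos[0], point[1] - pos[1], point[2] - pos[2]
        r3 = (rx * rx + ry * ry + rz * rz) ** 1.5
        ex += k * q * rx / r3
        ey += k * q * ry / r3
        ez += k * q * rz / r3
    return (ex, ey, ez)


if __name__ == "__main__":
    pair = [(1e-9, (-1.0, 0.0, 0.0)), (1e-9, (1.0, 0.0, 0.0))]
    print(resultant_field(pair, (0.0, 0.0, 0.0)))  # fields cancel at the midpoint
    print(resultant_field(pair, (0.0, 1.0, 0.0)))  # only the y component survives
```

For two equal charges the contributions cancel exactly at the midpoint, while on the perpendicular bisector the components along the line joining the charges cancel and the transverse components add.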

15.2.3 Electric charges, the sources of electrostatic fields, can be
distributed in space either discretely (discrete charge distribution) at
various points in space, or continuously (continuous charge distribution). In
the latter case, the charges are distributed either along a certain line, on
the surface of some body or within a certain volume. The concept of charge
density is introduced for continuous charge distribution. When the electric
charge is distributed continuously along a line we have linear charge density
$\tau$:

$$\tau = \frac{dq}{dl},$$

where $dq$ is the charge of an infinitesimal element of length $dl$. When the
electric charge is distributed continuously over a certain surface, we speak
of the surface charge density $\sigma$:

$$\sigma = \frac{dq}{dS},$$

where $dq$ is the charge on an infinitesimal element of area $dS$. When the
electric charge is distributed continuously throughout some volume, we speak
of the volume charge density $\rho$:

$$\rho = \frac{dq}{dV},$$

where $dq$ is the charge in an infinitesimal element of volume $dV$.

Example. The electrostatic field of an electric dipole. An electric dipole is
a system of two charges of equal magnitude, but of opposite sign. These
charges, $q$ and $-q$ ($q > 0$), are separated by the distance $l$, which is
small in comparison with the distance to the points being considered in the
field. The arm of the dipole is the vector $\mathbf{l}$, directed along the
dipole axis from the negative toward the positive charge and equal to the
distance between them (Fig. 15.1). The product of the charge $q$ of the dipole
($q > 0$) by the arm $\mathbf{l}$ is called the electric dipole moment
$\mathbf{p}_e$:

$$\mathbf{p}_e = q\mathbf{l}.$$

The strength $\mathbf{E}$ of the field of an electric dipole at an arbitrary
point is:

$$\mathbf{E} = \mathbf{E}_+ + \mathbf{E}_-,$$

where $\mathbf{E}_+$ and $\mathbf{E}_-$ are the strengths of the fields set up
by the charges $q$ and $-q$ (Fig. 15.1).

Fig. 15.1

At point A, located on the dipole axis at the distance $r$ from its middle
(Fig. 15.1) ($r \gg l$), the strength of the field of a dipole equals:

$$\mathbf{E} = \frac{1}{4\pi\varepsilon_0} \frac{2\mathbf{p}_e}{\varepsilon_r r^3} \quad \text{(in SI units)},$$

$$\mathbf{E} = \frac{2\mathbf{p}_e}{\varepsilon_r r^3} \quad \text{(in cgse units)}.$$

At point B, located on a perpendicular erected at the middle of the dipole
axis and at the distance $r$ from the middle of the axis ($r \gg l$),

$$\mathbf{E} = -\frac{1}{4\pi\varepsilon_0} \frac{\mathbf{p}_e}{\varepsilon_r r^3} \quad \text{(in SI units)},$$

$$\mathbf{E} = -\frac{\mathbf{p}_e}{\varepsilon_r r^3} \quad \text{(in cgse units)}.$$

At an arbitrary point sufficiently distant from the dipole ($r \gg l$)
(Fig. 15.2), the magnitude of its field strength is:

$$E = \frac{1}{4\pi\varepsilon_0} \frac{p_e}{\varepsilon_r r^3} \sqrt{3\cos^2\theta + 1} \quad \text{(in SI units)},$$

$$E = \frac{p_e}{\varepsilon_r r^3} \sqrt{3\cos^2\theta + 1} \quad \text{(in cgse units)},$$

where $\theta$ is the angle between the dipole arm and the radius vector drawn
from the middle of the dipole to the point.
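
The far-field formula for the dipole can be compared with the direct superposition of the fields of the two charges. The sketch below is an illustrative check under assumed names and sample values: the dipole is placed on the z axis with its middle at the origin, SI units are used, and $\theta$ is measured from the dipole arm.

```python
import math

EPS0 = 8.85e-12  # C^2/(N*m^2), electric constant (14.2.7)


def dipole_field_far(p_e, r, theta, eps_r=1.0):
    """Far-field magnitude p_e*sqrt(3cos^2(theta)+1) / (4*pi*eps0*eps_r*r^3)."""
    return (p_e * math.sqrt(3.0 * math.cos(theta) ** 2 + 1.0)
            / (4.0 * math.pi * EPS0 * eps_r * r ** 3))


def dipole_field_exact(q, l, r, theta, eps_r=1.0):
    """Field magnitude obtained by superposing the fields of +q and -q."""
    k = 1.0 / (4.0 * math.pi * EPS0 * eps_r)
    x, z = r * math.sin(theta), r * math.cos(theta)
    ex = ez = 0.0
    # +q sits at z = +l/2 and -q at z = -l/2, so the arm points along +z.
    for charge, zc in ((q, 0.5 * l), (-q, -0.5 * l)):
        dx, dz = x, z - zc
        d3 = (dx * dx + dz * dz) ** 1.5
        ex += k * charge * dx / d3
        ez += k * charge * dz / d3
    return math.hypot(ex, ez)


if __name__ == "__main__":
    q, l, r = 1e-9, 1e-3, 0.1   # r is 100 times larger than l
    for theta in (0.0, math.pi / 3.0, math.pi / 2.0):
        print(theta, dipole_field_exact(q, l, r, theta),
              dipole_field_far(q * l, r, theta))
```

The two values agree closely when $r \gg l$; the residual difference is of the order of $(l/r)^2$.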


15.3 Electric Displacement. Ostrogradsky-Gauss
Electric Flux Theorem

15.3.1 The electric field strength (15.1.3) depends upon the properties of the
medium. In a homogeneous isotropic medium, the strength $\mathbf{E}$ is
inversely proportional to $\varepsilon_r$ (14.2.5). Another characteristic of
an electric field, along with its strength $\mathbf{E}$, is the vector
$\mathbf{D}$ of electric displacement, or electric induction (other names are
dielectric displacement, dielectric flux density, displacement, electric
displacement density and electric flux density). For a field in an
electrically isotropic medium, the relation between $\mathbf{D}$ and
$\mathbf{E}$ is of the form

$$\mathbf{D} = \varepsilon_0 \varepsilon_r \mathbf{E} \quad \text{(in SI units)},$$

$$\mathbf{D} = \varepsilon_r \mathbf{E} \quad \text{(in cgse units)}.$$

The general relation for $\mathbf{D}$, valid for anisotropic media, is given
in Sect. 18.3.4.

Example. For the field of point electric charge $q$ (14.2.3)

$$\mathbf{D} = \frac{1}{4\pi} \frac{q}{r^3} \mathbf{r} \quad \text{(in SI units)},$$

$$\mathbf{D} = \frac{q}{r^3} \mathbf{r} \quad \text{(in cgse units)}.$$

The projection of $\mathbf{D}$ on the direction of radius vector $\mathbf{r}$,
from the point charge to the given point in the field, is

$$D_r = \frac{1}{4\pi} \frac{q}{r^2} \quad \text{(in SI units)},$$

$$D_r = \frac{q}{r^2} \quad \text{(in cgse units)}.$$

15.3.2 An element $d\Phi_e$ of electric displacement flux through an element
of surface with the area $dS$ is the scalar physical quantity defined by the
equation

$$d\Phi_e = \mathbf{D} \cdot d\mathbf{S} = D\,dS \cos\alpha = D_n\,dS = D\,dS_\perp,$$

where $\mathbf{D}$ is the displacement vector (15.3.1), $\mathbf{n}$ is a unit
vector normal to area $dS$, $d\mathbf{S} = dS\,\mathbf{n}$ is the vector of
the element of surface $dS$, $D_n = D \cos\alpha$ is the projection of vector
$\mathbf{D}$ on the direction of vector $\mathbf{n}$,
$dS_\perp = dS \cos\alpha$ is the area of the projection of the element of
surface $dS$ on a plane perpendicular to vector $\mathbf{D}$, and $\alpha$ is
the angle between vectors $\mathbf{D}$ and $\mathbf{n}$ (Fig. 15.3).

When the electric field is set up by point charge $q$, the element of
displacement flux $d\Phi_e$ through the element of surface $dS$ of the closed
surface $S$, within which the point charge is located, is

$$d\Phi_e = \frac{q}{4\pi}\,d\omega \quad \text{(in SI units)},$$

$$d\Phi_e = q\,d\omega \quad \text{(in cgse units)},$$

where $d\omega$ is the solid angle subtended at point charge $q$ by the
element $dS$ of surface $S$ (Fig. 15.3).

The total displacement flux $\Phi_e$ through the surface $S$ is the summation
or integration of all the elements of flux:

$$\Phi_e = \int_S \mathbf{D} \cdot d\mathbf{S} = \int_S D\,dS \cos\alpha = \int_S D_n\,dS = \int_S D\,dS_\perp,$$

where $\alpha$ is the angle between vectors $\mathbf{D}$ and $\mathbf{n}$.
Here all the vectors $\mathbf{n}$ of the normals to the elements $dS$ of
surface are directed toward the same side of surface $S$. In the case of a
closed surface $S$ (Fig. 15.3), all the vectors $\mathbf{n}$ of the normals
must be either outward or inward ones*.

* Only outward normals are used in the following.

15.3.3 The Ostrogradsky-Gauss theorem states that the displacement flux
passing through an arbitrary closed surface is equal to or proportional to the
algebraic sum of the electric charges $q_1, q_2, q_3, \ldots, q_k$ enclosed
within the surface:

$$\oint_S \mathbf{D} \cdot d\mathbf{S} = \oint_S D_n\,dS = \sum_{i=1}^{k} q_i \quad \text{(in SI units)},$$

$$\oint_S \mathbf{D} \cdot d\mathbf{S} = \oint_S D_n\,dS = 4\pi \sum_{i=1}^{k} q_i \quad \text{(in cgse units)}.$$

The displacement flux passing through an arbitrary closed surface that does
not enclose any electric charges equals zero. The differential form of the
Ostrogradsky-Gauss theorem is one of Maxwell's equations for an
electromagnetic field (27.4.2).

15.3.4 The Ostrogradsky-Gauss theorem for the electrostatic field in vacuum
states that the flux of the electrostatic field strength vector in vacuum
passing through an arbitrary closed surface is proportional to the algebraic
sum of the electric charges enclosed within the surface:

$$\oint_S \mathbf{E} \cdot d\mathbf{S} = \frac{1}{\varepsilon_0} \sum_{i=1}^{k} q_i \quad \text{[in SI units, where } \varepsilon_0 \text{ is the electric constant (14.2.7)]},$$

$$\oint_S \mathbf{E} \cdot d\mathbf{S} = 4\pi \sum_{i=1}^{k} q_i \quad \text{(in cgse units)}.$$

The Ostrogradsky-Gauss theorem for the field in a dielectric is discussed in
Sect. 18.3.3.
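
The theorem for the field in vacuum can be verified numerically by integrating the field strength of a point charge over a sphere. The sketch below is an illustration only: the function name, the grid sizes and the charge positions are assumptions, and the surface integral is approximated by a midpoint rule in spherical angles.

```python
import math

EPS0 = 8.85e-12  # C^2/(N*m^2), electric constant (14.2.7)


def flux_through_sphere(q, charge_pos, radius, n_theta=200, n_phi=400):
    """Numerical flux of E through a sphere centred at the origin (outward normals)."""
    k = 1.0 / (4.0 * math.pi * EPS0)
    d_theta = math.pi / n_theta
    d_phi = 2.0 * math.pi / n_phi
    total = 0.0
    for i in range(n_theta):
        theta = (i + 0.5) * d_theta
        sin_t, cos_t = math.sin(theta), math.cos(theta)
        for j in range(n_phi):
            phi = (j + 0.5) * d_phi
            # Outward unit normal at the surface point.
            nx, ny, nz = sin_t * math.cos(phi), sin_t * math.sin(phi), cos_t
            # Radius vector from the charge to the surface point.
            rx = radius * nx - charge_pos[0]
            ry = radius * ny - charge_pos[1]
            rz = radius * nz - charge_pos[2]
            dist3 = (rx * rx + ry * ry + rz * rz) ** 1.5
            e_n = k * q * (rx * nx + ry * ny + rz * nz) / dist3
            total += e_n * radius ** 2 * sin_t * d_theta * d_phi
    return total


if __name__ == "__main__":
    q = 1e-9
    print(flux_through_sphere(q, (0.3, 0.2, 0.4), 1.0), q / EPS0)  # charge inside
    print(flux_through_sphere(q, (0.0, 0.0, 2.0), 1.0))            # charge outside
```

The flux does not depend on where the enclosed charge sits inside the sphere, and it is close to zero when the charge lies outside.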

15.3.5 Together with the principle of superposition of fields (15.2.2), the
Ostrogradsky-Gauss theorem is applied to calculate electric field vectors
$\mathbf{D}$. This is done by choosing a closed surface in such a way that in
the expression for the displacement flux, the quantity $D$ can be placed
outside the sign of the surface integral. This can be done for fields set up
by the simplest symmetrically arranged charges (charged lines, plane, sphere,
etc.).

CHAPTER 16 ELECTRIC FIELD POTENTIAL

16.1  Work  Done  in  Moving  an  Electric  Charge  in  an 
Electrostatic  Field 

16.1.1 The work done in moving an electric charge $q'$ in an electrostatic
field of strength $\mathbf{E}$ (15.1.3) is independent of the shape of the
path along which the charge is moved; it depends only upon the initial and
final positions of the charge. In other words, electrostatic forces, like
gravitational forces, are potential (conservative) forces (3.1.6). The work
$\delta W$ done by the force $\mathbf{F} = q'\mathbf{E}$ exerted on charge
$q'$ in moving it over the distance $dl$ is equal to:

$$\delta W = F\,dl \cos\varphi = q' E \cos\alpha\,dl,$$

where $\alpha$ is the angle between the directions of vectors $\mathbf{E}$ and
$d\mathbf{l}$, and $\varphi$ is the angle between the directions of the
vectors $\mathbf{F}$ and $d\mathbf{l}$. The work done in a finite movement of
charge $q'$ from point $a$ to point $b$ is equal to

$$W = q' \int_a^b E \cos\alpha\,dl = q' \int_a^b \mathbf{E} \cdot d\mathbf{l},$$

where $\mathbf{E} \cdot d\mathbf{l}$ is the scalar product of vectors
$\mathbf{E}$ and $d\mathbf{l}$.

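16.1.2 If the electric field has been set up by charge q, then

$$E = \frac{q}{4\pi\varepsilon_0\varepsilon_r r^2}, \qquad dl \cos\alpha = dr,$$

and the work done in the movement of charge q' from point a
to point b in this field is equal to

$$W = \frac{qq'}{4\pi\varepsilon_0\varepsilon_r} \int_{r_1}^{r_2} \frac{dr}{r^2} = \frac{qq'}{4\pi\varepsilon_0\varepsilon_r} \left( \frac{1}{r_1} - \frac{1}{r_2} \right),$$

where $r_1$ and $r_2$ are the distances of points a and b from charge q
(Fig. 16.1).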
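
As a hedged illustration of the formula in Sect. 16.1.2, the following Python sketch compares the closed-form work with a direct midpoint-rule evaluation of the integral of dr/r². The function names and the sample charges and distances are assumptions chosen for the example.

```python
import math

EPS0 = 8.8541878128e-12  # electric constant, F/m

def coulomb_work(q, q_test, r1, r2, eps_r=1.0):
    """Closed form: W = q q' / (4 pi eps0 eps_r) * (1/r1 - 1/r2), in joules."""
    return q * q_test / (4 * math.pi * EPS0 * eps_r) * (1 / r1 - 1 / r2)

def coulomb_work_numeric(q, q_test, r1, r2, eps_r=1.0, steps=100_000):
    """Midpoint-rule estimate of q q' / (4 pi eps0 eps_r) * integral of dr / r^2."""
    dr = (r2 - r1) / steps
    total = 0.0
    for i in range(steps):
        r = r1 + (i + 0.5) * dr
        total += dr / r**2
    return q * q_test / (4 * math.pi * EPS0 * eps_r) * total

# Like charges of 1 uC and 1 nC, moved from 0.1 m to 0.5 m apart
w_exact = coulomb_work(1e-6, 1e-9, 0.1, 0.5)
w_numeric = coulomb_work_numeric(1e-6, 1e-9, 0.1, 0.5)
w_reverse = coulomb_work(1e-6, 1e-9, 0.5, 0.1)
```

For like charges the work is positive when the separation grows and changes sign when the path is reversed, in line with the sign rules stated in this section.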

The work W done by the electric forces of repulsion of like
charges q and q' is positive when the charges move away from
each other, and negative when they move toward each other.
The work done by electric forces of attraction of unlike charges
q and q' is positive when they move toward each other and
negative when they move away from each other.

16.1.3 When an electric charge q' is moved in a field set up by
a system of point charges $q_1, q_2, \ldots, q_n$, the charge q' is
subject to the force

$$\mathbf{F} = \mathbf{F}_1 + \mathbf{F}_2 + \ldots + \mathbf{F}_n$$

and the work W done by the resultant force is equal to the
algebraic sum of the work done by the component forces:

$$W = W_1 + W_2 + \ldots + W_n = \sum_{i=1}^{n} \frac{q' q_i}{4\pi\varepsilon_0\varepsilon_r} \left( \frac{1}{r_{i1}} - \frac{1}{r_{i2}} \right),$$

where $r_{i1}$ and $r_{i2}$ are the distances of charge $q_i$ to the initial
and final positions of charge q'. Each of the amounts of work $W_i$
and the total work W depend upon the initial and final positions
of the charge q', but are independent of the shape of its path.

16.1.4 Circulation of the field strength vector along the closed
path or loop L is the line integral

$$\oint_L E\, dl \cos\alpha = \oint_L \mathbf{E}\, d\mathbf{l}.$$

This integral is numerically equal to the work done by
electrostatic forces in moving a unit positive charge along a closed
path. Since $r_{i1} = r_{i2}$ for a closed path, we have, according to
Sect. 16.1.3, $\oint_L \mathbf{E}\, d\mathbf{l} = 0$, i.e. the above-mentioned work
equals zero.

Fig. 16.1

A force field of strength E that complies with such a condition
is said to be a conservative one. An electrostatic field is
conservative.

The differential form of the condition indicating the conservative
nature of an electrostatic field is one of Maxwell's equations
for an electrostatic field (27.2.1).
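
The vanishing circulation can be illustrated numerically. The sketch below is an assumption-laden example, not the handbook's own procedure: it evaluates the line integral of E dl for a point charge at the origin around circular loops in the XY plane, in units where q/(4πε0) = 1. The helper names `field` and `circulation` are hypothetical.

```python
import math

def field(x, y):
    """In-plane field of a point charge at the origin, units with q/(4 pi eps0) = 1."""
    r3 = (x * x + y * y) ** 1.5
    return x / r3, y / r3

def circulation(cx, cy, radius, steps=20_000):
    """Midpoint-rule line integral of E . dl around a circle centred at (cx, cy).

    The circle must not pass through the charge itself.
    """
    d_phi = 2 * math.pi / steps
    total = 0.0
    for i in range(steps):
        phi = (i + 0.5) * d_phi
        x = cx + radius * math.cos(phi)
        y = cy + radius * math.sin(phi)
        ex, ey = field(x, y)
        # dl is tangent to the circle: (-sin phi, cos phi) * radius * d_phi
        total += (-ex * math.sin(phi) + ey * math.cos(phi)) * radius * d_phi
    return total

loop_away = circulation(2.0, 1.0, 0.5)    # loop that does not surround the charge
loop_around = circulation(0.2, 0.0, 1.0)  # loop that surrounds the charge
```

Both values come out as zero to within rounding error, whether or not the loop surrounds the charge.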


16.2  Potential  of  an  Electrostatic  Field 


16.2.1 The work $\delta W$ (16.1.1) is equal to the decrease $dE_p$ in
potential energy (3.3.1) of the charge q' moving in an
electrostatic field:

$$\delta W = -dE_p$$

and

$$W = \int_a^b q'E\, dl \cos\alpha = -\Delta E_p = E_{1p} - E_{2p},$$

where $E_{1p}$ and $E_{2p}$ are the values of the potential energy of
charge q' at points a and b, and $\alpha$ is the angle between the
directions of vectors E and dl (Fig. 16.1).

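16.2.2 When point charge q' is moved by electrostatic forces
in a field set up by charge q, the change $dE_p$ in its potential
energy in an infinitesimal displacement is

$$dE_p = -\delta W = -\frac{qq'\, dr}{4\pi\varepsilon_0\varepsilon_r r^2}.$$

Upon a finite displacement of charge q' from point a to point b
(Fig. 16.1), the change $\Delta E_p$ in the potential energy of the charge
is

$$\Delta E_p = E_{2p} - E_{1p} = \int_{r_1}^{r_2} dE_p = \frac{qq'}{4\pi\varepsilon_0\varepsilon_r r_2} - \frac{qq'}{4\pi\varepsilon_0\varepsilon_r r_1} \quad \text{(in SI units)},$$

$$\Delta E_p = qq' \left( \frac{1}{\varepsilon_r r_2} - \frac{1}{\varepsilon_r r_1} \right) \quad \text{(in cgse units)}.$$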


16.2.3 When point charge q' is moved in a field set up by a
system of point charges ($q_1, q_2, \ldots, q_n$), the change in the
potential energy of charge q' is

$$\Delta E_p = q' \sum_{i=1}^{n} \left( \frac{q_i}{4\pi\varepsilon_0\varepsilon_r r_{i2}} - \frac{q_i}{4\pi\varepsilon_0\varepsilon_r r_{i1}} \right) \quad \text{(in SI units)},$$

$$\Delta E_p = q' \sum_{i=1}^{n} \left( \frac{q_i}{\varepsilon_r r_{i2}} - \frac{q_i}{\varepsilon_r r_{i1}} \right) \quad \text{(in cgse units)},$$

where $r_{i1}$ and $r_{i2}$ are the distances between charges $q_i$ and q'
before and after the displacement of the latter.

16.2.4 To find the absolute value of the potential energy of
an electric charge at a given point of an electrostatic field, it
is necessary to choose a reference point (3.3.4) for the potential
energy. Integrating the equation in Sect. 16.2.2, we obtain
for the general case:

$$E_p = \frac{qq'}{4\pi\varepsilon_0\varepsilon_r r} + C,$$

where C is an arbitrary constant. If we assume that $E_p = 0$
as $r \to \infty$, then C = 0 and the potential energy of charge q',
located in the field of charge q at the distance r from this charge,
equals

$$E_p = \frac{qq'}{4\pi\varepsilon_0\varepsilon_r r} \quad \text{(in SI units)},$$

$$E_p = \frac{qq'}{\varepsilon_r r} \quad \text{(in cgse units)}.$$

For like charges q and q', the potential energy of their
interaction (repulsion) is positive and increases (decreases) as the
charges move closer together (away from each other). In
attraction of unlike charges $E_p < 0$ and increases to zero as one of
the charges approaches infinity. The curves showing the
dependence of the potential energy of two point charges on the
distance between them are given in Fig. 16.2.
16.2.5 The potential energy $E_p$ of charge q', located in a field
set up by a system of point charges $q_1, q_2, \ldots, q_n$ is equal
to the sum of the potential energies $E_{ip}$ in the fields set up by
each of the charges separately:

$$E_p = \sum_{i=1}^{n} E_{ip} = q' \sum_{i=1}^{n} \frac{q_i}{4\pi\varepsilon_0\varepsilon_r r_i} \quad \text{(in SI units)},$$

where $r_i$ is the distance between charges $q_i$ and q'.


16.2.6 The potential of an electrostatic field is an energy
characteristic of the field; it is the ratio of the potential energy $E_p$
of charge q' to the magnitude of the charge:

$$V = \frac{E_p}{q'} = \sum_{i=1}^{n} \frac{q_i}{4\pi\varepsilon_0\varepsilon_r r_i} \quad \text{(in SI units)}.$$

This ratio is independent of the magnitude of the charge q'
and is numerically equal to the potential energy of a unit test
charge (15.1.3) placed at the point being considered in the
field.
Examples:

(1) The potential of an electrostatic field set up by one point
charge q in a homogeneous and isotropic dielectric is

$$V = \frac{q}{4\pi\varepsilon_0\varepsilon_r r} \quad \text{(in SI units)},$$

$$V = \frac{q}{\varepsilon_r r} \quad \text{(in cgse units)}.$$

(2) The potential of a field set up by a charged conductor
(16.4.4) in a homogeneous and isotropic dielectric is

$$V = \frac{1}{4\pi\varepsilon_0\varepsilon_r} \oint_S \frac{\sigma\, dS}{r} \quad \text{(in SI units)},$$

$$V = \frac{1}{\varepsilon_r} \oint_S \frac{\sigma\, dS}{r} \quad \text{(in cgse units)},$$

where $\sigma$ is the surface charge density on the conductor (15.2.3)
and dS is an element of surface of the conductor.

(3) The potential of an isolated conducting sphere of radius R,
having the charge q, is

$$V = \frac{q}{4\pi\varepsilon_0\varepsilon_r R} \quad \text{(in SI units)},$$

$$V = \frac{q}{\varepsilon_r R} \quad \text{(in cgse units)}.$$
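
The superposition formula of Sect. 16.2.6 translates directly into code. The sketch below is illustrative only; the function name `potential`, the `(charge, position)` data layout and the sample charges are assumptions.

```python
import math

EPS0 = 8.8541878128e-12  # electric constant, F/m

def potential(point, charges, eps_r=1.0):
    """V = sum of q_i / (4 pi eps0 eps_r r_i), in volts.

    `charges` is a list of (q_i, (x, y, z)) pairs; `point` must not
    coincide with any charge position.
    """
    v = 0.0
    for q, pos in charges:
        r = math.dist(point, pos)
        v += q / (4 * math.pi * EPS0 * eps_r * r)
    return v

# Two opposite 1 nC charges placed 0.2 m apart on the Z axis
pair = [(1e-9, (0.0, 0.0, 0.1)), (-1e-9, (0.0, 0.0, -0.1))]
v_equidistant = potential((1.0, 0.0, 0.0), pair)  # same distance to both charges
v_near_positive = potential((0.0, 0.0, 1.0), pair)

# A single 1 nC charge seen from 1 m, compared with Example (1)
v_single = potential((1.0, 0.0, 0.0), [(1e-9, (0.0, 0.0, 0.0))])
```

At a point equidistant from two equal and opposite charges the contributions cancel, while the single-charge value reproduces Example (1), about 9 V for 1 nC at 1 m in vacuum.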

16.2.7 The work W done by electric forces in moving charge q'
from point a to point b of an electrostatic field is

$$W = E_{1p} - E_{2p} = q' (V_1 - V_2) = q'\, \Delta V,$$

where $E_{1p}$ and $E_{2p}$ are the potential energies of charge q' at
points a and b, $V_1$ and $V_2$ are the potentials of the field at the
same points, and $\Delta V = V_1 - V_2$ is the potential difference.
If point b is at infinity, $E_{2p} = 0$ and $V_2 = 0$. Then the work
$W_\infty$ required to move charge q' from point a to infinity is
equal to

$$W_\infty = E_{1p} = q' V_1.$$

Owing to the arbitrary nature of point a, subindex 1 can be
discarded. Then

$$V = \frac{W_\infty}{q'}.$$

Hence, the potential at a given point of an electrostatic field
is numerically equal to the work done by electrostatic forces
in moving a unit positive charge from the point in the field to
infinity. This work is also numerically equal to that done by
external forces (acting against the forces of the electrostatic
field) in moving a unit positive charge from infinity to the given
point in the field.

16.2.8 In the study of electrostatic fields, all the problems require
a  knowledge  of  the  potential  difference  between  any  points 
in  a  field  rather  than  the  absolute  values  of  the  potentials  at 
these  points.  Therefore,  the  choice  of  a  point  of  zero  potential 
depends  only  on  its  convenience  for  solving  the  given  problem. 
It  frequently  proves  convenient  to  assume  that  the  potential 
of  the  earth  is  equal  to  zero. 


16.3  Relation  Between  the  Potential  and  Strength 
of  an  Electrostatic  Field 

16.3.1 On the basis of the equations in Sects. 16.2.1 and 16.2.6,
an element of work $\delta W$ done in the infinitesimal displacement
of charge q' in an electrostatic field is

$$\delta W = q'E \cos\alpha\, dl = -dE_p = -q'\, dV,$$

or

$$E \cos\alpha\, dl = -dV; \qquad \mathbf{E}\, d\mathbf{l} = -dV.$$

But $dl \cos\alpha = dl_0$, where $dl_0$ is an element of length of the line
of force (15.1.6) (Fig. 16.3). Hence, $E = -dV/dl_0$. The
derivative $dV/dl_0$ is the rate of change of the potential along a line
of force and is numerically equal to the change in potential
per unit length of a line of force.
16.3.2 The projection $E_l$ of vector E on the direction of motion
dl is equal to $E_l = E \cos\alpha$. Therefore

$$E_l = -\frac{dV}{dl}.$$

It is obvious that $E_l \le E$ and $|dV/dl| \le |dV/dl_0|$. Hence,
$E_l$ and $dV/dl$ reach their maximum values when dl is tangent
to the line of force.

In the vicinity of a given point in an electrostatic field, the
potential varies most rapidly in the direction of a line of force.
The sense of vector E is toward the most rapid decrease
in potential.

16.3.3 In a more general form the relation between the strength
and potential of an electrostatic field is

$$\mathbf{E} = -\operatorname{grad} V,$$

where grad V is the potential gradient vector, directed toward
the most rapid increase in the potential and numerically equal
to its rate of change per unit length in this direction. If the
potential V is regarded as a function of the three Cartesian
coordinates of the given point in the field, then

$$\operatorname{grad} V = \frac{\partial V}{\partial x}\mathbf{i} + \frac{\partial V}{\partial y}\mathbf{j} + \frac{\partial V}{\partial z}\mathbf{k},$$

where i, j and k are unit vectors along the OX, OY and OZ
axes. The projections of the electrostatic field strength vector
on the coordinate axes are related to the field potential by the
equations:

$$E_x = -\frac{\partial V}{\partial x}, \quad E_y = -\frac{\partial V}{\partial y} \quad \text{and} \quad E_z = -\frac{\partial V}{\partial z}.$$
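
The relation E = -grad V lends itself to a quick numerical check. The sketch below is an illustration under stated assumptions: it differentiates the point-charge potential by central differences and compares the result with the Coulomb field. The constant `K`, the step `h` and the function names are choices made for this example.

```python
import math

K = 8.9875517873681764e9  # 1 / (4 pi eps0), N m^2 / C^2
Q = 1e-9                  # source charge in coulombs (assumed value)

def potential(x, y, z):
    """Potential of a point charge Q at the origin, in vacuum."""
    return K * Q / math.sqrt(x * x + y * y + z * z)

def field_from_potential(x, y, z, h=1e-5):
    """E = -grad V, with each partial derivative taken as a central difference."""
    ex = -(potential(x + h, y, z) - potential(x - h, y, z)) / (2 * h)
    ey = -(potential(x, y + h, z) - potential(x, y - h, z)) / (2 * h)
    ez = -(potential(x, y, z + h) - potential(x, y, z - h)) / (2 * h)
    return ex, ey, ez

ex, ey, ez = field_from_potential(0.3, 0.4, 0.0)  # a point 0.5 m from the charge
e_magnitude = math.hypot(ex, ey, ez)
e_coulomb = K * Q / 0.5**2
```

The computed vector points radially away from the positive charge, i.e. toward decreasing potential, and its magnitude agrees with the Coulomb value.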


16.3.4 When an electric charge is moved in a direction dl,
perpendicular to a line of force, i.e. perpendicular to vector E,
$E_l = 0$ and $dV/dl = 0$ (16.3.1), i.e. V = const. The potential
is the same at all points of a curve orthogonal to a line of force.
The geometrical locus of all points having the same potential
is called an equipotential surface. It follows from the aforesaid
that equipotential surfaces are everywhere orthogonal to lines
of force.

The work done in moving an electric charge along one and the
same equipotential surface equals zero.

An infinite number of equipotential surfaces can be constructed
about any system of electric charges. They are usually constructed
so that the potential difference is the same between any two
adjacent equipotential surfaces.

16.3.5  There  are  two  methods  for  the  graphical  representation 
of  electrostatic  fields:  by  means  of  lines  of  force  and  by  means 
of  equipotential  surfaces.  If  the  arrangement  of  the  lines  of 
force  of  an  electrostatic  field  is  known,  equipotential  surfaces 
can  be  constructed.  Conversely,  knowing  the  arrangement  of 
the  equipotential  surfaces,  the  magnitude  and  direction  of  the 
field  strength  can  be  found  for  each  point  of  the  field,  i.e.  the 
lines  of  force  can  be  constructed. 


16.4  Conductors  in  an  Electrostatic  Field 

16.4.1 Solid metal conductors contain current carriers. These
are conduction electrons (free electrons) that can be moved
by the external electric field through the volume of the
conductor. Conduction electrons appear when a conducting metal
goes over from a less condensed to a more condensed state,
i.e. from the gaseous to the liquid or solid state. In this
transition the valence electrons (39.3.9) are collectivized; they are
separated from “their” atoms and form a special electron gas.

16.4.2 The electrical properties of conductors under
electrostatic conditions are determined by the behaviour of the
conduction electrons in the external electrostatic field. In the
absence of an external electrostatic field, the electric fields of
the conduction electrons and the positive ions of the metal
(40.1.3), the atomic cores, mutually compensate each other.
When a metal conductor is placed into an external
electrostatic field, this field redistributes the conduction electrons
in the conductor so that at any point within the conductor the
conduction electrons and the positive ions compensate the
external electrostatic field.

At any point inside a conductor located in an electrostatic
field, the strength of the resultant steady-state electric field
is equal to zero.

16.4.3 At the surface of the conductor the field strength
vector E should be normal to the surface. If this were not so the
tangential component $E_\tau$ of vector E would cause the charges
to move along the conductor's surface, and this would
contradict the fixed distribution of the charges. This leads to the
following consequences:

(a) E = 0 at all points inside the conductor; at all points on
its surface $E = E_n$ (and $E_\tau = 0$), where $E_n$ is the normal
component of the field strength vector;

(b) the whole volume of a conductor in an electrostatic field
is equipotential, because, for any point within the conductor,

$$\frac{dV}{dl} = -E \cos\alpha = 0 \quad \text{and} \quad V = \text{const};$$

(c) the surface of the conductor is also equipotential (16.3.4),
because for any line on the surface

$$\frac{dV}{dl} = -E_\tau = 0 \quad \text{and} \quad V = \text{const};$$

(d) in a charged conductor the uncompensated charges are
located only on its surface. This follows from the Ostrogradsky-
Gauss electric flux theorem (15.3.3), according to which the
total charge q inside a conductor and enclosed within a certain
volume bounded by the arbitrary closed surface S is equal to

$$q = \Phi_e = \oint_S D\, dS \cos\alpha = 0,$$

because D = 0 at all points on the surface.

16.4.4 When the electrostatic field is set up by a charged
conductor the displacement and strength of this field close to the
surface of the conductor are found by the equations

$$D_n = \sigma \quad \text{and} \quad E_n = \frac{\sigma}{\varepsilon_0\varepsilon_r} \quad \text{(in SI units)},$$

where n is the outward normal to the conductor's surface, $\sigma$ is
the surface charge density (15.2.3), $\varepsilon_r$ is the relative
permittivity (dielectric constant) of the medium (14.2.4) and $\varepsilon_0$ is the
electric constant in SI units (14.2.7).
CHAPTER 17 CAPACITANCE


17.1  Capacitance  of  an  Isolated  Conductor 


17.1.1 Upon an increase in the amount of electricity transmitted
to an isolated conductor, the charges are distributed over the
conductor's surface with varying surface density. The way in
which the charges are distributed depends only on the shape
of the conductor, but not on the amount of electricity that was
previously on the conductor. Each new portion of charges is
distributed over the surface of the conductor similar to the
preceding portion. It follows that, at any point on the
conductor's surface, the surface charge density $\sigma$ (15.2.3) increases
in proportion to the charge q on the conductor: $\sigma = kq$, where
k = f (x, y, z) is a function of the coordinates of the point
being considered on the surface.

The potential of the field set up by a charged conductor (16.2.6)
is

$$V = \frac{1}{4\pi\varepsilon_0\varepsilon_r} \oint_S \frac{\sigma\, dS}{r} = \frac{q}{4\pi\varepsilon_0\varepsilon_r} \oint_S \frac{k\, dS}{r}.$$

For points on the surface S of the conductor this integral
depends only on the size and shape of the conductor's surface.
Hence the potential V of the conductor is proportional to its
charge q.

17.1.2 The ratio of the charge q of an isolated conductor to its
potential V is called the capacitance of the given conductor:

$$C = \frac{q}{V} \quad \text{or} \quad C = 4\pi\varepsilon_0\varepsilon_r \left( \oint_S \frac{k\, dS}{r} \right)^{-1} \quad \text{(in SI units)}.$$

The capacitance of an isolated conductor is numerically equal
to the charge that changes the potential of the conductor by
one unit.

The capacitance of an isolated conductor depends upon its
shape and dimensions. For geometrically similar conductors,
their capacitances are proportional to their linear dimensions.
The capacitance of an isolated conductor depends upon the
dielectric properties of the surrounding medium. The
capacitance of a conductor in a homogeneous isotropic medium is
proportional to the relative permittivity (dielectric constant)
(14.2.4) of the medium.

17.1.3 It follows from the equation for the potential of an
isolated conducting sphere of radius R (16.2.6) that the
capacitance of such a sphere is

$$C = 4\pi\varepsilon_0\varepsilon_r R \quad \text{(in SI units)},$$

$$C = \varepsilon_r R \quad \text{(in cgse units)}.$$
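
To give a sense of scale, the following sketch evaluates the SI formula for a sphere the size of the Earth. It is an illustration only; the radius of 6.371e6 m is an approximate mean value assumed for the example, and the function name is hypothetical.

```python
import math

EPS0 = 8.8541878128e-12  # electric constant, F/m

def sphere_capacitance(radius, eps_r=1.0):
    """C = 4 pi eps0 eps_r R for an isolated conducting sphere, in farads."""
    return 4 * math.pi * EPS0 * eps_r * radius

c_earth = sphere_capacitance(6.371e6)  # sphere of roughly the Earth's radius
c_unit = sphere_capacitance(1.0)
c_double = sphere_capacitance(2.0)
```

Even a sphere of planetary size has a capacitance below one millifarad, and doubling the radius doubles the capacitance, as expected for geometrically similar conductors (17.1.2).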


17.2 Mutual Capacitance. Capacitors


17.2.1 When there are other conductors close to conductor A,
the capacitance of the latter is greater than for an identical,
but isolated, conductor. This is due to the fact that when the
charge q is transferred to conductor A, charges are induced
on the conductors surrounding it under the influence of the
field set up by conductor A. The portions of the surrounding
conductors nearest to the inducing charge q become
oppositely charged (Fig. 17.1). These charges weaken the field set up
by charge q, lower the potential of conductor A and
increase its capacitance.

Fig. 17.1

17.2.2 In a system of two conductors arranged close to each
other, having charges q equal in magnitude but opposite in
sign, a potential difference $V_1 - V_2$ is produced that is
proportional to q:

$$V_1 - V_2 = \frac{q}{C},$$

where $C = q/(V_1 - V_2)$ is the mutual capacitance of the two
conductors.
The mutual capacitance of two conductors is numerically equal
to the charge that it is necessary to transfer from one conductor
to the other to change the potential difference between them
by one unit.

17.2.3 The mutual capacitance C of two conductors depends
upon their shapes, dimensions and relative positions. Moreover,
C depends upon the dielectric properties of the medium
surrounding the conductors. If the medium is homogeneous and
isotropic, C is directly proportional to the relative permittivity
(14.2.4) of the medium.

When  one  of  the  conductors  is  moved  away  to  infinity,  the 
potential  difference  V1  —  V2  between  them  increases  and  their 
mutual  capacitance  is  reduced,  tending  to  the  capacitance  of 
the  remaining  isolated  conductor. 

17.2.4 A capacitor is a system of two charged conductors,
insulated from each other and having charges of equal magnitude
and opposite sign, provided that their shapes and relative
position are such that the conductors set up a concentrated
electrostatic field in a limited region of space. In the case of
parallel-plate capacitors, the conductors are called plates. The
capacitance of a capacitor is the mutual capacitance of its plates
(or, in the general case, conductors).

17.2.5 The capacitance of a parallel-plate capacitor, consisting
of two metal plates, is

$$C = \frac{\varepsilon_0\varepsilon_r S}{d} \quad \text{(in SI units)},$$

$$C = \frac{\varepsilon_r S}{4\pi d} \quad \text{(in cgse units)},$$

where $\varepsilon_r$ is the relative permittivity of the medium filling the
space between the plates, S is the area of each plate, and d
is the distance between them. For a multiple-plate capacitor
consisting of n plates, the formula for the capacitance contains
S(n - 1) instead of S. The capacitance formula is valid only
for small d values, when the lack of uniformity of the
electrostatic field at the edges of the capacitor plates can be ignored.
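
A minimal sketch of the SI formula of Sect. 17.2.5, including the S(n - 1) rule for multiple plates. The function name, the keyword `n_plates` and the sample dimensions are assumptions made for illustration; edge effects are ignored, as in the text.

```python
EPS0 = 8.8541878128e-12  # electric constant, F/m

def parallel_plate_capacitance(area, gap, eps_r=1.0, n_plates=2):
    """C = eps0 eps_r S (n - 1) / d in farads; n_plates = 2 is the ordinary case."""
    return EPS0 * eps_r * area * (n_plates - 1) / gap

# 100 cm^2 plates separated by 1 mm of air (eps_r taken as 1)
c_air = parallel_plate_capacitance(area=1e-2, gap=1e-3)
# The same geometry filled with a dielectric of eps_r = 4 (assumed value)
c_filled = parallel_plate_capacitance(area=1e-2, gap=1e-3, eps_r=4.0)
# A stack of five plates with the same area and spacing
c_stack = parallel_plate_capacitance(area=1e-2, gap=1e-3, n_plates=5)
```

With these dimensions the air-filled capacitor comes out at roughly 89 pF; the dielectric and the five-plate stack each multiply that value by four.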
17.2.6 A spherical capacitor consists of two concentric hollow
metal spheres A and B, of radii $r_1$ and $r_2$ (Fig. 17.2). The spheres
are charged on their surfaces and set up fields only outside
themselves (15.1.4). Hence, in the space between the spheres
the field is set up only by the charge of sphere A. Outside the
capacitor, the fields of the oppositely charged spheres A and B
mutually eliminate each other.

The capacitance of a spherical capacitor is calculated by the
formula

$$C = \frac{4\pi\varepsilon_0\varepsilon_r r_1 r_2}{r_2 - r_1} \quad \text{(in SI units)},$$

$$C = \frac{\varepsilon_r r_1 r_2}{r_2 - r_1} \quad \text{(in cgse units)}.$$

As $r_2 \to \infty$ and $1/r_2 \to 0$, the inner sphere becomes an isolated
one and $C = 4\pi\varepsilon_0\varepsilon_r r_1$ (cf. 17.1.3). For any finite values of $r_1$
and $r_2$

$$C = 4\pi\varepsilon_0\varepsilon_r r_1 \frac{r_2}{r_2 - r_1} > 4\pi\varepsilon_0\varepsilon_r r_1,$$

i.e. the capacitance of a spherical capacitor is greater than that
of an isolated hollow sphere.
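
The limit discussed in Sect. 17.2.6 can be seen numerically. In the sketch below (an illustration; the function names and radii are assumptions) the ratio of the spherical-capacitor capacitance to that of the isolated inner sphere is r2/(r2 - r1), which stays above 1 and approaches 1 as the outer sphere grows.

```python
import math

EPS0 = 8.8541878128e-12  # electric constant, F/m

def spherical_capacitance(r1, r2, eps_r=1.0):
    """C = 4 pi eps0 eps_r r1 r2 / (r2 - r1), with r2 > r1, in farads."""
    return 4 * math.pi * EPS0 * eps_r * r1 * r2 / (r2 - r1)

def isolated_sphere_capacitance(r, eps_r=1.0):
    """C = 4 pi eps0 eps_r r (Sect. 17.1.3)."""
    return 4 * math.pi * EPS0 * eps_r * r

inner = 0.1  # inner radius in metres (assumed)
ratios = [
    spherical_capacitance(inner, outer) / isolated_sphere_capacitance(inner)
    for outer in (0.2, 1.0, 100.0)
]
```

For the three outer radii chosen here the ratio falls from 2 toward 1, which is the inequality stated at the end of the section.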


Fig.  17.3 


17.2.7 A cylindrical capacitor consists of two hollow coaxial
metal cylinders of height h and radii $r_1$ and $r_2$ (Fig. 17.3). The
formula for the capacitance of a cylindrical capacitor (or of a
coaxial cable) is of the form

$$C = \frac{2\pi\varepsilon_0\varepsilon_r h}{\ln \dfrac{r_2}{r_1}} \quad \text{(in SI units)},$$

$$C = \frac{\varepsilon_r h}{2 \ln \dfrac{r_2}{r_1}} \quad \text{(in cgse units)}.$$
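
As an illustration of the SI formula, the sketch below evaluates the capacitance per metre of a hypothetical coaxial cable and checks that, for a very thin gap, the result approaches the parallel-plate value with S = 2πr1h. All dimensions and the permittivity 2.25 are assumed sample values.

```python
import math

EPS0 = 8.8541878128e-12  # electric constant, F/m

def cylindrical_capacitance(height, r1, r2, eps_r=1.0):
    """C = 2 pi eps0 eps_r h / ln(r2 / r1) in farads; edge effects ignored."""
    return 2 * math.pi * EPS0 * eps_r * height / math.log(r2 / r1)

# Hypothetical cable: 0.5 mm inner radius, 1.75 mm outer radius, eps_r = 2.25
c_per_metre = cylindrical_capacitance(1.0, 0.5e-3, 1.75e-3, eps_r=2.25)

# Thin-gap check against the parallel-plate formula C = eps0 S / d
r1, gap = 0.1, 1e-4
c_thin = cylindrical_capacitance(1.0, r1, r1 + gap)
c_plate = EPS0 * (2 * math.pi * r1 * 1.0) / gap
```

The cable value is close to 100 pF per metre, and the thin-gap result differs from the parallel-plate estimate by about 0.05 percent.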

17.2.8 All types of capacitors are characterized by their
breakdown voltage, which is a potential difference across the plates
at which an electric discharge (22.5.1) passes through the layer
of dielectric in the capacitor. The breakdown voltage depends
upon the properties of the dielectric, its thickness and the
shape of the plates.

17.2.9 To obtain a high capacitance, capacitors can be connected
in parallel. This is done by connecting the plates of like charge
together. Here the total capacitance C is

$$C = \sum_{i=1}^{n} C_i,$$

where $C_i$ is the capacitance of the ith capacitor.

17.2.10 When capacitors are connected in series their oppositely
charged plates are connected together. The quantities added
together in this case are reciprocals of the capacitance $C_i$ of
each capacitor:

$$\frac{1}{C} = \sum_{i=1}^{n} \frac{1}{C_i}.$$

The resultant capacitance C is always less than the minimum
capacitance of any capacitor in the bank. Upon a connection
in series, the possibility of a breakdown (17.2.8) of the
capacitors is reduced because the potential difference across each
capacitor is only a part of the total potential difference across
the whole bank of capacitors.
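
The two connection rules of Sects. 17.2.9 and 17.2.10 can be sketched as follows. The bank of 1, 2 and 4 microfarads and the 70 V supply are assumed values; the voltage split uses the fact that capacitors in series carry the same charge.

```python
def parallel_capacitance(caps):
    """Parallel connection: the capacitances add."""
    return sum(caps)

def series_capacitance(caps):
    """Series connection: the reciprocals of the capacitances add."""
    return 1.0 / sum(1.0 / c for c in caps)

bank = [1e-6, 2e-6, 4e-6]  # farads
c_par = parallel_capacitance(bank)
c_ser = series_capacitance(bank)

# Voltage division in the series bank: every capacitor carries the same charge
total_voltage = 70.0
charge = c_ser * total_voltage
voltages = [charge / c for c in bank]
```

The series value is smaller than the smallest capacitance in the bank, and the smallest capacitor takes the largest share of the voltage (40 V of the 70 V here), which matters when comparing against the breakdown voltage (17.2.8).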
CHAPTER 18 DIELECTRICS IN AN ELECTRIC FIELD

18.1  Dipole  Moments  of  Molecules  of  a  Dielectric 

18.1.1 Dielectrics are substances that do not conduct electric
current. At not especially high temperatures and under
conditions when dielectrics are not subject to very strong electric
fields, these substances, in contrast to conductors, have no
carriers of electric current.

18.1.2 The molecules of a dielectric are electrically neutral
and contain an equal number of positive and negative charges.
Nevertheless, the molecules do have electrical properties. As
a first approximation, a molecule of a dielectric can be regarded
as a dipole having the electric dipole moment $\mathbf{p}_e = q\mathbf{l}$ (15.2.3),
where q is the magnitude of the total positive (and also total
negative) charges, located, respectively, at the centres of gravity
of the charges, and l is the distance between the centres of gravity
of the positive and negative charges. As any dipole, a molecule
of matter sets up an electric field in the surrounding space
(15.2.3).

18.1.3 A dielectric is said to be nonpolar (a nonpolar dielectric)
if the electrons of the atoms in its molecules are arranged
symmetrically with respect to their nuclei (H2, O2, CCl4, etc.).
In such molecules the centres of gravity of the positive and
negative charges coincide when there is no external electric field
[l = 0 (18.1.2)] and the dipole moment $p_e$ of the molecule is
equal to zero. When a nonpolar dielectric is placed into an
external electric field, the electron shells (39.3.6) in the atoms
(or molecules) are deformed and the centres of gravity of the
positive and negative charges are displaced with respect to
each other ($l \ne 0$). As a result, an induced electric dipole moment
appears in atoms (or molecules) of the dielectric. This moment
is proportional to the strength E of the external electric field:

$$\mathbf{p}_e = \varepsilon_0 \alpha \mathbf{E} \quad \text{(in SI units)},$$

$$\mathbf{p}_e = \alpha \mathbf{E} \quad \text{(in cgse units)},$$

where $\alpha$ is the polarizability of the molecule (or atom) and $\varepsilon_0$
is the electric constant in SI units (14.2.7).
The polarizability of a molecule depends only on its volume.
It is important that $\alpha$ does not depend upon temperature. The
thermal motion of the molecules in nonpolar dielectrics has no
effect on the induction of dipole moments in them. Molecules
with such dipole moments are like quasi-elastic (induced)
dipoles.

18.1.4 A polar dielectric is one in which the molecules (or atoms)
have electrons arranged asymmetrically with respect to their
nuclei (H2O, HCl, NH3, CH3Cl, etc.). In such molecules the
centres of gravity of the positive and negative charges do not
coincide, but are at a practically constant distance l from each
other. With respect to electrical properties, molecules of polar
dielectrics are like rigid dipoles that have a permanent dipole
moment $p_e$ = const.

18.1.5 A permanent dipole placed in a homogeneous external
electrostatic field is subject to the action of a force couple (4.1.6)
with the moment of force (torque) M equal to the vector
product

$$\mathbf{M} = [\mathbf{p}_e\, \mathbf{E}].$$

The moment M of the force couple is perpendicular to the plane
passing through vectors $\mathbf{p}_e$ and E. Viewed from the tip of vector M,
the rotation through the smaller angle from vector $\mathbf{p}_e$ to vector E
is seen to be counterclockwise. In Fig. 18.1 the moment of force M
is directed normal to the diagram into the page and tends to
turn the dipole so that $\mathbf{p}_e$ and E are parallel to each other. In
real molecules of polar dielectrics, the molecules are deformed
in addition to the turning of the dipole axes into alignment
with the field. This leads to the induction of a certain dipole
moment (18.1.3).

18.1.6 When a permanent dipole is placed into a
nonhomogeneous electric field, in which the field strength E varies along
the length l of the dipole, then, in addition to the torque M
(18.1.5), the dipole is subject to force F, which is equal to

$$\mathbf{F} = p_e \frac{\partial \mathbf{E}}{\partial l},$$

where the derivative is taken along the direction of the dipole
axis, or, in the more general form:

$$\mathbf{F} = p_{ex} \frac{\partial \mathbf{E}}{\partial x} + p_{ey} \frac{\partial \mathbf{E}}{\partial y} + p_{ez} \frac{\partial \mathbf{E}}{\partial z},$$

where $p_{ex}$, $p_{ey}$ and $p_{ez}$ are the projections of vector $\mathbf{p}_e$ on the
axes of a Cartesian coordinate system. In particular, if vector E
is directed along the OX axis and depends only on the
coordinate x [$E_x = E(x)$, $E_y = E_z = 0$ and $\mathbf{E} = E_x \mathbf{i}$], then

$$\mathbf{F} = p_{ex} \frac{dE_x}{dx}\, \mathbf{i}.$$

By the action of force F a free permanent dipole tends to move
to a region of highest field strength.

18.1.7 The potential energy $E_p$ of a permanent dipole in an
external electric field is

$$E_p = -\mathbf{p}_e \mathbf{E} = -p_e E \cos\theta,$$

where $\mathbf{p}_e$ is the electric dipole moment, E is the field strength
at the point where the dipole is located and $\theta$ is the angle
between the dipole axis and vector E. The minus sign indicates
that the stable position of the dipole corresponds to the angle
$\theta = 0$, at which the potential energy of the dipole has its
minimum value and vector $\mathbf{p}_e$ is directed along vector E.
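
The torque of Sect. 18.1.5 and the energy of Sect. 18.1.7 can be evaluated together. The sketch below is illustrative: the dipole moment of 6.2e-30 C m (roughly that of a water molecule) and the field of 1e5 V/m are assumed values, and the helper names are hypothetical.

```python
import math

def torque(p, e):
    """M = [p E], the vector product of the dipole moment and the field."""
    return (
        p[1] * e[2] - p[2] * e[1],
        p[2] * e[0] - p[0] * e[2],
        p[0] * e[1] - p[1] * e[0],
    )

def potential_energy(p, e):
    """E_p = -(p . E), the negative scalar product."""
    return -(p[0] * e[0] + p[1] * e[1] + p[2] * e[2])

P_MAG = 6.2e-30               # dipole moment in C m (assumed sample value)
E_FIELD = (1.0e5, 0.0, 0.0)   # uniform field along OX in V/m (assumed)

def dipole_at(theta):
    """Dipole in the XY plane at angle theta to the field direction."""
    return (P_MAG * math.cos(theta), P_MAG * math.sin(theta), 0.0)

energy_aligned = potential_energy(dipole_at(0.0), E_FIELD)
energy_opposed = potential_energy(dipole_at(math.pi), E_FIELD)
torque_perpendicular = torque(dipole_at(math.pi / 2), E_FIELD)
torque_aligned = torque(dipole_at(0.0), E_FIELD)
```

The energy is lowest when the dipole is aligned with the field and the torque vanishes there; at 90 degrees the torque has magnitude p_e E and, with the dipole rotated counterclockwise from the field in the XY plane, points along negative OZ, which tends to turn the dipole back toward the field direction.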


18.2 Polarization of Dielectrics

18.2.1 When a polar dielectric (18.1.4) is not in an external
electric field, the vectors of its dipole moments are chaotically
oriented as a result of the random thermal motion of the
molecules. Consequently, the sum of the dipole moments of all the
molecules is equal to zero in any physically infinitesimal
volume $\Delta V$*.

No dipole moments of the molecules whatsoever can be induced
(18.1.3) in a nonpolar dielectric that is not in an external
electric field.


* Here $\Delta V \gg v_0$, where $v_0$ is the volume of a single molecule,
and an immense number of molecules are contained in
volume $\Delta V$.
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18.2.2  When  a  dielectric  is  placed  in  an  external  electric  field, 
it  is  polarized.  This  means  that  in  any  element  of  volume  AT, 
the  total  dipole  moment  of  its  molecules  is  no  longer  equal  to 
zero.  A  dielectric  in  such  a  state  is  said  to  be  polarized  (polar¬ 
ized  dielectric).  In  accordance  with  the  structure  of  the  mole¬ 
cules  (or  atoms),  distinction  is  made  between  the  following 
three  types  of  polarization: 

(a)  Orientation  polarization  of  polar  dielectrics  (18.1.4).  The 
external  electric  field  tends  to  align  the  dipole  moments  of  the 
permanent  dipoles  along  the  direction  of  the  field  (18.1.5). 
This  is  hindered  by  the  random  thermal  motion  of  the  molecules 
which  tends  to  disarrange  the  dipoles  arbitrarily.  The  combined 
action  of  the  field  and  thermal  motion  results  in  preferred  orien¬ 
tation  of  the  electric  dipole  moments  along  the  field.  This 
polarization  increases  with  the  strength  of  the  electric  field 
and  with  a  drop  in  temperature. 

(b) Electronic polarization in nonpolar dielectrics (18.1.3). In
the  molecules  of  dielectrics  of  this  type  dipole  moments  directed 
along  the  field  are  induced  (18.1.3)  by  the  action  of  the  external 
electric  field.  The  thermal  motion  of  the  molecules  has  no 
effect  on  electronic  polarization.  In  gaseous  and  liquid  dielec¬ 
trics,  electronic  polarization  takes  place  practically  simulta¬ 
neously  with  orientational  polarization. 

(c)  Ionic  polarization  in  solid  dielectrics  having  ionic  crystal 
lattices  (40.1.3),  such  as  NaCl  and  CsCl.  The  external  electric 
field  causes  displacement  of  all  the  positive  ions  in  the  direction 
of  the  field  strength  E  and  all  the  negative  ions  in  the  opposite 
direction. 

18.2.3 A quantitative measure of the polarization of a dielectric
is the polarization vector $\mathbf{P}_e$. The polarization vector (or
simply the polarization) is the ratio of the electric dipole moment
of a small volume $\Delta V$ of the dielectric to this volume:

$$\mathbf{P}_e = \frac{1}{\Delta V} \sum_{i=1}^{n} \mathbf{p}_{ei},$$

where $\mathbf{p}_{ei}$ is the electric dipole moment of the $i$th molecule,
and $n$ is the total number of molecules in the volume $\Delta V$. This
volume should be so small that the electric field (15.1.2) within
it can be assumed uniform. At the same time, the number $n$
of molecules in it should be sufficiently large to allow statistical
methods of investigation (8.2.2) to be applied.

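18.2.4 For a homogeneous nonpolar dielectric (18.1.3) in a uniform
electric field

$$\mathbf{P}_e = n_0 \mathbf{p}_e,$$

where $n_0$ is the number of molecules in unit volume, and $\mathbf{p}_e$
is the dipole moment of a single molecule. Making use of the
formula for $\mathbf{p}_e$ (18.1.3) we obtain

$$\mathbf{P}_e = n_0 \varepsilon_0 \alpha \mathbf{E} = \varepsilon_0 \chi_e \mathbf{E} \quad \text{(in SI units)},$$

$$\mathbf{P}_e = n_0 \alpha \mathbf{E} = \chi_e \mathbf{E} \quad \text{(in cgse units)},$$

where $\chi_e = n_0 \alpha$ is the dielectric (or electric) susceptibility of the
substance.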

18.2.5 For a homogeneous polar dielectric (18.1.4) in a uniform
electric field, $P_e = n_0 \langle p_e \rangle$, where $\langle p_e \rangle$ is the average value
of the component along the field of the permanent dipole
moment of the molecule. When a polar dielectric is in a weak
external electric field, the dielectric susceptibility is calculated
by the Debye-Langevin formula:

$$\chi_e = \frac{n_0 p_e^2}{3 \varepsilon_0 k T} \quad \text{(in SI units)},$$

$$\chi_e = \frac{n_0 p_e^2}{3 k T} \quad \text{(in cgse units)}.$$

Here $k$ is Boltzmann's constant (8.4.5), and $T$ is the absolute
temperature. The remaining notation is given in Sect. 18.1.3.
Shown in Fig. 18.2 is the dependence $\chi_e = \chi_e(1/T)$ for nonpolar
(a) and for polar (b) molecules. The straight line (b) does
not pass through the origin of coordinates because orientational
and electronic polarization (18.2.2b) usually occur together in polar
molecules and the dielectric susceptibility consists of two parts:
$\chi_e = \chi_e' + \chi_e''$, where $\chi_e'$ and $\chi_e''$ are expressed by the equations
in Sects. 18.2.4 and 18.2.5.
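
The temperature dependence described in Sects. 18.2.4 and 18.2.5 can be
sketched numerically. The example below is only an illustration under
stated assumptions: the concentration, polarizability and dipole moment
are made-up round values, the function names are hypothetical, and the
polarizability $\alpha$ is taken in m^3 so that $n_0 \alpha$ is dimensionless
in SI units.

```python
EPS0 = 8.8541878128e-12  # permittivity of vacuum, F/m
K_B = 1.380649e-23       # Boltzmann's constant, J/K


def chi_electronic(n0, alpha):
    """Electronic part, chi_e = n0 * alpha (SI form of Sect. 18.2.4)."""
    return n0 * alpha


def chi_orientational(n0, p_e, temperature):
    """Debye-Langevin part, chi_e = n0 * p_e**2 / (3 * eps0 * k * T)."""
    return n0 * p_e ** 2 / (3.0 * EPS0 * K_B * temperature)


# Assumed illustrative values for a polar gas.
N0 = 2.5e25       # molecules per m^3
ALPHA = 1.0e-29   # polarizability, m^3
P_E = 6.2e-30     # permanent dipole moment, C*m

# Total susceptibility as a function of 1/T.
points = [(1.0 / t, chi_electronic(N0, ALPHA) + chi_orientational(N0, P_E, t))
          for t in (300.0, 600.0)]

# The dependence on 1/T is a straight line; extrapolate it to 1/T = 0.
(x1, y1), (x2, y2) = points
slope = (y2 - y1) / (x2 - x1)
intercept = y1 - slope * x1
print(f"intercept at 1/T = 0: {intercept:.3e}")
print(f"electronic part n0*alpha: {chi_electronic(N0, ALPHA):.3e}")
```

The intercept of the straight line equals the electronic part of the
susceptibility, which is why line (b) does not pass through the origin.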

18.2.6 If a dielectric is in a uniform electric field, the electrical
neutrality of any volume $\Delta V$ of the dielectric, containing a
sufficiently large number of molecules, is provided for by the mutual
compensation of the dipole charges of opposite sign alongside
one another (Fig. 18.3). In the thin layers at surfaces $S_1$ and $S_2$
of the dielectric, bounding its volume, surface-bound polarization
charges appear as a result of the polarization of the dielectric.
At the surface where the lines of force (15.1.6) of the external
electric field enter the dielectric, negative charges of the ends
of the dipole molecules appear. Positive charges appear at the
opposite surface. The surface density (15.2.3) $\sigma_p$ of the
polarization charges is calculated by the formula

$$\sigma_p = P_{en},$$

where $P_{en}$ is the projection of vector $\mathbf{P}_e$ on the outward normal
to the surface of the dielectric.

Fig. 18.2    Fig. 18.3

In a nonuniform electric field, the polarization of a dielectric
is also nonuniform and the polarization vector $\mathbf{P}_e$ depends upon
the coordinates. Hence, in addition to surface-bound charges,
space polarization charges also appear. These are distributed
with the volume charge density (15.2.3) $\rho_p$:

$$\rho_p = -\operatorname{div} \mathbf{P}_e,$$

where $\operatorname{div} \mathbf{P}_e = \dfrac{\partial P_{ex}}{\partial x} + \dfrac{\partial P_{ey}}{\partial y} + \dfrac{\partial P_{ez}}{\partial z}$ is the divergence of
the polarization vector.
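
The two formulas of Sect. 18.2.6 can be checked against each other in a
one-dimensional sketch. The example assumes a hypothetical slab whose
polarization grows linearly across its thickness; the profile, the
numbers and the function names are illustrative assumptions only. The
bound volume charge and the bound surface charges should add up to
zero, since polarization does not change the total charge of the
dielectric.

```python
def polarization_x(x, p0=1.0e-6, thickness=1.0e-3):
    """Assumed linear profile P_ex(x) = p0 * x / thickness, in C/m^2."""
    return p0 * x / thickness


def bound_volume_density(p_func, x, h=1.0e-7):
    """rho_p = -div P_e, which reduces to -dP_ex/dx in one dimension.

    The derivative is estimated with a central difference.
    """
    return -(p_func(x + h) - p_func(x - h)) / (2.0 * h)


THICKNESS = 1.0e-3  # slab occupies 0 <= x <= THICKNESS, in metres

# Volume density of bound charge (constant for a linear profile).
rho_p = bound_volume_density(polarization_x, 0.5 * THICKNESS)

# Surface densities sigma_p = P_en on the two faces of the slab.
sigma_right = +polarization_x(THICKNESS)  # outward normal along +x
sigma_left = -polarization_x(0.0)         # outward normal along -x

# Total bound charge per unit area of the slab.
total_per_unit_area = rho_p * THICKNESS + sigma_right + sigma_left
print(f"rho_p = {rho_p:.3e} C/m^3")
print(f"total bound charge per unit area = {total_per_unit_area:.1e} C/m^2")
```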
18.3 Relation Between Displacement, Field Strength
and  Polarization  Vectors 


18.3.1  Distinction  is  made  in  substances  between  two  kinds  of 
charges:  free  and  bound. 

Bound charges are those included in the composition of atoms
and  molecules,  as  well  as  the  charges  of  ions  in  crystalline  solids 
with  an  ionic  lattice  (40.1.3).  Charges  not  bound  to  the  afore¬ 
mentioned  particles  of  matter  are  said  to  be  free.  Free  charges 
include:  (a)  current-conducting  charges  in  conducting  media 
[conduction  electrons  in  metals  and  semiconductors  (16.4.1), 
holes  in  semiconductors  (41.10.3),  ions  in  electrolytes  and 
gases], 

(b) surplus charges that are transmitted to a body by various
means and violate its electrical neutrality. An example is the
charge imparted from outside onto the surface of a dielectric.

18.3.2  Generally,  an  electric  field  is  set  up  in  a  dielectric  by 
both  free  and  bound  charges.  The  field  strength  vector  E  charac¬ 
terizes  the  resultant  field  in  the  dielectric,  set  up  by  the  two 
kinds  of  charges,  and  depends  upon  its  electrical  properties: 
the  relative  permittivity  (dielectric  constant)  er  (14.2.4).  But 
the  primary  source*  of  an  electric  field  in  a  dielectric  is  the  free 
charges.  As  a  matter  of  fact,  a  field  of  bound  charges  is  set  up 
in  a  dielectric  as  a  result  of  its  polarization  when  it  is  placed 
in  an  external  electric  field  set  up  by  free  electric  charges. 

18.3.3 The information contained in Sects. 18.3.1 and 18.3.2
calls for a certain revision of the Ostrogradsky-Gauss theorem
(15.3.3). The algebraic sum of the charges $\sum_{i=1}^{k} q_i$ on the
right-hand side of the theorem is to be regarded as the algebraic
sum of the free electric charges enclosed by surface $S$, i.e.
$\sum_{i=1}^{k} q_i = q_{free}$, and the theorem is then of the form

$$\oint_S \mathbf{D} \cdot d\mathbf{S} = \oint_S D_n \, dS = q_{free} \quad \text{(in SI units)},$$

$$\oint_S D_n \, dS = 4\pi q_{free} \quad \text{(in cgse units)}.$$

In this form the Ostrogradsky-Gauss theorem is valid for an
electric field either in homogeneous and isotropic or in
nonhomogeneous and anisotropic media (cf. 15.3.3).

18.3.4 For an electric field in vacuum

$$\mathbf{D} = \varepsilon_0 \mathbf{E} \quad \text{(in SI units)},$$

$$\mathbf{D} = \mathbf{E} \quad \text{(in cgse units)}.$$

The flux of the field strength vector $\mathbf{E}$ through an arbitrary closed
surface $S$ in vacuum is

$$\oint_S \varepsilon_0 E_n \, dS = q_{free} \quad \text{(in SI units)},$$

$$\oint_S E_n \, dS = 4\pi q_{free} \quad \text{(in cgse units)}.$$

In accordance with Sect. 18.3.2, the flux of vector $\mathbf{E}$ for a field
in a substance is

$$\oint_S \varepsilon_0 E_n \, dS = q_{free} + q_{bnd} \quad \text{(in SI units)},$$

$$\oint_S E_n \, dS = 4\pi (q_{free} + q_{bnd}) \quad \text{(in cgse units)}.$$

The sum of bound charges $q_{bnd}$ within closed surface $S$ is
calculated by the equation

$$q_{bnd} = -\oint_S P_{en} \, dS,$$

where $P_{en}$ is the projection of the polarization vector on the
outward normal to surface $dS$. Hence

$$\oint_S (\varepsilon_0 E_n + P_{en}) \, dS = q_{free} \quad \text{(in SI units)},$$

$$\oint_S (E_n + 4\pi P_{en}) \, dS = 4\pi q_{free} \quad \text{(in cgse units)}.$$

A comparison with the general formulation of the Ostrogradsky-Gauss
theorem (18.3.3) enables a relation to be established between
vectors $\mathbf{D}$, $\mathbf{E}$ and $\mathbf{P}_e$:

$$D_n = \varepsilon_0 E_n + P_{en} \quad \text{(in SI units)},$$

$$D_n = E_n + 4\pi P_{en} \quad \text{(in cgse units)},$$

or, by virtue of the arbitrary nature of outward normal $\mathbf{n}$,

$$\mathbf{D} = \varepsilon_0 \mathbf{E} + \mathbf{P}_e \quad \text{(in SI units)},$$

$$\mathbf{D} = \mathbf{E} + 4\pi \mathbf{P}_e \quad \text{(in cgse units)}.$$

These formulas are generalizations of the formulas in Sect. 15.3.1.
Making use of the results obtained in Sect. 18.2.4, we have
for an isotropic homogeneous medium

$$\mathbf{D} = \varepsilon_0 \mathbf{E} + \varepsilon_0 \chi_e \mathbf{E} = \varepsilon_0 (1 + \chi_e) \mathbf{E} \quad \text{(in SI units)},$$

$$\mathbf{D} = \mathbf{E} + 4\pi \chi_e \mathbf{E} = (1 + 4\pi \chi_e) \mathbf{E} \quad \text{(in cgse units)}.$$

Consequently

$$\mathbf{D} = \varepsilon_0 \varepsilon_r \mathbf{E}, \text{ where } \varepsilon_r = 1 + \chi_e \quad \text{(in SI units)},$$

$$\mathbf{D} = \varepsilon_r \mathbf{E}, \text{ where } \varepsilon_r = 1 + 4\pi \chi_e \quad \text{(in cgse units)}.$$

The quantity $\varepsilon_r$ is the relative permittivity (dielectric
constant) (cf. 14.2.4).

For vacuum $\varepsilon_r = 1$ and $\chi_e = 0$.

The concept of $\varepsilon_r$ introduced in Sect. 14.2.4 has a meaning
only for isotropic homogeneous media.
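
The relation between the three vectors in SI units can be illustrated
with a short sketch for an isotropic homogeneous medium. The field
components and the susceptibility are assumed values, and the function
name is hypothetical.

```python
EPS0 = 8.8541878128e-12  # permittivity of vacuum, F/m


def polarization_and_displacement(e_field, chi_e):
    """Return (P_e, D) for an isotropic medium in SI units.

    P_e = eps0 * chi_e * E  and  D = eps0 * E + P_e, component by component.
    """
    p_e = tuple(EPS0 * chi_e * comp for comp in e_field)
    d = tuple(EPS0 * comp + p_comp for comp, p_comp in zip(e_field, p_e))
    return p_e, d


# Assumed illustrative values.
E_FIELD = (1.0e5, 0.0, -2.0e4)  # field strength components, V/m
CHI_E = 3.0                     # dielectric susceptibility

P_E, D = polarization_and_displacement(E_FIELD, CHI_E)
EPS_R = 1.0 + CHI_E             # relative permittivity

# D should coincide with eps0 * eps_r * E.
d_check = tuple(EPS0 * EPS_R * comp for comp in E_FIELD)
print("D       =", D)
print("eps0*eps_r*E =", d_check)
```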


18.4  Ferroelectric  Materials 

18.4.1 Ferroelectrics belong to a group of crystalline dielectrics.
The first substance of this type to be investigated was Seignette's
salt (potassium sodium tartrate) NaKC4H4O6·4H2O and,
consequently, they are sometimes called seignette-electrics.
Another example of a ferroelectric is barium titanate BaTiO3.
Typical of ferroelectrics is a drastic increase in the relative
permittivity (dielectric constant) (14.2.4) in a definite
temperature range (Fig. 18.4).

18.4.2 The relative permittivity $\varepsilon_r$ and the dielectric
susceptibility $\chi_e$ (18.2.4) of ferroelectrics are functions of the
field strength $E$ in the substance (Fig. 18.5). As a result, no linear
relation between vectors $\mathbf{P}_e$ and $\mathbf{E}$ is observed in these
substances. The dependence of the electric displacement $D$ on the
field strength is of a complex nature and a linear relation between
$D$ and $E$ exists only at very high $E$ values (Fig. 18.6).

Fig. 18.4    Fig. 18.5

18.4.3 A monocrystal of a ferroelectric is divided into spontaneously
polarized regions, called domains (cf. 26.5.4). This spontaneous
polarization of the domains is a result of the orientation
of the dipole moments of the molecules in the domain in a definite
direction. In the absence of an external electric field, the
polarization vectors of the various domains are directed at random
and for a large monocrystal or a polycrystal the total
polarization is equal, on an average, to zero. When a ferroelectric
is placed in an electric field, the electric moments of
the domains are reoriented and the crystal acquires a total
polarization not equal to zero.

18.4.4 Domains are formed in ferroelectrics within a definite
temperature range, between the upper and lower Curie points
$\theta_C^{up}$ and $\theta_C^{low}$ (cf. 26.5.2). For Seignette's salt (also called
Rochelle salt) (18.4.1), $\theta_C^{up} = 298$ K and $\theta_C^{low} = 258$ K. At
$T > \theta_C^{up}$ the forces of interaction between the dipoles cannot
resist the thermal motion, the spontaneous polarization of the domains
is destroyed and the ferroelectric is converted into an ordinary
polar dielectric. The drastic increase in heat capacity of the
substance indicates that a second-order phase transition (12.4.2)
occurs at the Curie point. A disordered phase exists above the
Curie point $\theta_C^{up}$ and the dielectric is not polarized in the absence
of an external field. At $T < \theta_C^{up}$ there is an ordered phase
indicated by the spontaneous polarization in the domains*.

* A discussion of the phenomena that occur in ferroelectrics at
$T = \theta_C^{low}$ and lead to the destruction of the domains is beyond
the scope of the present handbook.

18.4.5 The phenomenon of dielectric, or ferroelectric, hysteresis
(lag) is observed in ferroelectrics. As is evident from
Fig. 18.7, with an increase in the strength of the external field,
the magnitude of vector $\mathbf{P}_e$ increases and reaches saturation
at point a. Then, as $E$ is reduced to zero, a ferroelectric
has remanent polarization, characterized by the value $P_{e0}$ of the
polarization vector. Polarization completely disappears only after
applying a depolarizing electric field of the strength $-E_c$ in the
opposite direction. The quantity $E_c$ is called the coercive force.
Periodic reversal of polarization of a ferroelectric material is
accompanied by the consumption of electric energy, spent in heating
the substance. The area of the hysteresis loop is proportional to the
electric energy that is converted into internal energy per unit
volume of the ferroelectric per cycle.
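
The statement about the area of the hysteresis loop can be illustrated
with a small sketch. It assumes an idealized rectangular loop in the
$(E, P_e)$ plane, for which polarization switches between $-P_s$ and $+P_s$
at the fields $\pm E_c$; the numerical values and the function name are
hypothetical. In SI units the area of a loop plotted in these
coordinates is the integral of $E \, dP_e$ around the cycle, in J/m^3.

```python
def loop_area(points):
    """Area enclosed by a closed polygon, found by the shoelace formula."""
    doubled = 0.0
    count = len(points)
    for i in range(count):
        x1, y1 = points[i]
        x2, y2 = points[(i + 1) % count]
        doubled += x1 * y2 - x2 * y1
    return abs(doubled) / 2.0


# Assumed illustrative values for an idealized rectangular loop.
E_C = 1.0e5   # coercive force, V/m
P_S = 0.26    # saturation polarization, C/m^2

# Corners of the loop as (E, P_e) pairs, traversed once around.
loop = [(-E_C, -P_S), (E_C, -P_S), (E_C, P_S), (-E_C, P_S)]

energy_per_cycle = loop_area(loop)  # J/m^3 per cycle
print(f"energy dissipated per unit volume per cycle: {energy_per_cycle:.3e} J/m^3")
```

For a measured loop the same function could be applied to the sampled
points of the curve taken in order around the cycle.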


CHAPTER  19  ENERGY  OF  AN  ELECTRIC  FIELD 


19.1  Energy  of  a  Charged  Conductor  and  an  Electric 
Field  * 


19.1.1 The charging of an electric conductor requires the
performance of work that must be done in overcoming the Coulomb
forces of repulsion between like charges. This work increases
the electrical energy of a charged conductor in the same sense
as the potential energy in mechanics (3.3.1). The work $\delta W$ done
in bringing a charge $dq$ from infinity to the conductor is

$$\delta W = V \, dq = CV \, dV,$$

where $C$ and $V$ are the capacitance and potential of the conductor.

The work required to charge a conductor from zero potential
to the potential $V$ is

$$W = \int_0^V CV \, dV = \frac{CV^2}{2}.$$

Correspondingly, the energy of a charged isolated conductor
(intrinsic energy of a charged conductor) is

$$E_e = \frac{CV^2}{2} = \frac{q^2}{2C} = \frac{Vq}{2}.$$

The energy of a charged capacitor is

$$E_e = \frac{C (\Delta V)^2}{2} = \frac{q^2}{2C} = \frac{q \, \Delta V}{2},$$

where $C$ and $q$ are the capacitance and charge of the capacitor,
and $\Delta V$ is the potential difference between oppositely charged
plates of the capacitor.

* It is assumed in this chapter that the electric charges and
charged bodies are in an isotropic, homogeneous, nonferroelectric
(18.4.1) medium.
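
A quick numerical check shows that the three equivalent expressions for
the energy of a charged capacitor give the same result. The capacitance
and potential difference below are assumed values, and the function
name is hypothetical.

```python
def capacitor_energy_forms(capacitance, delta_v):
    """Return the capacitor energy written in three equivalent ways.

    The charge is q = C * delta_v; the forms are C*dV^2/2, q^2/(2C), q*dV/2.
    """
    charge = capacitance * delta_v
    return (capacitance * delta_v ** 2 / 2.0,
            charge ** 2 / (2.0 * capacitance),
            charge * delta_v / 2.0)


# Assumed illustrative values: a 10 microfarad capacitor charged to 12 V.
forms = capacitor_energy_forms(10.0e-6, 12.0)
for value in forms:
    print(f"E_e = {value:.6e} J")
```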

19.1.2 The energy of a system of charges, distributed continuously
throughout the volume of a dielectric or vacuum, and
along the surfaces of charged conductors and an electrified
dielectric, is

$$E_e = \frac{1}{2} \int_{S_{char}} V \sigma \, dS + \frac{1}{2} \int_{v_{char}} V \rho \, dv,$$

where $\sigma$ and $\rho$ are the surface and volume density of free charges,
$V$ is the potential of the resultant field of all the surface and
volume charges at the points of the elements $dS$ or $dv$ of the
charged surface or volume (to avoid confusion, volume is denoted
here by the lower-case $v$). Integration is carried out over all
charged surfaces $S_{char}$ and throughout all charged volumes
$v_{char}$. The effect of the dielectric is that with a constant
distribution of the free charges the values of $V$ differ for
different dielectrics. Thus, in a homogeneous isotropic dielectric
that fills the whole field, $V$ is $\varepsilon_r$ times smaller than in vacuum.

19.1.3 The intrinsic energy of a charged conductor is, at the
same time, the energy of its electrostatic field. Hence, for the
uniform electrostatic field (15.1.3) of a parallel-plate capacitor
(17.2.5) we have

$$E_e = \frac{1}{2} (\Delta V)^2 C = \frac{\varepsilon_0 \varepsilon_r E^2}{2} S d = \frac{\varepsilon_0 \varepsilon_r E^2}{2} v \quad \text{(in SI units)},$$

$$E_e = \frac{\varepsilon_r E^2}{8\pi} v \quad \text{(in cgse units)},$$

where $v = Sd$ is the volume of the electrostatic field between
the capacitor plates. The energy of the field is proportional
to its volume, and the energy per unit volume of the electrostatic
field, called the volume energy density $E_e^d$, is the same
at all points of a uniform field:

$$E_e^d = \frac{E_e}{v} = \frac{\varepsilon_0 \varepsilon_r E^2}{2} = \frac{ED}{2} \quad \text{(in SI units)},$$

$$E_e^d = \frac{E_e}{v} = \frac{\varepsilon_r E^2}{8\pi} = \frac{ED}{8\pi} \quad \text{(in cgse units)}.$$

19.1.4 In the case of nonuniform electrostatic fields, set up by
arbitrary charged bodies, the volume energy density at each
point of an isotropic medium is expressed by the formulas in
the preceding subsection. But when the medium is electrically
anisotropic, the volume energy density of the electric field is

$$E_e^d = \frac{1}{2} \mathbf{D} \cdot \mathbf{E} \quad \text{(in SI units)},$$

$$E_e^d = \frac{1}{8\pi} \mathbf{D} \cdot \mathbf{E} \quad \text{(in cgse units)}.$$

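19.1.5 The energy $dE_e$ of an infinitesimal volume of an arbitrary
electrostatic field in an isotropic medium is

$$dE_e = E_e^d \, dv = \frac{\varepsilon_0 \varepsilon_r E^2}{2} \, dv \quad \text{(in SI units)}.$$

The total energy $E_e$ of an electrostatic field is

$$E_e = \int_{v_{field}} \frac{\varepsilon_0 \varepsilon_r E^2}{2} \, dv,$$

where integration is carried out throughout the whole volume
$v_{field}$ of the field.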
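
As a hedged illustration of the volume integral, the sketch below sums
the field energy outside a charged conducting sphere in vacuum and
compares it with $q^2 / 2C$. It assumes the standard results
$E = q / (4\pi \varepsilon_0 r^2)$ outside the sphere and $C = 4\pi \varepsilon_0 R$ for an
isolated sphere; the charge, radius, grid and function name are
illustrative choices. The integration stops at 1000 radii, so the sum
should fall short of the full energy by about 0.1 per cent.

```python
import math

EPS0 = 8.8541878128e-12  # permittivity of vacuum, F/m


def field_energy_outside_sphere(charge, radius, r_max_factor=1000.0, steps=2000):
    """Midpoint-rule sum of (eps0 * E^2 / 2) * 4*pi*r^2 * dr.

    The shells are spaced geometrically from radius to r_max_factor * radius.
    """
    ratio = r_max_factor ** (1.0 / steps)
    total = 0.0
    r_low = radius
    for _ in range(steps):
        r_high = r_low * ratio
        r_mid = 0.5 * (r_low + r_high)
        field = charge / (4.0 * math.pi * EPS0 * r_mid ** 2)
        density = EPS0 * field ** 2 / 2.0
        total += density * 4.0 * math.pi * r_mid ** 2 * (r_high - r_low)
        r_low = r_high
    return total


# Assumed illustrative values: 1 nC on a sphere of radius 5 cm.
Q, R = 1.0e-9, 0.05

numeric = field_energy_outside_sphere(Q, R)
capacitance = 4.0 * math.pi * EPS0 * R
intrinsic = Q ** 2 / (2.0 * capacitance)
ratio_check = numeric / intrinsic
print(f"field energy (numeric): {numeric:.6e} J")
print(f"q^2 / 2C:               {intrinsic:.6e} J")
```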

19.1.6 The total energy of an electrostatic field set up by an
arbitrary charged body is equal to the intrinsic energy of the
body (19.1.1):

$$E_e = \frac{CV^2}{2} = \int_{v_{field}} \frac{\varepsilon_0 \varepsilon_r E^2}{2} \, dv.$$

This equation can be generalized for the case of an electrostatic
field set up by an arbitrary system of charges. The total energy
of such a system (19.1.2) coincides with the total energy of the
electrostatic field set up by this system of charges:

$$\int_{v_{field}} \frac{\varepsilon_0 \varepsilon_r E^2}{2} \, dv = \frac{1}{2} \int_{S_{char}} V \sigma \, dS + \frac{1}{2} \int_{v_{char}} V \rho \, dv.$$

19.2  Energy  of  a  Polarized  Dielectric 

19.2.1 The process of polarizing a homogeneous isotropic dielectric,
placed in an external electric field, is accompanied by
work done in deforming the electron shells in the atoms and
molecules or in turning the dipole axes of the molecules into
alignment with the field strength. Therefore, a polarized dielectric
has a store of electric energy.

19.2.2 The volume energy density of a polarized dielectric at
a point with the field strength $E$ equals:

$$E_{e(diel)}^d = \frac{\varepsilon_0 (\varepsilon_r - 1) E^2}{2} \quad \text{(in SI units)}.$$

The volume energy density of a field with the same strength $E$
in vacuum is

$$E_{e(vac)}^d = \frac{\varepsilon_0 E^2}{2}.$$

The volume energy density of a field in a dielectric is
$E_e^d = \varepsilon_0 \varepsilon_r E^2 / 2$, and according to the energy conservation
law it equals

$$E_e^d = E_{e(vac)}^d + E_{e(diel)}^d.$$
CHAPTER 20 DIRECT ELECTRIC CURRENT

20.1  Concept  of  an  Electric  Current 

20.1.1 Electrodynamics is the main branch of the science of
electricity that deals with the phenomena and processes involved
in the motion of electric charges or macroscopic charged
bodies. The most vital concept in electrodynamics is that of
electric current.

20.1.2  An  electric  current  is  any  ordered  motion  of  electric  charges. 
An  electric  current  produced  in  conductive  media  as  the 
result  of  the  ordered  motion  of  free  charges  by  the  action  of  an 
electric  field  is  called  conduction  current.  Examples  of  conduction 
currents  are  the  currents  in  metals  and  semiconductors,  asso¬ 
ciated  with  the  ordered  motion  of  “free”  electrons,  and  currents 
in  electrolytes,  which  are  the  ordered  displacement  of  ions  of 
opposite  signs. 

A  convection  current  is  the  ordered  motion  in  space  of  charged 
macroscopic  bodies.  An  example  of  such  a  current  is  that  asso¬ 
ciated  with  the  motion  of  the  earth,  having  a  surplus  negative 
charge,  along  its  orbit. 

20.1.3 Upon ordered motion of electric charges in a conductor,
the equilibrium distribution of charges is violated and the surface
of the conductor is no longer an equipotential surface
(16.4.3). On the surface of the conductor there is a tangential
component of the field strength ($E_\tau \neq 0$) and there should be
an electric field inside the conductor (cf. Sect. 16.4.3). The
electric current continues until all the points of the conductor
become equipotential.

20.1.4  The  conditions  required  for  the  initiation  and  existence 
of  an  electric  current  in  a  conducting  medium  are  as  follows: 

(a)  The  availability  of  free  current  carriers— charged  particles— 
in  the  medium,  capable  of  ordered  motion  in  it.  In  metals  and 
semiconductors  suitable  particles  are  the  conduction  electrons 
(16.4.1);  in  liquid  conductors  (electrolytes)  they  are  the  positive 
and  negative  ions;  in  gases  they  are  the  oppositely  charged  ions 
and  electrons. 

(b) The existence in the given medium of an external electric
field whose energy is spent in providing ordered motion of the
electric charges. To maintain an electric current, the energy
of the electric field must be continuously replenished, i.e.
a source of electric energy is required. This is a device that
converts some kind of energy into the energy of an electric field.

20.1.5 The direction of ordered motion of the positive electric
charges is conventionally taken as the direction of the electric
current. As a matter of fact, in metallic conductors the current
consists of the ordered motion of electrons, which move in
the direction opposite to that of the current.


20.2  Current  and  Current  Density 

20.2.1 The current strength (or simply the current) $I$ is a scalar
quantity, the rate of flow of electricity. It is equal to the ratio
of the charge $dq$, conveyed through the surface* being considered
during a small interval of time, to the length $dt$ of this interval:

$$I = \frac{dq}{dt}.$$

An electric current is said to be direct (direct electric current)
if its magnitude and direction remain constant with time. For
direct current

$$I = \frac{q}{t},$$

where $q$ is the electric charge conveyed through the surface*
being considered during the finite time interval from 0 to $t$.

* In the case of conduction current, through the cross section
of the conductor.

20.2.2 If the current is a direct one, the charges cannot accumulate
or be depleted at any part of the conductor. A direct
current circuit should be closed and comply with the condition
$Q_{S_1} = Q_{S_2}$, where $Q_{S_1}$ is the total electric charge passing in
unit time through surface $S_1$ into the volume of the conductor
enclosed between the cross sections $S_1$ and $S_2$, and $Q_{S_2}$ is the
total electric charge emerging from this volume in unit time
through surface $S_2$.

20.2.3 The current density determines the current direction at
various points of the surface being considered and the distribution
of the current over this surface. The current density
vector $\mathbf{j}$ is opposite in direction to the motion of the electrons
that carry the current in metals* and is numerically equal
to the ratio of the current $dI$ through an element of surface
normal to the motion of the charged particles to the quantity
$dS'$, which is the area of this element of surface:

$$j = \frac{dI}{dS'}.$$

A more general relation between the current density $\mathbf{j}$ and an
element of current $dI$ is

$$dI = \mathbf{j} \cdot d\mathbf{S},$$

where $d\mathbf{S} = \mathbf{n} \, dS$ is the vector of an element of surface, and
$\mathbf{n}$ is the unit vector of the normal to surface $dS$ and makes angle
$\alpha$ with vector $\mathbf{j}$.

20.2.4 The current through an arbitrary surface $S$ is

$$I = \int_S \mathbf{j} \cdot d\mathbf{S} = \int_S j_n \, dS,$$

where $j_n = j \cos\alpha$ is the projection of vector $\mathbf{j}$ on the direction
of normal $\mathbf{n}$ (20.2.3), and integration is carried out over the
whole area of surface $S$. If the conduction current is being
calculated on the basis of the cross section of the conductor for
which $j_n = j$, then

$$I = \int_S j \, dS.$$

20.2.5 The density of direct current is the same over the whole
cross section $S$ of a homogeneous conductor. Here

$$I = jS.$$

In a direct current circuit, the current densities at two cross
sections $S_1$ and $S_2$ are inversely proportional to the cross-sectional
areas:

$$\frac{j_1}{j_2} = \frac{S_2}{S_1}.$$

* In other conducting media vector $\mathbf{j}$ coincides in direction with
that of the motion of the positively charged current carriers.
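
The inverse proportionality of the current densities can be shown with
a minimal sketch. It assumes the same direct current passes through two
cross sections of a circuit; the current and areas are illustrative
values and the function name is hypothetical.

```python
def current_density(current, area):
    """Uniform current density j = I / S over a cross section, in A/m^2."""
    return current / area


# Assumed illustrative values: 2 A through wires of 1 mm^2 and 4 mm^2.
CURRENT = 2.0
S1, S2 = 1.0e-6, 4.0e-6

j1 = current_density(CURRENT, S1)
j2 = current_density(CURRENT, S2)
print(f"j1 = {j1:.3e} A/m^2, j2 = {j2:.3e} A/m^2")
print(f"j1 / j2 = {j1 / j2:.3f}, S2 / S1 = {S2 / S1:.3f}")
```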
20.3 Fundamentals of the Classical Electron Theory
of  Electrical  Conduction  in  Metals 

20.3.1 The high electrical conduction of metals is due to the
fact that metals contain an immense number of current carriers,
the conduction electrons, which are formed of the valence electrons
(30.3.9) of the atoms of metal. These electrons do not belong
to any definite atom, but are collective electrons. According to
the classical Drude-Lorentz electron theory, these electrons are
regarded as an electron gas (16.4.1) that possesses all the
properties of a monatomic ideal gas (8.4.1).

The number of conduction electrons in unit volume of a monovalent
metal is

$$n_0 = \frac{N_A}{m_a} \rho,$$

where $N_A$ is Avogadro's number (Appendix II), $m_a$ is the
atomic mass of the metal and $\rho$ is its density. In order of
magnitude $n_0 \approx (10^{28} \text{ to } 10^{29})$ m$^{-3}$.

In the absence of an electric field inside the metal the conduction
electrons move at random and collide with the ions of the crystal
lattice of the metal (40.1.1). It is assumed that the mean free
path $\langle \lambda \rangle$ of the electron (10.5.1) should be of the order of
magnitude of crystal lattice constants of the metal, i.e.
$\langle \lambda \rangle \approx 10^{-10}$ m.

The average kinetic energy of thermal motion of the electrons
(10.2.4) is

$$\frac{m u_{rms}^2}{2} = \frac{3}{2} kT,$$

where $m$ is the mass and $u_{rms}$ is the root-mean-square velocity
of the electrons (10.2.3). At the temperature $T = 273$ K, the
velocity $u_{rms} \approx 10^5$ m/s.

The arithmetic mean velocity $\langle u \rangle$ of thermal motion of the
electrons (10.3.6) is of the same order of magnitude.
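
The orders of magnitude quoted in Sect. 20.3.1 can be reproduced with a
short estimate. The sketch below treats copper as a monovalent metal
and uses commonly tabulated values for its density and molar mass;
these inputs and the function names are assumptions made for
illustration and are not taken from the handbook.

```python
import math

N_A = 6.02214076e23           # Avogadro's number, 1/mol
K_B = 1.380649e-23            # Boltzmann's constant, J/K
M_ELECTRON = 9.1093837015e-31  # electron mass, kg


def electron_concentration(density, molar_mass):
    """n0 = N_A * rho / m_a for a monovalent metal (SI units)."""
    return N_A * density / molar_mass


def rms_velocity(temperature, mass=M_ELECTRON):
    """Root-mean-square velocity from m * u_rms^2 / 2 = 3 * k * T / 2."""
    return math.sqrt(3.0 * K_B * temperature / mass)


# Assumed tabulated values for copper: 8960 kg/m^3 and 63.546e-3 kg/mol.
n0_copper = electron_concentration(8960.0, 63.546e-3)
u_rms_273 = rms_velocity(273.0)
print(f"n0    = {n0_copper:.3e} m^-3")
print(f"u_rms = {u_rms_273:.3e} m/s at 273 K")
```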

20.3.2 Electric current in a metal is initiated by the action of
the external electric field (20.1.4), which leads to ordered motion
of the electrons. The current density $j$ is equal to the charge
of all the electrons that pass in unit time through unit area of
the conductor's cross section,

$$j = n_0 e \langle v \rangle,$$

where $n_0$ is the number of conduction electrons per unit volume,
$e$ is the magnitude of the charge of the electron, and $\langle v \rangle$ is the
average velocity of ordered motion of the electrons due to the
action of the external field. At the highest current densities
$\langle v \rangle$ is $10^{-4}$ m/s and is negligibly small compared to the thermal
velocities of the electrons (20.3.1).
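
To see how small the velocity of ordered motion is, the relation
$j = n_0 e \langle v \rangle$ can be solved for $\langle v \rangle$. The current density and
electron concentration below are assumed illustrative values for a
copper conductor, and the function name is hypothetical.

```python
E_CHARGE = 1.602176634e-19  # magnitude of the electron charge, C


def drift_velocity(current_density, n0):
    """Average velocity of ordered motion, <v> = j / (n0 * e), in m/s."""
    return current_density / (n0 * E_CHARGE)


# Assumed illustrative values: j = 1e7 A/m^2 (10 A per mm^2), n0 = 8.5e28 m^-3.
v_drift = drift_velocity(1.0e7, 8.5e28)
print(f"<v> = {v_drift:.2e} m/s")
```

The result is of the order of $10^{-4}$ to $10^{-3}$ m/s, many orders of
magnitude below the thermal velocities of about $10^5$ m/s.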

20.3.3 A steady electric current is established in a circuit in
the time $t = L/c$, where $L$ is the length of the circuit and $c$ is
the velocity of light in vacuum. The time $t$ coincides with the
time required to establish a steady-state electric field along
the circuit and to initiate ordered motion of the electrons
along the whole circuit. Therefore, electric current is started
practically simultaneously with the closing of the circuit.

20.3.4 Ohm's law for current density (Ohm's law in differential
form) is

$$\mathbf{j} = \kappa \mathbf{E}.$$

The current density in a conductor is the product of the electrical
conductivity $\kappa$ by the electric field strength $\mathbf{E}$. The quantity
$\rho = 1/\kappa$ is called the electrical resistivity.

According to the classical electron theory, electrical conductivity
and resistivity are calculated by the formulas

$$\kappa = \frac{n_0 e^2 \langle \lambda \rangle}{2m \langle u \rangle}$$

and

$$\rho = \frac{2m \langle u \rangle}{n_0 e^2 \langle \lambda \rangle},$$

where $n_0$ is the number of electrons in unit volume of the metal,
$\langle \lambda \rangle$ is the mean free path of the electron (10.5.1), $\langle u \rangle$ is the
arithmetic mean velocity of thermal motion of the electrons
(10.3.6), and $e$ and $m$ are the magnitude of the charge of the
electron and its mass.

20.3.5 By the action of the field the electron acquires a velocity
equal to $v_{max}$ at the end of its mean free path. Upon collision
with an ion at this point, the electron loses its velocity and
the energy of ordered motion of the electron is converted into
internal energy of the conductor, which is heated as the current
passes through.

The volume density $w$ of the heating capacity of a current is the
amount of energy generated in unit volume of a conductor in
unit time. The Joule-Lenz law for the volume density of the
heating capacity of a current is

$$w = \mathbf{j} \cdot \mathbf{E} = \kappa E^2 = \frac{1}{\rho} E^2.$$

The Joule-Lenz law in its differential form states: the volume
density of the heating capacity of a current is equal to the
scalar product of the current density vector by the electric field
strength vector.

The volume density of the heating capacity of a current is
independent of the kind of collisions of the electrons with the crystal
lattice points [elastic or inelastic collision (3.5.3)]. It follows
from the laws of conservation of energy and momentum that the
energy $\Delta E$, transmitted to the ion upon a collision of the
electron with an ion, is only a small part of the electron's energy
$E_{el}$. In an inelastic collision $\Delta E / E_{el} = m / (m + M)$, whereas in
an elastic collision $\Delta E / E_{el} = 4mM / (m + M)^2$, where $m$ is
the mass of the electron and $M$ is the mass of the ion.
Practically, in either case, $\Delta E / E_{el} \approx m / M \approx 10^{-4}$.
20.3.6 The Wiedemann-Franz law states that for all metals the
ratio of the thermal conductivity $K$ (10.8.5) to the electrical
conductivity $\kappa$ is directly proportional to the absolute
temperature $T$:

$$\frac{K}{\kappa} = 3 \left( \frac{k}{e} \right)^2 T,$$

where $k$ is Boltzmann's constant (8.4.5) and $e$ is the charge of
the electron.
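
As a numerical illustration, the classical coefficient $3(k/e)^2$ of the Wiedemann-Franz law can be evaluated directly. The following is a minimal Python sketch added for illustration; the function names and the rounded values of the constants are assumptions and are not taken from the handbook.

```python
# Classical (electron-gas) coefficient of the Wiedemann-Franz law:
#   K / kappa = 3 * (k / e)**2 * T
K_BOLTZMANN = 1.380649e-23      # Boltzmann's constant, J/K
E_CHARGE = 1.602176634e-19      # magnitude of the electron charge, C


def classical_coefficient():
    """Return 3 (k/e)^2 in V^2/K^2 (equivalently W*ohm/K^2)."""
    return 3.0 * (K_BOLTZMANN / E_CHARGE) ** 2


def conductivity_ratio(temperature_kelvin):
    """Return the ratio K/kappa predicted by the classical theory at temperature T."""
    return classical_coefficient() * temperature_kelvin


if __name__ == "__main__":
    print("3 (k/e)^2 =", classical_coefficient())
    print("K/kappa at 300 K =", conductivity_ratio(300.0))
```

The coefficient comes out at roughly $2.2 \times 10^{-8}$ W·Ω/K², independent of the metal, which is the content of the law.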

20.3.7  The  classical  electron  theory  of  electrical  conduction 
of  metals  has  the  following  shortcomings: 

(a) it cannot explain the experimentally observed linear
relation, in a wide temperature range, between the resistivity $\rho$
and the absolute temperature: $\rho \propto T$;

(b)  it  yields  an  incorrect  value  for  the  molar  heat  capacity  of 
metals.  This  should  be  equal,  according  to  this  theory,  to 
9  cal/mol-K  and  be  the  sum  of  the  heat  capacity  of  the  ionic 
crystal  lattice  (6  cal/mol-K)  and  the  heat  capacity  of  the  mon¬ 
atomic  electron  gas  (3  cal/mol-K).  It  is  known,  however,  from 
the experimentally established Dulong and Petit law (41.7.2)
that the molar heat capacity of metals differs only slightly from
that  of  other  solids  and  is  approximately  6  cal/mol-K.  It  is 
impossible  on  the  basis  of  classical  theory  to  explain  the  absence 
of  an  electron  component  of  heat  capacity  of  metals; 

(c) the experimental value of the resistivity $\rho$ and the theoretical
value of the arithmetic mean velocity $\langle u \rangle$ of the electrons lead,
in formulas of Sect. 20.3.4, to a value of the mean free path
$\langle \lambda \rangle$ of the electron that exceeds the crystal lattice constant of
the  metal  by  two  orders  of  magnitude.  This  contradicts  the 
assumptions  of  the  classical  electron  theory  of  the  electrical 
conduction  of  metals. 
21.1 Extraneous Forces

21.1.1  A  metallic  conductor  has  an  electrostatic  field  that  is 
set  up  by  the  electrons  and  positive  ions  of  the  crystal  lattice. 
This  is  a  field  of  Coulomb  forces  (14.2.2).  Coulomb  interaction 
between  the  charges  in  a  metal  leads  to  such  an  equilibrium 
distribution  of  charges  that  the  electric  field  inside  the  conductor 
equals  zero  and  the  whole  conductor  is  equipotential  (16.4.3). 
Consequently,  an  electrostatic  Coulomb  field  cannot  be  the 
cause  of  the  steady-state  process  of  ordered  motion  of  electrons, 
i.e.  it  cannot  generate  and  maintain  a  direct  (constant)  current. 

21.1.2 Extraneous forces are nonelectrostatic forces whose
action  exerted  on  the  conduction  electrons  in  the  conductor 
initiates  their  ordered  motion  and  maintains  a  direct  (constant) 
electric  current  in  the  circuit.  In  contrast  to  Coulomb  forces, 
extraneous  forces  do  not  join  unlike  charges;  they  lead  to  their 
separation  and  maintain  a  potential  difference  at  the  ends  of 
the  conductor. 

Extraneous  forces  are  the  cause  of  the  nonelectrostatic  electric 
field,  existing  in  the  conductor  and  providing  for  ordered  motion 
of  the  charges  from  points  with  a  higher  potential  to  points 
with  a  lower  potential.  A  steady-state  electric  field  of  extraneous 
forces is set up by sources of electrical energy (galvanic cells,
electric  generators,  etc.). 

21.1.3 In a closed hydraulic system that provides for constant
circulation of a liquid (Fig. 21.1), the liquid travels from point A
to point B, in the direction opposite to the pull of gravity, by the
action of certain “extraneous forces” exerted on the liquid by pump P.
The pump produces a constant difference in hydrostatic pressure
between points B and A, and the liquid flows by gravity from B to A.
A source of electrical energy plays a similar role in a direct current
circuit. Owing to the electric field set up by extraneous forces and
existing inside the source, the electric charges move within the
source against the forces of the electrostatic field. This maintains
a potential difference at the ends of the external circuit that is
required for a direct current to flow. At the expense of the energy
consumed in the source, work is done to obtain ordered motion of the
electric charges.

In a direct current generator, for example, the work of the
extraneous forces (21.1.2) is done at the expense of the
mechanical energy required to drive the rotor of the generator.

Fig. 21.1


21.2  Ohm's  Law  and  the  Joule-Lenz  Law 

21.2.1 At any point inside a portion of a conductor that
contains a source of electrical energy there is an electrostatic field
of Coulomb forces with the strength $\mathbf{E}_{coul}$ and an electric field
of extraneous forces with the strength $\mathbf{E}_{extr}$.

According to the principle of superposition of fields (15.2.2)
the strength of the resultant field is

$$\mathbf{E} = \mathbf{E}_{coul} + \mathbf{E}_{extr}.$$

Ohm's law for current density (20.3.4),

$$\mathbf{j} = \frac{1}{\rho} (\mathbf{E}_{coul} + \mathbf{E}_{extr}),$$

enables the following relation to be obtained for portion 1-2
of a homogeneous conductor of cross-sectional area $S$:

$$I \int_1^2 \rho \frac{dl}{S} = \int_1^2 \mathbf{E}_{coul} \cdot d\mathbf{l} + \int_1^2 \mathbf{E}_{extr} \cdot d\mathbf{l},$$

where $I$ is the current in the conductor and $d\mathbf{l}$ is a vector with the
magnitude $dl$, equal to an element of conductor length, directed
tangent to the conductor along the current density vector.

21.2.2 The integral $\int_1^2 \mathbf{E}_{coul} \cdot d\mathbf{l}$ is numerically equal to the
work done by the Coulomb forces in moving a unit positive
charge from point 1 to point 2. According to Sect. 16.3.1

$$\int_1^2 \mathbf{E}_{coul} \cdot d\mathbf{l} = V_1 - V_2,$$

where $V_1$ and $V_2$ are the potentials at points 1 and 2 of the
conductor.

The electromotive force (emf) $\mathcal{E}_{12}$, acting over the portion 1-2
of the circuit, is the name given to the line integral

$$\mathcal{E}_{12} = \int_1^2 \mathbf{E}_{extr} \cdot d\mathbf{l}.$$

The electromotive force $\mathcal{E}_{12}$ is numerically equal to the work
done by the extraneous forces in moving a unit positive charge
along the conductor from point 1 to point 2. This work is done
at the expense of the energy consumed in the source.

The voltage (or voltage drop) $U_{12}$ across the portion 1-2 is the
physical quantity numerically equal to the work done by the
resultant field of Coulomb and extraneous forces in moving
a unit positive charge along the circuit from point 1 to point 2.
Thus

$$U_{12} = \int_1^2 (\mathbf{E}_{coul} + \mathbf{E}_{extr}) \cdot d\mathbf{l} = \int_1^2 \mathbf{E} \cdot d\mathbf{l},$$

or

$$U_{12} = (V_1 - V_2) + \mathcal{E}_{12}.$$

The voltage across the ends of a portion of the circuit coincides
with the potential difference only when no emf is applied to this
portion.

The integral

$$\int_1^2 \rho \frac{dl}{S}$$

is the resistance $R_{12}$ in the portion of the circuit between
sections 1 and 2. For a homogeneous cylindrical conductor (wire),
$\rho = \text{const}$, $S = \text{const}$ and

$$R_{12} = \rho \frac{l_{12}}{S},$$

where $l_{12}$ is the length of the conductor between cross sections 1
and 2.

21.2.3 The generalized Ohm's law for an arbitrary portion of
a circuit is

$$I R_{12} = U_{12} = (V_1 - V_2) + \mathcal{E}_{12}.$$

The  product  of  the  current  by  the  resistance  of  a  portion  of  the 
circuit  is  equal  to  the  potential  difference  across  this  portion 
plus  the  emf  of  all  sources  of  electrical  energy  connected  in  the 
given  portion  of  the  circuit.  In  this  form  Ohm’s  law  is  applicable 
both  for  passive  portions  of  the  circuit,  not  containing  sources 
of  electrical  energy,  as  well  as  for  active  portions  that  contain 
such  sources. 
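
The generalized Ohm's law can be tried out numerically. The following is a minimal Python sketch, assuming all quantities are given in SI units; the function name and the sample values are hypothetical and serve only as an illustration.

```python
def branch_current(v1, v2, emf12, r12):
    """Current in portion 1-2 from the generalized Ohm's law:
    I * R12 = (V1 - V2) + emf12.

    emf12 is the algebraic sum of the emf's in the portion; it is positive
    when it drives current from point 1 toward point 2 and negative otherwise.
    """
    if r12 <= 0:
        raise ValueError("resistance must be positive")
    return ((v1 - v2) + emf12) / r12


if __name__ == "__main__":
    # Passive portion (no source): the current follows the potential difference.
    print(branch_current(v1=10.0, v2=4.0, emf12=0.0, r12=2.0))
    # Active portion: the emf drives current against the potential difference.
    print(branch_current(v1=2.0, v2=5.0, emf12=12.0, r12=3.0))
```

In the second call the potential rises from point 1 to point 2, yet the current still flows from 1 to 2 because the emf outweighs the potential difference.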

21.2.4 The sign rule for the emf of sources of electrical energy
included in portion 1-2 states that if the current inside the
source is from the cathode to the anode, i.e. the strength of the
field of extraneous forces coincides in direction with the current
in this portion of the circuit, then, in calculations, the emf $\mathcal{E}_{12}$
of this source is considered positive. If the current inside the
source is from the anode to the cathode, the emf $\mathcal{E}_{12}$ of this
source is considered negative (Fig. 21.2).

21.2.5 In a closed series electric circuit, the current is the same
at all cross sections, and such a circuit is a portion with
coinciding ends (points 1 and 2 coincide). In this circuit $V_1 = V_2$
and $R_{12} = R$, the total resistance of the whole circuit.

Fig. 21.2 (a), (b)

Ohm's law for a closed electric circuit is

$$IR = \mathcal{E},$$

where $\mathcal{E}$ is the algebraic sum of all the emf's applied in the
circuit.

If a closed circuit consists of a source of electrical energy with
an emf $\mathcal{E}$ and internal resistance $r$, and the resistance of the
external part of the circuit equals $R$, Ohm's law is of the form

$$I = \frac{\mathcal{E}}{R + r}.$$

The potential difference across the terminals of the source is
equal to the voltage across the external part of the circuit:

$$V_1 - V_2 = RI = \mathcal{E} - Ir.$$

If the circuit is open and has no current ($I = 0$), the potential
difference across the terminals of the source is equal to its emf:

$$V_1 - V_2 = \mathcal{E}.$$

A voltmeter connected in parallel with portion 1-2 of a direct
current (d.c.) electric circuit measures the potential difference
across the ends of this portion, and not the voltage:

$$R_V I_V = V_1 - V_2,$$

where $R_V$ and $I_V$ are the resistance of the voltmeter and its
current (Fig. 21.3). This follows from the generalized Ohm's
law (21.2.3) as written for portion 1-2 of the voltmeter circuit,
in which there is no emf (see also Sect. 21.3.5).
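
The relations for a closed circuit with a single source can be checked with a short calculation. The following is a minimal Python sketch under the assumption of one source with emf, internal resistance r and external resistance R; the function name and the numbers are hypothetical.

```python
def closed_circuit(emf, r_internal, r_external):
    """Return (current, terminal potential difference) for a single-source loop.

    Current:            I = emf / (R + r)
    Terminal voltage:   V1 - V2 = R * I = emf - I * r
    """
    total = r_external + r_internal
    if total <= 0:
        raise ValueError("total resistance must be positive")
    current = emf / total
    terminal = emf - current * r_internal
    return current, terminal


if __name__ == "__main__":
    current, terminal = closed_circuit(emf=12.0, r_internal=1.0, r_external=5.0)
    print("I =", current, "A;  V1 - V2 =", terminal, "V")
```

As the external resistance grows without limit the current tends to zero and the terminal potential difference tends to the emf, in agreement with the open-circuit case above.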

21.2.6 In a d.c. electric circuit with fixed conductors, the work
done by the extraneous forces is completely consumed in heating
the conductors (20.3.5). The energy $E$, evolved in a circuit during
the time $t$ throughout the whole volume of the conductor, is
equal to

$$E = IUt,$$

where $I$ is the current and $U$ is the voltage. The quantity of
heat $Q$ (9.2.1) in calories, corresponding to this energy and
evolved in the conductor, is

$$Q = 0.24\, IUt.$$

All the other quantities are expressed in SI units (Appendix I).
The Joule-Lenz law states that the quantity of heat generated
by current in a conductor is proportional to the current, voltage
and time that the current flows. Other expressions of the
Joule-Lenz law are:

$$Q = 0.24\, I^2 R t = 0.24\, \frac{U^2}{R}\, t.$$
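
The three equivalent forms of the Joule-Lenz law can be compared numerically. The following is a minimal Python sketch; the function names and the sample values are assumptions made for illustration, and the factor 0.24 cal/J is the rounded value used in the text.

```python
CAL_PER_JOULE = 0.24  # rounded conversion factor used in the text


def heat_joules(current, voltage, time):
    """Energy evolved in the conductor, E = I U t, in joules."""
    return current * voltage * time


def heat_calories(current, voltage, time):
    """Quantity of heat Q = 0.24 I U t, in calories."""
    return CAL_PER_JOULE * heat_joules(current, voltage, time)


if __name__ == "__main__":
    I, U, t = 2.0, 10.0, 60.0        # amperes, volts, seconds
    R = U / I                        # resistance of the portion, ohms
    print("E =", heat_joules(I, U, t), "J")
    print("Q =", heat_calories(I, U, t), "cal")
    # The alternative forms give the same heat:
    print(CAL_PER_JOULE * I**2 * R * t, CAL_PER_JOULE * U**2 / R * t)
```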


21.3 Kirchhoff's Laws

21.3.1  The  analysis  of  complex  (branched)  circuits  consists  in 
finding  the  currents  in  the  various  portions  of  the  circuit  in 
accordance  with  their  given  resistances  and  applied  emf’s. 


Fig. 21.3

21.3.2 A branch point, or junction, in a branched circuit, or
network, is a point where there are more than two possible
directions of the current (Fig. 21.4).

Kirchhoff's first law (point rule) states that the algebraic sum
of the currents that meet at any junction of a network equals
zero:

$$\sum_{i=1}^{n} I_i = 0,$$

where $n$ is the number of conductors joined at the branch point
and $I_i$ is the current in the $i$th conductor at the point. Currents
flowing toward the branch point are considered positive; those
flowing away are considered negative.

Kirchhoff's second law (mesh rule) states that in any closed loop
(mesh of a network), arbitrarily chosen in a branched circuit
(network), the algebraic sum of the products of the currents $I_i$
by the resistances $R_i$ of the corresponding portions of the loop is
equal to the algebraic sum of the emf's in the loop. Thus

$$\sum_{i=1}^{n_1} I_i R_i = \sum_{i=1}^{n_1} \mathcal{E}_i,$$

where $n_1$ is the number of portions into which the loop is divided
by branch points (junctions). In applying the mesh rule, a definite
direction around the loop is chosen (either clockwise or
counterclockwise). Currents coinciding in direction with the chosen
one are considered positive. The emf's of the sources of electrical
energy are considered positive if they produce currents whose
direction coincides with the chosen one.

Fig. 21.4

21.3.3  The  following  procedure  is  employed  to  analyse  a  complex 
(branched)  d.c.  circuit: 

(a)  The  directions  of  the  currents  are  arbitrarily  assigned  in  all 
the  portions  of  the  circuit. 

(b) For $m$ branch points (junctions), $m - 1$ independent
equations are written on the basis of Kirchhoff's first law.

(c) Arbitrary closed loops are separated out and, after choosing
the direction for going around each loop, a system of equations
is written on the basis of Kirchhoff's second law. In a branched
circuit consisting of $p$ portions (branches) between adjacent
branch points and $m$ branch points (junctions), the number of
independent equations based on the mesh rule equals $p - m + 1$.
In selecting the loops, it is necessary for each new loop to
contain at least one portion (branch) not included in the
previously considered loops.
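
The procedure of steps (a) to (c) can be sketched for a small network. The following Python example is an illustration only: the network (two sources with series resistances and one load resistor, all joined between the same two junctions A and B), the numerical values and the helper `solve_linear` are assumptions, not taken from the handbook.

```python
from fractions import Fraction


def solve_linear(coeffs, rhs):
    """Solve a small square linear system exactly by Gauss-Jordan elimination."""
    n = len(rhs)
    rows = [[Fraction(c) for c in row] + [Fraction(b)]
            for row, b in zip(coeffs, rhs)]
    for col in range(n):
        # Find a row with a non-zero entry in this column and move it up.
        pivot = next((r for r in range(col, n) if rows[r][col] != 0), None)
        if pivot is None:
            raise ValueError("the equations are not independent")
        rows[col], rows[pivot] = rows[pivot], rows[col]
        lead = rows[col][col]
        rows[col] = [x / lead for x in rows[col]]
        for r in range(n):
            if r != col and rows[r][col] != 0:
                factor = rows[r][col]
                rows[r] = [x - factor * y for x, y in zip(rows[r], rows[col])]
    return [rows[i][n] for i in range(n)]


# Hypothetical data: emf's in volts, resistances in ohms.
E1, E2 = 10, 6
R1, R2, R3 = 2, 1, 4

# (a) Assumed directions: I1 and I2 flow through the sources toward junction A,
#     I3 flows through the load away from junction A.
# (b) m = 2 junctions give m - 1 = 1 equation of the point rule.
# (c) p = 3 branches give p - m + 1 = 2 equations of the mesh rule.
coeffs = [
    [1, 1, -1],    # junction A:            I1 + I2 - I3 = 0
    [R1, 0, R3],   # loop source 1 + load:  I1*R1 + I3*R3 = E1
    [0, R2, R3],   # loop source 2 + load:  I2*R2 + I3*R3 = E2
]
rhs = [0, E1, E2]

I1, I2, I3 = solve_linear(coeffs, rhs)
print("I1 =", I1, " I2 =", I2, " I3 =", I3)
```

A negative value obtained for a current means only that its actual direction is opposite to the one arbitrarily assigned in step (a).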

21.3.4 To shunt an ammeter means to connect an additional
resistor $R_{sh}$ in parallel to the ammeter. This is done so that
the ammeter, having the resistance $R_0$ and designed to measure
a maximum current of $I_0$, can be applied to measure currents $I$
that exceed current $I_0$ (Fig. 21.5). The required resistance of
the shunt is found according to Kirchhoff's laws: $I = I_0 + I_{sh}$
and $I_0 R_0 = I_{sh} R_{sh}$, from which $I_{sh}$ is eliminated:

$$R_{sh} = \frac{I_0 R_0}{I - I_0}.$$

Fig. 21.5    Fig. 21.6

21.3.5 A potential difference $V_1 - V_2 = U$ across a portion
of a circuit is to be measured by a voltmeter designed for a
maximum measurement of $U_0$ volts (21.2.5) at a maximum
permissible current of $I_0$ (with $U_0 = I_0 R_0$) and this potential
difference exceeds $U_0$ (i.e. $U > U_0$). This can be done by connecting
an additional resistor $R_a$ in series with the voltmeter (Fig. 21.6).
The required additional resistance is determined from the
equation $U = (R_0 + R_a) I_0$, from which

$$R_a = \frac{U}{I_0} - R_0.$$
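
Both range extensions (the shunt of Sect. 21.3.4 and the series resistor of Sect. 21.3.5) reduce to one-line formulas. The following is a minimal Python sketch: the shunt formula is obtained by eliminating $I_{sh}$ from $I = I_0 + I_{sh}$ and $I_0 R_0 = I_{sh} R_{sh}$; the function names and the instrument data are hypothetical.

```python
def shunt_resistance(r0, i0, i_full):
    """Shunt for an ammeter of resistance r0 and full-scale current i0
    so that it reads currents up to i_full:  R_sh = I0 * R0 / (I - I0)."""
    if i_full <= i0:
        raise ValueError("a shunt is needed only when I exceeds I0")
    return i0 * r0 / (i_full - i0)


def additional_resistance(r0, i0, u_full):
    """Series resistor for a voltmeter of resistance r0 and maximum current i0
    so that it reads voltages up to u_full:  R_a = U / I0 - R0."""
    if u_full <= i0 * r0:
        raise ValueError("a series resistor is needed only when U exceeds U0")
    return u_full / i0 - r0


if __name__ == "__main__":
    # Ammeter: R0 = 0.9 ohm, I0 = 1 A, to be extended to 10 A.
    print("R_sh =", shunt_resistance(0.9, 1.0, 10.0), "ohm")
    # Voltmeter: R0 = 1000 ohm, I0 = 0.01 A (U0 = 10 V), to be extended to 100 V.
    print("R_a  =", additional_resistance(1000.0, 0.01, 100.0), "ohm")
```

Extending the range tenfold requires a shunt of $R_0/9$ for the ammeter and a series resistor of $9R_0$ for the voltmeter.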
CHAPTER 22 ELECTRIC CURRENT IN LIQUIDS AND GASES

22.1 Faraday's Laws of Electrolysis. Electrolytic Dissociation

22.1.1  Liquids  are  conductors  of  electric  current  (electrolytes, 
ion  conductors)  provided  that  ordered  motion  of  ions  can  be 
accomplished in them by the action of an external electric
field.

The ordered motion of ions in conducting liquids takes place
in an electric field set up by electrodes, which are conductors
connected  to  the  terminals  of  a  source  of  electrical  energy.  The 
positive  electrode  is  called  the  anode,  the  negative  electrode, 
the  cathode.  Positive  ions,  called  cations,  are  the  ions  of  metals 
and  hydrogen,  and  move  toward  the  cathode.  Negative  ions, 
called  anions,  are  the  ions  of  acid  radicals  and  of  the  hydroxyl 
group,  and  move  toward  the  anode.  Electric  current  in  electro¬ 
lytes  is  accompanied  by  the  phenomenon  of  electrolysis.  This 
is  the  deposition  on  the  electrodes,  or  the  liberation  at  them, 
of  the  components  of  the  substances  contained  in  the  electrolyte 
or  of  other  substances  produced  by  secondary  reactions. 

22.1.2  Faraday's  first  law  of  electrolysis  states  that  the  mass  M 
of  a  substance  deposited  on  or  liberated  at  an  electrode  is  di¬ 
rectly  proportional  to  the  electric  charge  Q  passing  through  the 
electrolyte.  Thus 

$$M = kQ = kIt,$$

if direct current $I$ is passed during the time $t$ through the
electrolyte.

The  proportionality  factor  k  is  called  the  electrochemical  equiv¬ 
alent  of  the  substance.  It  is  numerically  equal  to  the  mass  of 
the  substance  liberated  when  a  unit  electric  charge  passes 
through  the  electrolyte  and  depends  upon  the  chemical  nature 
of  the  substance. 

22.1.3 Faraday's second law of electrolysis states that the
electrochemical equivalents of the elements are directly proportional
to their chemical equivalents:

$$k = C k_{ch},$$

where $C$ is a certain constant universal for all the elements,
and $k_{ch}$ is the chemical equivalent and is equal to

$$k_{ch} = \frac{m_a}{z},$$

where $m_a$ is the atomic mass of the element (in kg/mol), and $z$
is its valency.

Hence, $k = \frac{1}{F} \frac{m_a}{z}$, where $F = 10^{-3}/C$ is Faraday's constant
(Appendix II).

22.1.4 Faraday's united law of electrolysis,

$$M = \frac{1}{F} \frac{m_a}{z} Q,$$

enables the physical meaning of $F$ to be cleared up: at $M = m_a/z$,
Faraday's constant $F = Q$. The amount of substance
equal to $1/z$ moles is called its gram equivalent. At $z = 1$ the
gram equivalent of a substance is equal to one mole. Faraday's
constant is numerically equal to the electric charge that must
be passed through an electrolyte to deposit one gram equivalent
of any substance on the electrode.
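
As a worked illustration of the united law, the mass deposited by a steady current can be computed as follows. This is a minimal Python sketch in which the molar mass is taken in kg/mol and Faraday's constant in C/mol; the function name and the example (divalent copper, with rounded data) are assumptions.

```python
FARADAY = 96485.33  # Faraday's constant, C/mol (rounded)


def deposited_mass(current, time, molar_mass, valency):
    """Mass deposited on an electrode, M = (1/F) * (m_a / z) * Q, with Q = I t.

    current in A, time in s, molar_mass in kg/mol; the result is in kg.
    """
    charge = current * time
    return (molar_mass / valency) * charge / FARADAY


if __name__ == "__main__":
    # Copper deposited from a solution of a divalent copper salt:
    # m_a = 63.546e-3 kg/mol, z = 2, I = 2 A during one hour.
    mass = deposited_mass(current=2.0, time=3600.0,
                          molar_mass=63.546e-3, valency=2)
    print("M =", mass * 1e3, "g")
```

With these data about 2.4 g of copper is deposited; doubling either the current or the time doubles the mass, as the first law requires.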

22.1.5  The  splitting  of  neutral  molecules  into  oppositely  charged 
ions  as  a  result  of  interaction  between  the  dissolved  substance 
(solute)  and  the  solvent  is  called  electrolytic  dissociation.  It  is 
caused  by  the  thermal  motion  of  the  polar  molecules  (18.1.4) 
of  the  solute,  consisting  of  interrelated,  oppositely  charged  ions 
(39.4.3), and by the interaction of these molecules with the
polar molecules of the solvent. Both causes lead to weakening
of the heteropolar bonds in ionic molecules (39.4.3) and to the
conversion of such molecules into two oppositely charged ions.
The degree of dissociation $\alpha$ is the ratio of the number $n'$ of
molecules dissociated into ions in a certain volume to the total
number $n_0$ of molecules of the solute in the same volume:
$\alpha = n'/n_0$.

22.1.6 The process opposite to electrolytic dissociation (22.1.5)
is called deionization. It consists in the recombination of ions
of opposite signs into neutral molecules. Under conditions of
dynamic (mobile) equilibrium between the dissociation and
deionization processes, $\alpha$ is determined by the equation

$$\frac{1 - \alpha}{\alpha^2} = \text{const} \times n_0.$$

As $n_0 \to 0$, we have $\alpha \to 1$, i.e. in weak solutions almost all
the molecules dissociate. The degree of dissociation $\alpha$ decreases
with an increase in the concentration of the solution. In highly
concentrated solutions

$$\alpha \approx \frac{\text{const}}{\sqrt{n_0}}.$$
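
The equilibrium equation $(1 - \alpha)/\alpha^2 = \text{const} \times n_0$ is a quadratic in $\alpha$ and can be solved in closed form. The following is a minimal Python sketch; the single argument `x` stands for the dimensionless product const × $n_0$, and both this parametrization and the function name are assumptions made for illustration.

```python
import math


def degree_of_dissociation(x):
    """Solve (1 - a) / a**2 = x for the root 0 < a <= 1.

    The quadratic x*a**2 + a - 1 = 0 has the positive root
    a = (-1 + sqrt(1 + 4x)) / (2x), written here in the equivalent form
    a = 2 / (1 + sqrt(1 + 4x)), which stays accurate for small x.
    """
    if x < 0:
        raise ValueError("x = const * n0 must be non-negative")
    return 2.0 / (1.0 + math.sqrt(1.0 + 4.0 * x))


if __name__ == "__main__":
    for x in (0.0, 0.01, 2.0, 1.0e4):
        print(x, degree_of_dissociation(x))
```

For small `x` the result tends to 1, and for large `x` it approaches $1/\sqrt{x}$, which reproduces the two limiting cases stated in the text.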


22.2  Atomicity  of  Electric  Charges 

22.2.1  It  follows  from  Faraday's  laws  of  electrolysis  that  all 
electric  charges  consist  of  a  whole  number  of  elementary  indi¬ 
visible  charges. 

22.2.2 The charge $Q$ of any ion is equal to

$$Q = \frac{zF}{N_A},$$

where $z$ is the valency of the ion, $F$ is Faraday's constant
(22.1.4), and $N_A$ is Avogadro's number (Appendix II). The
charge of a monovalent ion is equal to the charge $e$ of the
electron or proton:

$$Q_1 = e = 1.602 \times 10^{-19}\ \text{C} = 4.803 \times 10^{-10}\ \text{electrostatic units}.$$

Any  electric  charge  consists  of  a  whole  number  of  elementary 
charges  e  (14.1.2). 
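
The value of the elementary charge quoted above follows from Faraday's constant and Avogadro's number. A minimal Python sketch of this arithmetic is given below; the function name is hypothetical and the constants are rounded reference values, not taken from the handbook's appendix.

```python
FARADAY = 96485.33212      # Faraday's constant, C/mol
AVOGADRO = 6.02214076e23   # Avogadro's number, 1/mol


def ion_charge(valency):
    """Magnitude of the charge of an ion of the given valency, Q = z F / N_A, in C."""
    return valency * FARADAY / AVOGADRO


if __name__ == "__main__":
    print("e   =", ion_charge(1), "C")
    print("Q_2 =", ion_charge(2), "C")   # divalent ion: two elementary charges
```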


22.3  Electrolytic  Conduction  of  Liquids 

22.3.1 The current density $\mathbf{j}$ (20.2.3) in an arbitrary section $SS$,
perpendicular to the direction of ion motion (Fig. 22.1), is
equal to the sum of the current densities of the positive and
negative ions:

$$\mathbf{j} = \mathbf{j}_+ + \mathbf{j}_-.$$

Here $\mathbf{j}_+ = q_+ n_{0+} \langle \mathbf{v}_+ \rangle$ and $\mathbf{j}_- = q_- n_{0-} \langle \mathbf{v}_- \rangle$, where $q_+$ and
$q_-$, $n_{0+}$ and $n_{0-}$, and $\langle \mathbf{v}_+ \rangle$ and $\langle \mathbf{v}_- \rangle$ are the charges,
concentrations and average velocities of ordered motion (drift due
to the action of the electric field) of the positive and negative ions.

22.3.2 The average velocities of ion drift are proportional to the
electric field strength $\mathbf{E}$:

$$\langle \mathbf{v}_+ \rangle = u_+ \mathbf{E} \quad \text{and} \quad \langle \mathbf{v}_- \rangle = -u_- \mathbf{E},$$

where $u_+$ and $u_-$ are the mobilities of the positive and negative
ions. The mobility of an ion is equal to the ratio of the magnitude
of the average drift velocity vector to that of the field strength
vector. The mobility is independent of the field strength vector $\mathbf{E}$.
Since there are no bulk charges in electrolytes,
$q_+ n_{0+} + q_- n_{0-} = 0$.

Moreover, $q_+ = e z_+ = \frac{F}{N_A} z_+$ (22.2.2).

22.3.3 Ohm's law for current density in electrolytes (cf. Sect. 20.3.4) is

$$\mathbf{j} = \frac{F}{N_A} z_+ n_{0+} (u_+ + u_-) \mathbf{E}.$$

The resistivity $\rho$ of an electrolyte (20.3.4) is equal to

$$\rho = \frac{N_A}{F z_+ n_{0+} (u_+ + u_-)}.$$

If a molecule of the solute dissociates into $k_+$ positive and $k_-$
negative ions, then

$$k_+ z_+ = k_- z_-; \quad n_{0+} = k_+ \alpha n_0 \quad \text{and} \quad n_{0-} = k_- \alpha n_0,$$
where $\alpha$ is the degree of dissociation, $n_0$ is the concentration
of the solute (22.1.5) and

$$\rho = \frac{N_A}{F z_+ k_+ \alpha n_0 (u_+ + u_-)}.$$

The ratio $N_A/z_+$ is the number of positive ions in one gram
equivalent (22.1.4). We introduce the quantity

$$C = \frac{k_+ n_0 z_+}{N_A} = \frac{k_- n_0 z_-}{N_A},$$

which is called the equivalent concentration of the solution.
It is the number of gram equivalents of the ions of one sign
contained in unit volume of the electrolyte (either in the free
state or bound in the molecules). Then

$$\rho = \frac{1}{F C \alpha (u_+ + u_-)}.$$
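
The final formula lends itself to a quick estimate of the resistivity of an electrolyte. The following is a minimal Python sketch; the function name and all numerical values (an equivalent concentration of 100 equivalents per cubic metre, complete dissociation, and ion mobilities of the order of $10^{-7}$ m²/(V·s)) are assumed, order-of-magnitude inputs rather than data from the handbook.

```python
FARADAY = 96485.33  # Faraday's constant, C/mol (rounded)


def electrolyte_resistivity(equiv_conc, alpha, u_plus, u_minus):
    """Resistivity rho = 1 / (F * C * alpha * (u+ + u-)), in ohm*m.

    equiv_conc : equivalent concentration C, equivalents per m^3
    alpha      : degree of dissociation, between 0 and 1
    u_plus, u_minus : ion mobilities, m^2/(V*s)
    """
    if not 0.0 < alpha <= 1.0:
        raise ValueError("alpha must lie in (0, 1]")
    return 1.0 / (FARADAY * equiv_conc * alpha * (u_plus + u_minus))


if __name__ == "__main__":
    rho = electrolyte_resistivity(equiv_conc=100.0, alpha=1.0,
                                  u_plus=5.2e-8, u_minus=7.9e-8)
    print("rho =", rho, "ohm*m")
```

Halving the degree of dissociation or the concentration doubles the resistivity, since fewer ions are available to carry the current.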


22.4  Electrical  Conduction  in  Gases 

22.4.1 Gases, consisting of neutral atoms and molecules, are
insulators and do not conduct an electric current. Gases are
capable of conduction after they are ionized.

The  ionization  of  a  molecule  (or  atom)  consists  in  the  detachment 
of  one  or  several  electrons,  after  which  the  molecule  (or  atom) 
is  converted  into  a  positive  ion.  If  a  molecule  (or  atom)  gains 
electrons,  it  becomes  a  negative  ion. 

The  reverse  process,  in  which  electrons  join  a  positive  ion  to 
form  a  neutral  molecule  (or  atom),  is  called  recombination. 

22.4.2 To ionize a molecule (or atom) it is necessary to perform
the work of ionization $W_i$ to overcome the forces of attraction
between the electron being detached and the other particles
of the molecule (or atom). The work $W_i$ depends upon the energy
state of the electron being detached (39.1.9) in the atom or
molecule of the given gas. The ionization energy increases
with the electron detachment factor, i.e. with the number of
electrons detached from each atom.

22.4.3 The ionization potential $V_i$ is the potential difference
that the charged particle must traverse in an accelerating
electric field in order to accumulate energy equal to the work of
ionization $W_i$. Thus $V_i = W_i/e$, where $e$ is the magnitude of the
charge of the particle.

22.4.4 A gas is ionized by external effects: raising the
temperature sufficiently, the action of various kinds of radiation, cosmic
radiation, bombardment of the molecules (or atoms) of the gas
by high-speed electrons or ions, etc. The intensity, or rate, of
ionization is measured by the number of pairs of particles of
opposite sign formed in unit time in unit volume of the gas.

22.4.5 Collision ionization is the process in which a gas is ionized
by the action of fast-moving electrons or by ions. The minimum
kinetic energy (3.2.1) that an ionizing particle must have is
assessed by applying the laws of conservation of momentum and
energy. Thus

$$\left( \frac{m v^2}{2} \right)_{\min} = W_i \left( 1 + \frac{m}{M} \right),$$

where $W_i$ is the work of ionization, $m$ is the mass of the electron
and $M$ is the mass of the atom.

The smaller the ratio $m/M$, the closer this energy is to $W_i$. An
electron and a single-charge ion accumulate the same energy
$E = e\,\Delta V$ if they traverse the same potential difference $\Delta V$.

It follows from the preceding formula that for collision
ionization by electrons and ions, the latter of a mass $10^4$ times that
of the electron, ions must travel across a greater potential
difference in an accelerating field than electrons.
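
The comparison between electrons and ions can be made concrete. The following is a minimal Python sketch, assuming the threshold condition takes the usual form $W_i (1 + m/M)$ for a projectile of mass $m$ striking a stationary atom of mass $M$; the function name and the example (an atom with $W_i = 13.6$ eV and $M/m = 1836$ for an electron projectile) are illustrative assumptions.

```python
def threshold_energy(work_of_ionization, m_projectile, m_atom):
    """Minimum kinetic energy of the ionizing particle, W_i * (1 + m/M).

    The result is in the same units as work_of_ionization; only the
    ratio of the two masses matters.
    """
    if m_projectile <= 0 or m_atom <= 0:
        raise ValueError("masses must be positive")
    return work_of_ionization * (1.0 + m_projectile / m_atom)


if __name__ == "__main__":
    w_i = 13.6  # eV
    # Electron striking an atom 1836 times heavier: threshold barely above W_i.
    print("electron:", threshold_energy(w_i, 1.0, 1836.0), "eV")
    # Ion of about the same mass as the atom: threshold is twice W_i.
    print("ion:     ", threshold_energy(w_i, 1836.0, 1836.0), "eV")
```

Since a singly charged particle gains the energy $e\,\Delta V$ in a potential difference $\Delta V$, the ion in this example must traverse about twice the potential difference needed by the electron.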


22.5  Various  Types  of  Gas  Discharges 

22.5.1  A  gas  discharge  is  the  process  of  current  conduction  through 
a  gas. 

A semi-self-maintained gas discharge is one caused by external
ionizing agents (22.4.4) and ceasing when the ionizing action
stops. The curve in Fig. 22.2 shows the dependence of the
current $I$ in a semi-self-maintained discharge on the voltage $U$
across the electrodes. In the 1st region of the curve, at low
voltages, Ohm's law, similar to the one for electrolytes (22.3.3),
is valid. If electrons and monovalent ions are formed in the
gas, then

$$\mathbf{j} = e n_0 (u_+ + u_-) \mathbf{E},$$

where $n_0$ is the number of pairs of oppositely charged particles
in unit volume, $u_+$ and $u_-$ are the mobilities of the positive
and negative ions, and $e$ is the magnitude of the charge of the
electron. In a wide pressure range, from 10 to $10^7$ Pa, the
mobility of gaseous ions is inversely proportional to the
pressure.

22.5.2 The relation between the current $I$ and the voltage $U$
is no longer linear in the 2nd region of the curve in Fig. 22.2
because the ion concentration decreases in the gas. In this region
the current increases slower and slower with an increase in $U$.

Fig. 22.2

Beginning with a certain voltage $U_s$, the current remains
constant in the 3rd region of the curve in Fig. 22.2 when the
voltage is increased. This occurs because, at a constant rate of
ionization (22.4.4) in strong electric fields, all the ions formed in
unit time in the gas reach the electrodes. No further increase in the
current takes place at a fixed rate of ionization. Saturation
results because all the charged particles produced in the gas
travel in a strong electric field and reach the electrodes before
an appreciable part of them have enough time to recombine with
particles of opposite charge.

The saturation current $I_s$ is the maximum current that can be
obtained at a given rate (intensity) of ionization:

$$I_s = e N_0,$$

where $N_0$ is the maximum number of pairs of monovalent ions
formed in the gas in unit time at a given rate of ionization.
The linear relation between $I_s$ and $N_0$ confirms the ionic nature
of electrical conduction in gases.

22.5.3 A self-maintained gas discharge is one that continues
after the action of the external ionizing agent ceases. To obtain
such a discharge it is necessary for new pairs of oppositely
charged particles to be formed continuously in the gas. The main
source of these particles is collision ionization (22.4.5). At
a certain sufficiently high voltage across the electrodes, the
electrons are so highly accelerated by the electric field that
their energy proves adequate to ionize the gas molecules
(22.4.5). This is called volume ionization. Secondary electrons,
accelerated in the electric field, also ionize the gas molecules.
As a result, the number of current carriers in the gas and its
electrical conductance are greatly increased (4th region in
Fig. 22.2). But, by itself, ionization by the action of electrons
is insufficient to produce a self-maintained discharge. Electrons
travelling in the direction from the cathode toward the anode
ionize the molecules of gas located closer to the anode than the
place where the electrons are emitted. If the energy of the
positive ions is insufficient for collision ionization of the gas
molecules or for knocking electrons out of the metallic
cathode (surface ionization), electrons can be produced near
the cathode of a gas-discharge tube only by external ionizing
agents. When the action of these agents is stopped, the region of
collision ionization by electrons contracts toward the anode as
the electrons move in this direction. Under these conditions the gas
discharge is extinguished as soon as collision ionization ceases.

22.5.4 Surface ionization (22.5.3), initiated at high voltage $U$,
produces secondary electrons and sets up a two-way avalanche
of electrons and positive ions. After this the action of the
external ionizing agent is no longer essential for continuing the gas
discharge. When the voltage $U$ across the electrodes of a
gas-discharge tube is raised sufficiently, an electric breakdown of the
gas occurs. This is the transition from a semi-self-maintained
to a self-maintained gas discharge. The voltage $U = U_{br}$,
corresponding to electric breakdown, is called the ignition
potential, or breakdown voltage. The breakdown voltage for
a gas discharge in a tube having flat parallel electrodes at the
distance $d$ from each other depends upon the product $pd$, where
$p$ is the gas pressure (Fig. 22.3). In addition, $U_{br}$ depends
on the chemical nature of the gas and the material of the
cathode.

Fig. 22.3 (breakdown voltage plotted against the product $pd$)

Self-maintained gas discharges are classified into glow, corona,
spark  and  arc  discharges.  They  are  not  discussed  in  the  present 
handbook. 

22.6  Certain  Information  on  Plasma 

22.6.1  Plasma  is  a  special  state  of  matter,  whose  most  important 
property  is  a  predominant,  or  even  complete,  ionization  of  the 
particles of matter. The degree of ionization $\alpha$ is the ratio of the
number of ionized particles to their initial number. Plasma is
classified as weakly ionized (when $\alpha$ is a fraction of one per cent),
moderately ionized (when $\alpha$ equals several per cent) and fully
ionized (when $\alpha$ is close to 100 per cent).

Weakly ionized plasma is found in the ionosphere, the conducting
layer of the atmosphere. The ionosphere extends from 60 to
$2 \times 10^4$ km above the earth’s surface.

Fully ionized plasma, formed at ultra-high temperatures
(high-temperature plasma), exists on the sun and hot stars. Under
laboratory conditions, plasma is produced in gas discharges
(22.5.1) and gaseous-discharge lamps. Accelerated plasma is
employed as the working medium (11.1.1) in jet engines. Plasma
can also be used for the direct conversion of internal energy
into electrical energy (magnetohydrodynamic generators and
plasma sources of electrical energy).

The  great  number  of  charged  particles  in  a  plasma  provides  for 
its  high  electrical  conductance;  in  this  aspect  its  properties 
approach  those  of  conductors  of  electric  current. 

22.6.2 A condition for the existence of a plasma is a certain
minimum density $\rho_{min}$ of charged particles. Beginning with
this density, we can speak of a plasma rather than a simple
accumulation of separate charged particles. The density $\rho_{min}$
is determined by the inequality $L \gg D$, where L is a linear
dimension of the system of charged particles and D is a typical
“plasma” parameter, the length called the Debye-Hückel screening
radius (it is also called the Debye shielding length or Debye
length):

$$D = \left( \sum_i \frac{4\pi q_i^2 n_i}{k T_i} \right)^{-1/2} \quad \text{(in cgse units)},$$

$$D = \left( \sum_i \frac{q_i^2 n_i}{\varepsilon_0 k T_i} \right)^{-1/2} \quad \text{(in SI units)},$$


where $q_i$ is the charge of the ith species of particles, $n_i$ is their
concentration, $T_i$ is their temperature, k is Boltzmann’s constant
(8.4.5) and $\varepsilon_0$ is the electric constant. Summation is to be
carried out over all the species of particles. The quantity D
is the distance over which the Coulomb field of any charge of
the plasma is shielded. The shielding occurs because any charge
is predominantly surrounded by oppositely charged particles.
According to a more exact definition, a plasma is a quasi-neutral
assembly of a large number of charged particles that
occupy a region of space with the linear dimensions $L \gg D$.
The violation of quasi-neutrality in the plasma is eliminated
by the strong electric fields set up in it. The Debye-Hückel
screening radius characterizes the interaction of the particles
in the plasma. It was found that $E_p/E_k \propto N^{-2/3}$, where
$N = 4\pi n D^3/3$. Here $E_p$ is the potential energy of interaction
between two particles located at an average distance of $n^{-1/3}$ from
each other (n being the concentration of particles), and $E_k$
is the kinetic energy of these particles. The quantity N is the
total number of charged particles inside a sphere of radius D,
and is called the Debye number. If N is large, the plasma is
said to be gaseous and can be dealt with thermodynamically as
an ideal gas with the state equation $p = nkT$ (8.4.5).
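
As a rough numerical illustration of the SI formula for D and of the Debye number N in 22.6.2, the following Python sketch may be useful. The function names and the sample plasma (a hypothetical hydrogen plasma with electron and ion concentrations of 10^18 m^-3 at 10^4 K) are assumptions made for the example and are not taken from this handbook.

```python
import math

EPS0 = 8.8541878128e-12     # electric constant, F/m
K_B = 1.380649e-23          # Boltzmann constant, J/K
E_CHARGE = 1.602176634e-19  # elementary charge, C


def debye_length(species):
    """Debye-Hueckel screening radius D in metres (SI form of the formula).

    species: iterable of (q_i [C], n_i [m^-3], T_i [K]) tuples,
    one per species of charged particles.
    """
    inverse_d_squared = sum(
        q * q * n / (EPS0 * K_B * t) for q, n, t in species
    )
    return inverse_d_squared ** -0.5


def debye_number(n_total, d):
    """Number of charged particles inside a sphere of radius D."""
    return 4.0 * math.pi * n_total * d ** 3 / 3.0


# Hypothetical hydrogen plasma (illustrative values, not from the handbook).
n_each = 1.0e18   # concentration of electrons and of ions, m^-3
temp = 1.0e4      # common temperature of both species, K
plasma = [(-E_CHARGE, n_each, temp), (E_CHARGE, n_each, temp)]

D = debye_length(plasma)
N = debye_number(2.0 * n_each, D)   # n is taken as the total concentration
print(f"D = {D:.3e} m, N = {N:.3e}")
```

For these assumed values D is a few micrometres and N is of the order of a thousand, so such a plasma would count as gaseous in the sense used above.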

22.6.3 The long-range Coulomb interaction of charged particles in
a plasma gives it a qualitative uniqueness that enables it
to be regarded as a special, fourth state of aggregation of matter.
The most important properties of plasma are:

(a)  its  strong  interaction,  due  to  its  high  electrical  conduction, 
with  external  magnetic  and  electric  fields; 

(b)  specific  collective  interaction  of  particles  in  the  plasma  (by 
means  of  a  special  field  whose  nature  and  origin  is  beyond  the 
scope  of  this  handbook); 

(c)  the  fact  that  due  to  long-range  interaction  it  is  a  special 
kind  of  elastic  medium  in  which  various  types  of  vibrations 
and  waves  can  be  readily  excited  and  propagated. 

22.6.4 The motion of plasma in a magnetic field is employed
to directly convert the internal energy of an ionized gas into
electrical energy. This method is made use of in a
magnetohydrodynamic (MHD) generator. This generator operates as
follows: gas produced in the combustion of fuel and constituting
a plasma passes through a strong transverse magnetic field. In
a conducting plasma, as in any conductor moving in such
a field, electromagnetic induction (24.1.1) occurs. The induced
emf can be taken off by means of electrodes in the external
circuit. A schematic diagram of an MHD generator is shown in
Fig. 22.4. The ionized gas produced in fuel combustion passes
through the nozzle and its internal energy is converted into
kinetic energy. As this gas passes through the transverse magnetic
field B, the positive ions move toward the upper electrode by
the action of the induced electric field (24.1.1) and the free
electrons move toward the lower electrode. When we connect the
electrodes to an external load, a current is obtained in the circuit.

22.6.5 A state of thermodynamic equilibrium is possible at a
definite temperature in the plasma when the loss of charged
particles due to recombination (22.4.1) is compensated by new
acts of ionization. The average kinetic energies of the various
particles making up such a plasma are the same. The processes
of energy exchange between the particles of this plasma, as
well as the energy exchange of the plasma with blackbody
radiation (36.1.8), are equilibrium processes (8.3.7). A plasma
with the aforesaid properties is said to be isothermal. It exists
in the atmospheres of high-temperature stars.

A condition required for a high degree of ionization of a plasma
in thermodynamic equilibrium, consisting of two kinds of
charged particles, equal in magnitude and opposite in sign, is
the maximum reduction in the recombination (22.4.1) of the
particles. A completely ionized plasma can be obtained at
$kT \gtrsim eV_i$, where $V_i$ is the ionization potential (22.4.3) of the
gas atoms and kT is the average energy of thermal motion of
the plasma particles. For hydrogen and deuterium this corresponds
to $T \approx 160\,000$ K. Essential under these conditions are
the radiation of the plasma and the difficulty of isolating the
plasma from the walls of its container (22.6.7).
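
The temperature quoted in 22.6.5 for hydrogen can be checked with a short Python sketch that evaluates T = eV_i/k, the temperature at which kT equals eV_i. The function name and the value 13.6 V for the ionization potential of atomic hydrogen are assumptions supplied for this example and do not appear in the handbook text.

```python
K_B = 1.380649e-23          # Boltzmann constant, J/K
E_CHARGE = 1.602176634e-19  # elementary charge, C


def ionization_temperature(v_i):
    """Temperature (K) at which kT equals e*V_i, with V_i in volts."""
    return E_CHARGE * v_i / K_B


V_I_HYDROGEN = 13.6  # V, assumed ionization potential of atomic hydrogen
T_h = ionization_temperature(V_I_HYDROGEN)
print(f"T = {T_h:.3e} K")
```

The result is close to the rounded figure of 160 000 K given in the text.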

22.6.6 In the plasma of a gas discharge (22.5.1) (gas-discharge
plasma) thermodynamical equilibrium (22.6.5) is absent. The
charged particles of such a plasma are in an accelerating electric
field.

The average kinetic energy of the electrons in a gas-discharge
plasma is characterized by a certain electron temperature $T_e$
corresponding to the Maxwellian energy distribution of the
electrons (10.3.7). Owing to the lack of thermodynamical
equilibrium in such a plasma, $T_e$ is of a conventional nature. The
average kinetic energy of the neutral particles is substantially
less than that of the electrons. Besides the electron temperature
$T_e$, the parameters of a gas-discharge plasma are: the
electron concentration $n_e$, the number of ionizations per
electron per second, the density of the ion or electron current, and
the longitudinal electric field strength $E_z$ established along
the symmetry axis of the plasma.

22.6.7  The  existence  of  a  thermodynamically  nonequilibrium 
gas-discharge  plasma  is  made  possible  by  the  energy  of  the 
discharge  current  passing  through  it.  In  the  absence  of  an  exter¬ 
nal  electric  field,  the  gas-discharge  plasma  disappears. 

The  disappearance  of  an  unmaintained  gas-discharge  plasma 
that  has  been  left  to  itself  is  called  deionization  of  the  gas. 

In addition to the ionization and recombination processes
(22.4.1), a large share of the energy balance of a plasma that
exists in a limited volume consists in the interaction between
the plasma and the walls confining its volume, as well as the
radiation of the plasma and the transport of radiation in it.
The diffusion of the charged particles to the walls and their
recombination on the walls, and the transfer of energy to the
walls by heat conduction (10.8.2) contaminate the plasma with
impurities and reduce its energy. To avoid these phenomena,
measures are taken to confine the plasma by a magnetic field
to prevent contact with the walls.

Radiation of the plasma in the optical range and in the far
ultraviolet consists of: braking radiation (bremsstrahlung)
of the electrons that occurs when they are retarded by the ions,
ordinary radiation produced by excited particles, and
recombination radiation that occurs in the recombination process
(22.4.1). In a magnetic field, a plasma also has a special betatron
(synchrotron) radiation, which is beyond the scope of this
handbook.

CHAPTER 23 MAGNETIC FIELD OF DIRECT CURRENT

23.1  Magnetic  Field.  Ampere's  Law 

23.1.1 A magnetic field is one of the forms of an electromagnetic
field (15.1.2). A magnetic field is set up by moving charged
particles, as well as moving bodies carrying electric charges.
Magnetic fields act only on moving electric charges and moving
charged bodies.

Variable  electric  fields  (displacement  currents)  (27.3.1)  are 
also  sources  of  magnetic  fields. 

23.1.2 The main force characteristic of a magnetic field is the
vector of magnetic induction B (also called magnetic flux density,
magnetic displacement, or, simply, magnetic vector). Vector B
is determined by one of three methods:

(a)  from  Ampere’s  law  (23.1.4); 

(b)  from  the  effect  of  the  magnetic  field  on  a  current-carrying 
loop  (23.4.2); 

(c)  from  the  expression  for  the  Lorentz  force  (24.1.3). 

23.1.3 Magnetic fields are represented graphically by lines of
magnetic induction. Lines of magnetic induction (lines of magnetic
force) are curves whose tangent at each point coincides with
vector B at this point of the field. The directions of the magnetic
vector and the lines of magnetic induction are determined by the
corkscrew (Maxwell) rule (also called the right-hand screw rule):
if a corkscrew with right-hand thread is screwed in the direction
of the current density vector in the conductor (20.2.3), the
direction that the corkscrew handle turns indicates the direction
of the lines of magnetic induction and the magnetic vector.

Lines  of  magnetic  induction  are  not  interrupted  at  any  point 
of  the  magnetic  field,  i.e.  they  neither  begin  nor  end.  These 
lines  are  either  closed  curves  or  they  extend  from  infinity  to 
infinity  or  they  are  wound  infinitely  on  some  surface,  covering 
it  densely  all  over,  but  never  returning  again  to  any  point 
of  the  surface.  A  similar  situation  is  observed,  for  example, 
in  a  field  set  up  by  a  system  of  a  circular  current  and  an  infinite 
rectilinear  current  passing  through  the  centre  of  the  circular 
current and perpendicular to its plane.

A magnetic field is said to be uniform, or homogeneous, if vector B
is constant at any point in the field. Otherwise, it is said
to be nonuniform (nonhomogeneous).

23.1.4 The force exerted by a magnetic field on a current-carrying
conductor placed in the field is called Ampere's force.
Ampere's law states that an element of force dF exerted on a
small element of length dl of a conductor, carrying a current
and located in a magnetic field, is proportional to the vector
product of the element dl of conductor length by the magnetic
induction B:

$$d\mathbf{F} = I\,[d\mathbf{l}\;\mathbf{B}] \quad \text{(in SI units)},$$

$$d\mathbf{F} = \frac{I}{c}\,[d\mathbf{l}\;\mathbf{B}] \quad \text{(in Gaussian units, see Appendix I)}.$$


Here  dl  is  a  vector  with  the  magnitude  dl  and  has  the  same 
direction  as  the  density  vector  j  of  the  current  in  the  conduc¬ 
tor  (20.2.3). 

Ampere’s force F, acting in a magnetic field on a current-carrying
conductor of finite length, is

$$\mathbf{F} = \int I\,[d\mathbf{l}\;\mathbf{B}],$$

in which integration is carried out over the whole length of the
conductor. In the case of a uniform magnetic field (23.1.3)

$$F = IBl \sin\alpha \quad \text{(in SI units)},$$

$$F = \frac{1}{c}\,IBl \sin\alpha \quad \text{(in Gaussian units)},$$

where $\alpha$ is the angle between the density vector of the current
in the conductor and vector B. The relative positions of vectors
dF, B and dl are shown in Fig. 23.1. If dl is perpendicular to B,
the direction of force dF is found by the left-hand rule: if the
left hand, opened flat with the fingers parallel to one another
and the thumb perpendicular to the fingers, is held so that the
magnetic induction vector enters the palm and the fingers
indicate the direction of the electric current, the thumb indicates
the direction of the force exerted by the magnetic field on the
current-carrying conductor. It is evident from Fig. 23.1 that
vector dF is perpendicular to the plane passing through vectors
dl and B, and that its direction is such that from the
head of vector dF the direction of shortest rotation from vector
dl to vector B is counterclockwise. In other words, vector dF
coincides in direction with the vector product [dl B].
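
A minimal Python sketch of Ampere's law in SI units for a straight conductor in a uniform field is given here as an illustration. The helper names and the numerical values (a 0.5 m conductor carrying 2 A at 30° to a 0.2 T field) are hypothetical. The sketch evaluates the vector product [l B] directly and compares the magnitude of the result with $IBl \sin\alpha$.

```python
import math


def cross(a, b):
    """Vector product [a b] of two 3-component vectors."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def ampere_force(current, l_vec, b_vec):
    """Force F = I [l B] on a straight conductor in a uniform field (SI)."""
    return tuple(current * comp for comp in cross(l_vec, b_vec))


# Hypothetical example values.
current = 2.0                 # A
length = 0.5                  # m
b_mag = 0.2                   # T
alpha = math.radians(30.0)    # angle between the current and B

l_vec = (length, 0.0, 0.0)    # conductor along the x axis
b_vec = (b_mag * math.cos(alpha), b_mag * math.sin(alpha), 0.0)

force = ampere_force(current, l_vec, b_vec)
force_mag = math.sqrt(sum(c * c for c in force))
print(force, force_mag)
```

With the conductor along x and B in the xy plane, the force comes out along the z axis, perpendicular to both vectors, as the text states.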

Fig. 23.1

23.1.5 It follows from Ampere’s law that the magnetic induction
vector in SI units is numerically equal to the limit of the
following ratio: the force exerted by a magnetic field on an element
of a current-carrying conductor to the product of the current
by the length of an element of the conductor, the limit being
taken as the length of the element approaches zero, with the
element positioned in the field so that the limit reaches its
maximum value. Thus

$$B = \frac{1}{I} \left( \frac{dF}{dl} \right)_{max}.$$

In Gaussian units (Appendix I),

$$B = \frac{c}{I} \left( \frac{dF}{dl} \right)_{max},$$

where c is the electrodynamic constant (Appendix I).

23.1.6  In  contrast  to  electrostatic  forces  (14.2.2),  Ampere’s 
force  is  not  a  central  one  (3.3.4).  Ampere’s  force  is  perpendic¬ 
ular  to  the  lines  of  magnetic  induction  and  to  the  current- 
carrying  conductors. 


23.2  The  Biot-Savart-Laplace  Law 

23.2.1 The Biot-Savart-Laplace law determines the magnetic
induction at any point in a magnetic field set up by a direct
electric current flowing along a conductor of any shape. The
vector dB of magnetic induction at any point C of a magnetic
field, set up by an element of a conductor carrying the current I
and of the length dl, is determined by the formula

$$d\mathbf{B} = k \mu_r \frac{I}{r^3}\,[d\mathbf{l}\;\mathbf{r}],$$

where dl is the vector of an element of length of the conductor
(23.1.4), r is the radius vector from the element dl of the
conductor to point C (Fig. 23.2), r is the magnitude of the radius
vector r, k is a factor depending only upon the chosen system of
units, and $\mu_r$ is a dimensionless quantity that characterizes
the magnetic properties of the medium and is called the relative
magnetic permeability (or simply relative permeability) of the
medium. The relative permeability does not depend upon the choice
of units and equals unity for vacuum. For all substances, except
ferromagnetic ones, $\mu_r$ differs only slightly from unity (26.5.1).
See Sect. 26.4.5 for a general definition of $\mu_r$.

23.2.2 In SI units $k = \mu_0/4\pi$, where $\mu_0 = 4\pi \times 10^{-7}$ henry/m
is the permeability of free space, or the magnetic constant
(Appendix II). In Gaussian units, $k = 1/c$, where $c = 3 \times 10^{10}$ cm/s
is the electrodynamic constant (Appendix I).

The Biot-Savart-Laplace law is of the form

$$d\mathbf{B} = \frac{\mu_0\mu_r}{4\pi} \frac{I}{r^3}\,[d\mathbf{l}\;\mathbf{r}] \quad \text{(in SI units)}.$$

This form of the Biot-Savart-Laplace law and of all the
electromagnetic field equations is said to be rationalized.

The product $\mu_0\mu_r$ is sometimes called the absolute magnetic
permeability (or simply permeability) of the medium.

Fig. 23.2

In Gaussian units

$$d\mathbf{B} = \frac{1}{c} \mu_r \frac{I}{r^3}\,[d\mathbf{l}\;\mathbf{r}].$$

The magnitude of vector dB is equal to

$$dB = \frac{\mu_0\mu_r}{4\pi} \frac{I\,dl \sin\alpha}{r^2} \quad \text{(in SI units)},$$

$$dB = \frac{\mu_r}{c} \frac{I\,dl \sin\alpha}{r^2} \quad \text{(in Gaussian units)},$$

where $\alpha$ is the angle between vectors dl and r.

23.2.3 The magnetic field strength H is the vector characteristic
of a magnetic field that, for a homogeneous isotropic medium,
is related to B as follows:

$$\mathbf{H} = \frac{\mathbf{B}}{\mu_0\mu_r} \quad \text{(in SI units)},$$

$$\mathbf{H} = \frac{\mathbf{B}}{\mu_r} \quad \text{(in Gaussian units)}.$$

The universal relation between vectors B and H for a magnetic
field in an arbitrary medium and the more general definition
of the magnetic field strength vector H are discussed in Sect.
26.4.4.

The strength of a magnetic field set up by a direct current in
a homogeneous isotropic medium is independent of the magnetic
properties of the medium. Thus

$$d\mathbf{H} = \frac{I}{4\pi r^3}\,[d\mathbf{l}\;\mathbf{r}], \quad dH = \frac{I\,dl \sin\alpha}{4\pi r^2} \quad \text{(in SI units)},$$

$$d\mathbf{H} = \frac{I}{c r^3}\,[d\mathbf{l}\;\mathbf{r}], \quad dH = \frac{I\,dl \sin\alpha}{c r^2} \quad \text{(in Gaussian units)},$$

where $\alpha$ is the angle between vectors dl and r.

23.2.4 It follows from a comparison of the characteristics of
electric (E and D) and magnetic (B and H) fields that the electric
field strength vector E is analogous to the magnetic vector B.
Both vectors specify the force effects of the fields and depend
upon the properties of the medium in which the field is set
up.

The analogue of vector D of electric displacement (15.3.1) is
the magnetic field strength vector H.

23.2.5 An electric charge q, travelling in an unbounded homogeneous
and isotropic medium at the constant velocity v, sets
up a magnetic field whose magnetic induction $\mathbf{B}_q$ can be
calculated by the formula


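$$\mathbf{B}_q = \frac{\mu_0\mu_r}{4\pi} \frac{q}{r^3}\,[\mathbf{v}\;\mathbf{r}], \quad B_q = \frac{\mu_0\mu_r}{4\pi} \frac{qv \sin\beta}{r^2} \quad \text{(in SI units)},$$

$$\mathbf{B}_q = \frac{\mu_r}{c} \frac{q}{r^3}\,[\mathbf{v}\;\mathbf{r}], \quad B_q = \frac{\mu_r}{c} \frac{qv \sin\beta}{r^2} \quad \text{(in Gaussian units)},$$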


where $\beta$ is the angle between vectors v and r, and r is the radius
vector from the travelling charge to the point A being considered
in the field. Vectors $\mathbf{B}_q$ and $\mathbf{H}_q$ are perpendicular to the plane
passing through vectors v and r.

When q > 0, the shortest rotation from vector v to vector r
is seen from the head of vector $\mathbf{B}_q$ (and $\mathbf{H}_q$) to be
counterclockwise (Fig. 23.3a). When q < 0, then $\mathbf{B}_q$ (and $\mathbf{H}_q$) have the
opposite direction (Fig. 23.3b). The magnetic field of a moving
charge is variable, because the radius vector r varies in magnitude
and direction as charge q travels even if v = const. The
magnetic field of a moving charge, depending upon angle $\beta$
between vectors v and r, is not spherically symmetrical, as in
the case of the electrostatic field of a point charge (14.2.3).
The magnetic field being considered has mirror symmetry with
respect to the direction of v. The field strength is maximum at
points in a plane passed through the charge perpendicularly
to vector v (under the condition that $v \ll c$). At all points on
a straight line coinciding with vector v there is no magnetic
field induction.

Fig. 23.3
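
The angular dependence described in 23.2.5 can be illustrated with a short Python sketch of the SI magnitude formula $B_q = (\mu_0\mu_r/4\pi)\,qv \sin\beta / r^2$. The function name and the sample values (an electron moving at 10^6 m/s, observed at 10^-10 m) are assumptions chosen for this example. The absolute value of q is used because only the magnitude is computed.

```python
import math

MU0 = 4.0e-7 * math.pi          # magnetic constant, H/m
E_CHARGE = 1.602176634e-19      # elementary charge, C


def moving_charge_b(q, v, r, beta, mu_r=1.0):
    """Magnitude of B (tesla) set up by charge q moving at speed v << c.

    r is the distance to the field point and beta the angle between
    the velocity and the radius vector, in radians.
    """
    return MU0 * mu_r / (4.0 * math.pi) * abs(q) * v * math.sin(beta) / r ** 2


# Hypothetical example: an electron at 1e6 m/s, field point 1e-10 m away.
q, v, r = -E_CHARGE, 1.0e6, 1.0e-10

b_perpendicular = moving_charge_b(q, v, r, math.pi / 2.0)  # maximum field
b_oblique = moving_charge_b(q, v, r, math.pi / 6.0)        # beta = 30 degrees
b_on_axis = moving_charge_b(q, v, r, 0.0)                  # along the velocity
print(b_perpendicular, b_oblique, b_on_axis)
```

The field is largest in the plane through the charge perpendicular to v and vanishes on the line of motion, in agreement with the text.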

23.3 Simplest Cases of Magnetic Fields Set Up by
Direct Currents

23.3.1 Making use of the Biot-Savart-Laplace law we can determine
the characteristics (B and H) of a magnetic field set up by
an electric current flowing in a conductor of finite dimensions
and arbitrary shape. According to the principle of superposition
of fields (15.2.2) the magnetic induction B at an arbitrary point
of a magnetic field of a conductor carrying the current I is
equal to

$$\mathbf{B} = \int_L d\mathbf{B},$$

where dB is the magnetic induction of the field set up by an element
of the conductor with the length dl. Integration is carried out
over the whole length L of the conductor.

Fig. 23.4

23.3.2 A straight conductor MN carrying the current I sets up a
magnetic field with induction B and strength H at an arbitrary
point A equal to*


$$B = \frac{\mu_0\mu_r}{4\pi} \frac{I}{r_0} (\cos\varphi_1 - \cos\varphi_2), \quad H = \frac{B}{\mu_0\mu_r} \quad \text{(in SI units)},$$

$$B = \frac{1}{c} \mu_r \frac{I}{r_0} (\cos\varphi_1 - \cos\varphi_2), \quad H = \frac{B}{\mu_r} \quad \text{(in Gaussian units)},$$

where $r_0$ is the distance from point A to the conductor, $\varphi_1$ and
$\varphi_2$ are the angles formed by the radius vectors from point A to
the beginning and end of the conductor (Fig. 23.4), $\mu_r$ is the
relative magnetic permeability of the medium, and $\mu_0$ is the
magnetic constant in SI units (Appendix II).

* It is assumed in all the examples of Sect. 23.3 that the medium
is homogeneous, isotropic and completely fills all the space in
which the magnetic field exists.

For an infinitely long conductor ($\varphi_1 = 0$ and $\varphi_2 = \pi$)

$$B = \frac{\mu_0\mu_r}{4\pi} \frac{2I}{r_0}, \quad H = \frac{B}{\mu_0\mu_r} \quad \text{(in SI units)},$$

$$B = \frac{1}{c} \mu_r \frac{2I}{r_0}, \quad H = \frac{B}{\mu_r} \quad \text{(in Gaussian units)}.$$
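
The SI formula of 23.3.2 can be checked against the superposition integral of 23.3.1. The Python sketch below sums the contributions $dB = (\mu_0\mu_r/4\pi)\,I\,dl \sin\alpha / r^2$ of short elements of a straight conductor and compares the total with the closed form. The geometry is an assumption made for illustration: the conductor lies along a z axis, point A lies at distance r0 from it opposite z = 0, and $\varphi_1$, $\varphi_2$ are taken as the angles between the current direction and the lines from the beginning and end of the conductor to A. All names and numbers are hypothetical.

```python
import math

MU0 = 4.0e-7 * math.pi  # magnetic constant, H/m


def b_straight_closed_form(current, r0, phi1, phi2, mu_r=1.0):
    """B of a finite straight conductor from the closed-form expression."""
    return MU0 * mu_r / (4.0 * math.pi) * current / r0 * (
        math.cos(phi1) - math.cos(phi2)
    )


def b_straight_numeric(current, r0, z_start, z_end, steps=20000, mu_r=1.0):
    """Midpoint-rule sum of the Biot-Savart contributions dB."""
    dz = (z_end - z_start) / steps
    total = 0.0
    for i in range(steps):
        z = z_start + (i + 0.5) * dz          # centre of the element
        r_squared = r0 * r0 + z * z           # squared distance to point A
        sin_alpha = r0 / math.sqrt(r_squared)  # angle between dl and r
        total += dz * sin_alpha / r_squared
    return MU0 * mu_r / (4.0 * math.pi) * current * total


# Hypothetical example values.
current = 10.0              # A
r0 = 0.05                   # m
z_start, z_end = -0.1, 0.3  # ends of the conductor, m

phi1 = math.atan2(r0, -z_start)  # angle at the beginning of the conductor
phi2 = math.atan2(r0, -z_end)    # angle at the end of the conductor

b_closed = b_straight_closed_form(current, r0, phi1, phi2)
b_numeric = b_straight_numeric(current, r0, z_start, z_end)
b_infinite = b_straight_closed_form(current, r0, 0.0, math.pi)
print(b_closed, b_numeric, b_infinite)
```

Under these assumptions the numerical sum and the closed form agree closely, and the limiting angles 0 and $\pi$ reproduce the expression for an infinitely long conductor.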


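23.3.3 For the magnetic field at the centre of a rectangular turn
(loop) carrying the current I

$$B = \frac{\mu_0\mu_r}{4\pi} \frac{8I\sqrt{a^2+b^2}}{ab}, \quad H = \frac{B}{\mu_0\mu_r} \quad \text{(in SI units)},$$

$$B = \frac{1}{c} \mu_r \frac{8I\sqrt{a^2+b^2}}{ab}, \quad H = \frac{B}{\mu_r} \quad \text{(in Gaussian units)},$$

where a and b are the sides of the rectangle.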
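
The rectangular-loop formula follows from adding the fields of the four straight sides, each given by the finite-conductor expression of 23.3.2. The Python sketch below performs that addition in SI units and compares it with the formula above. Function names and numerical values are hypothetical, and the angles are again taken between the current direction and the lines from the ends of a side to the centre.

```python
import math

MU0 = 4.0e-7 * math.pi  # magnetic constant, H/m


def b_segment(current, r0, phi1, phi2, mu_r=1.0):
    """B of a finite straight conductor (formula of 23.3.2, SI)."""
    return MU0 * mu_r / (4.0 * math.pi) * current / r0 * (
        math.cos(phi1) - math.cos(phi2)
    )


def b_rect_center_formula(current, a, b, mu_r=1.0):
    """B at the centre of a rectangular loop with sides a and b (SI)."""
    return MU0 * mu_r / (4.0 * math.pi) * 8.0 * current * math.hypot(a, b) / (a * b)


def b_rect_center_superposition(current, a, b, mu_r=1.0):
    """Sum of the four side contributions, all in the same direction."""
    total = 0.0
    for side, distance in ((a, b / 2.0), (b, a / 2.0)):
        phi1 = math.atan2(distance, side / 2.0)  # angle at the start of a side
        phi2 = math.pi - phi1                    # angle at its end, by symmetry
        total += 2.0 * b_segment(current, distance, phi1, phi2, mu_r)
    return total


# Hypothetical example values.
current, a, b = 5.0, 0.2, 0.1   # A, m, m
b_formula = b_rect_center_formula(current, a, b)
b_super = b_rect_center_superposition(current, a, b)
print(b_formula, b_super)
```

The two results coincide to rounding error, which is a useful consistency check on both formulas.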

23.3.4 The magnetic moment $\mathbf{p}_m$ of a closed loop of arbitrary
shape, carrying the current I, is equal to

$$\mathbf{p}_m = I \int_S \mathbf{n}\,dS \quad \text{(in SI units)},$$

$$\mathbf{p}_m = \frac{1}{c} I \int_S \mathbf{n}\,dS \quad \text{(in Gaussian units)},$$

where n is a unit vector of the outward normal to element dS
of the surface S, bounded by the current-carrying loop. In the
case of a flat loop, surface S is plane and all the normals have
the same direction. Hence

$$\mathbf{p}_m = IS\mathbf{n}, \quad p_m = IS \quad \text{(in SI units)},$$

$$\mathbf{p}_m = \frac{1}{c} IS\mathbf{n}, \quad p_m = \frac{1}{c} IS \quad \text{(in Gaussian units)}.$$



Vector  pm  is  directed  so  that  from  its  head  the  current  in  the 
circuit  is  seen  to  be  flowing  counterclockwise  (Fig.  23.5). 


Fig.  23.5  Fig.  23.6 


23.3.5 The magnetic field set up by a circular turn, or loop,
carrying the current I, at an arbitrary point A on the axis of
the turn (loop) (Fig. 23.6), is specified by

$$\mathbf{B} = \mu_r \frac{\mu_0}{4\pi} \frac{2\mathbf{p}_m}{(R^2+h^2)^{3/2}}, \quad \mathbf{H} = \frac{\mathbf{B}}{\mu_0\mu_r} \quad \text{(in SI units)},$$

$$\mathbf{B} = \mu_r \frac{2\mathbf{p}_m}{(R^2+h^2)^{3/2}}, \quad \mathbf{H} = \frac{\mathbf{B}}{\mu_r} \quad \text{(in Gaussian units)}.$$

Here $\mathbf{p}_m$ is the magnetic moment of a current-carrying turn,
or loop (23.3.4).

The magnitudes of vectors B and H equal

$$B = \frac{\mu_0\mu_r}{2} \frac{IR^2}{(R^2+h^2)^{3/2}} = \frac{\mu_0\mu_r}{2\pi} \frac{IS}{(R^2+h^2)^{3/2}}, \quad H = \frac{B}{\mu_0\mu_r} \quad \text{(in SI units)},$$

$$B = \frac{1}{c} \mu_r \frac{2\pi IR^2}{(R^2+h^2)^{3/2}} = \frac{1}{c} \mu_r \frac{2IS}{(R^2+h^2)^{3/2}}, \quad H = \frac{B}{\mu_r} \quad \text{(in Gaussian units)},$$

where h is the distance from point A to the centre of the turn
(loop), R is the radius and S is the area of the turn.
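
The two SI expressions for the field on the axis of a circular loop, one in terms of I and R and one in terms of the magnetic moment $p_m = IS$, can be compared with a brief Python sketch. The function names and numerical values are hypothetical and serve only to illustrate the formulas.

```python
import math

MU0 = 4.0e-7 * math.pi  # magnetic constant, H/m


def b_loop_axis(current, radius, h, mu_r=1.0):
    """B on the axis of a circular loop at distance h from its centre (SI)."""
    return MU0 * mu_r * current * radius ** 2 / (
        2.0 * (radius ** 2 + h ** 2) ** 1.5
    )


def b_loop_axis_from_moment(p_m, radius, h, mu_r=1.0):
    """Same field written through the magnetic moment p_m = I*S (SI)."""
    return MU0 * mu_r / (4.0 * math.pi) * 2.0 * p_m / (
        radius ** 2 + h ** 2
    ) ** 1.5


# Hypothetical example values.
current, radius, h = 3.0, 0.05, 0.12     # A, m, m
p_m = current * math.pi * radius ** 2    # magnetic moment of the flat loop

b_from_current = b_loop_axis(current, radius, h)
b_from_moment = b_loop_axis_from_moment(p_m, radius, h)
b_centre = b_loop_axis(current, radius, 0.0)   # reduces to mu0*I/(2R)
print(b_from_current, b_from_moment, b_centre)
```

Setting h = 0 reproduces the field at the centre of the loop, $\mu_0\mu_r I/(2R)$.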

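23.3.6 The magnetic field at the centre of a circular turn, or
loop, has the characteristics (23.3.5):

$$\mathbf{B} = \mu_r \frac{\mu_0}{4\pi} \frac{2\mathbf{p}_m}{R^3}, \quad \mathbf{H} = \frac{\mathbf{B}}{\mu_0\mu_r} \quad \text{(in SI units)},$$

$$\mathbf{B} = \mu_r \frac{2\mathbf{p}_m}{R^3}, \quad \mathbf{H} = \frac{\mathbf{B}}{\mu_r} \quad \text{(in Gaussian units)}.$$

The magnitudes of vectors B and H equal

$$B = \mu_0\mu_r \frac{I}{2R}, \quad H = \frac{I}{2R} \quad \text{(in SI units)},$$

$$B = \frac{1}{c} \mu_r \frac{2\pi I}{R}, \quad H = \frac{1}{c} \frac{2\pi I}{R} \quad \text{(in Gaussian units)}.$$

The direction of the magnetic field is along the axis of the loop
and perpendicular to its plane (Fig. 23.6).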


Fig.  23.7 

23.3.7 A toroid is an annular coil wound on a core having the
shape of a torus (Fig. 23.7). The magnetic field of a toroid is
completely confined within its volume.
The characteristics of this magnetic field are calculated by the
formulas

$$B = \mu_0\mu_r \frac{NI}{2\pi r}, \quad H = \frac{NI}{2\pi r} \quad \text{(in SI units)},$$

$$B = \frac{1}{c} \mu_r \frac{2NI}{r}, \quad H = \frac{1}{c} \frac{2NI}{r} \quad \text{(in Gaussian units)}.$$

The magnetic induction B and the strength H of the magnetic
field on the circular centre line of the toroid are equal to

$$B = \mu_0\mu_r \frac{NI}{2\pi R_{cl}} = \mu_0\mu_r nI, \quad H = \frac{NI}{2\pi R_{cl}} = nI \quad \text{(in SI units)}.$$

Here N is the number of turns of the toroid, I is the current
in the turns, r is the radius of a circle in the toroid,
$R_{cl} = (R_1 + R_2)/2$, $R_1$ and $R_2$ are the external and internal radii
of the torus, and n is the number of turns per unit length of
the toroid’s centre line.

23.3.8 A solenoid is a cylindrical coil consisting of a large number
of turns of wire in the form of a helix. If the turns are wound
tight up against or sufficiently close to one another, the solenoid
is dealt with as a system of circular closed current-carrying
loops connected in series and having the same radius and a
common axis.

The magnetic moment $\mathbf{p}_m$ (23.3.4) of the solenoid is equal to
the vector sum of the magnetic moments of all N of its turns.
Thus

$$\mathbf{p}_m = NIS\mathbf{n},$$

where I is the current in the solenoid’s turns, S is the area of
a turn, and n is a unit vector of the normal to the plane of a
turn. Vector $\mathbf{p}_m$ is along the axis of the solenoid and coincides
in direction with the magnetic field as determined by the
corkscrew rule (23.1.3).

The magnetic induction B and strength H of the solenoid’s
magnetic field at an arbitrary point A, lying on its axis, are
numerically equal to

$$B = \frac{\mu_0\mu_r}{2} nI (\cos\alpha_2 - \cos\alpha_1), \quad H = \frac{B}{\mu_0\mu_r} \quad \text{(in SI units)},$$

$$B = \frac{1}{c} 2\pi\mu_r nI (\cos\alpha_2 - \cos\alpha_1), \quad H = \frac{B}{\mu_r} \quad \text{(in Gaussian units)},$$

where n = N/L is the number of turns per unit length of the
solenoid, $\alpha_2$ and $\alpha_1$ are angles at which the ends of the solenoid
are seen from point A ($\alpha_2 < \alpha_1$),

$$\cos\alpha_1 = -\frac{l_1}{\sqrt{R^2 + l_1^2}}, \quad \cos\alpha_2 = \frac{L - l_1}{\sqrt{R^2 + (L - l_1)^2}},$$

L is the length of the solenoid (Fig. 23.8), and R is the radius
of the cylindrical coil.


Fig.  23.8 

23.3.9 Under the condition that $L \gg R$, the magnetic field
inside the solenoid at points on its axis sufficiently distant
from its ends is characterized by the values

$$B = \mu_0\mu_r nI, \quad H = nI \quad \text{(in SI units)},$$

$$B = \frac{1}{c} 4\pi\mu_r nI, \quad H = \frac{1}{c} 4\pi nI \quad \text{(in Gaussian units)}.$$

The magnetic induction B and strength H of the magnetic field
of a solenoid of sufficient length, at points of its axis coinciding
with its ends, are equal to

$$B = \frac{1}{2} \mu_0\mu_r nI, \quad H = \frac{1}{2} nI \quad \text{(in SI units)},$$

$$B = \frac{1}{c} 2\pi\mu_r nI, \quad H = \frac{1}{c} 2\pi nI \quad \text{(in Gaussian units)}.$$
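
The limiting values of 23.3.9 can be recovered numerically from the finite-solenoid formula of 23.3.8. In the Python sketch below the quantity l1 is assumed to be the axial distance from point A to the end of the solenoid associated with $\alpha_1$, which is how the cosine expressions read. The function name and the sample solenoid (1 m long, 1 cm radius, 2000 turns per metre, 2 A) are hypothetical.

```python
import math

MU0 = 4.0e-7 * math.pi  # magnetic constant, H/m


def solenoid_axis_b(n, current, length, radius, l1, mu_r=1.0):
    """B on the axis of a finite solenoid (SI).

    n: turns per unit length; l1: assumed axial distance from point A
    to one end of the solenoid (0 <= l1 <= length).
    """
    cos_a1 = -l1 / math.hypot(radius, l1)
    cos_a2 = (length - l1) / math.hypot(radius, length - l1)
    return 0.5 * MU0 * mu_r * n * current * (cos_a2 - cos_a1)


# Hypothetical long solenoid, L >> R.
n, current = 2000.0, 2.0          # turns/m, A
length, radius = 1.0, 0.01        # m, m

b_middle = solenoid_axis_b(n, current, length, radius, length / 2.0)
b_end = solenoid_axis_b(n, current, length, radius, 0.0)
b_ideal = MU0 * n * current       # long-solenoid value far from the ends
print(b_middle, b_end, b_ideal)
```

For this geometry the field at the middle is within a fraction of a per cent of $\mu_0 nI$ and the field at an end is close to half of that value.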


23.4 Interaction of Conductors. Effect of a Magnetic
Field on Current-Carrying Conductors

23.4.1 Ampere’s force (23.1.4), which acts on an element dl
of length of a straight conductor carrying the current $I_1$, and
is exerted by a long straight conductor carrying the current $I_2$,
located parallel to the first conductor at a distance of a, is
numerically equal to

$$dF = \frac{\mu_0\mu_r}{4\pi} \frac{2I_1I_2}{a}\,dl \quad \text{(in SI units)},$$

$$dF = \frac{1}{c^2} \mu_r \frac{2I_1I_2}{a}\,dl \quad \text{(in Gaussian units)},$$

where $\mu_0$ is the magnetic constant in SI units (23.2.2) and c
is the electrodynamic constant (Appendix I). The force dF
is a force of magnetic interaction (24.1.4).

It is assumed that the lengths of the conductors are many times
greater than the distance a between them, and that element dl
is located far from the ends of the first conductor. It is moreover
assumed that the conductors are in a homogeneous isotropic
medium having the relative magnetic permeability $\mu_r$.

The force F exerted on a current-carrying conductor of length l
is equal to

$$F = \frac{\mu_0\mu_r}{4\pi} \frac{2I_1I_2}{a}\,l \quad \text{(in SI units)},$$

$$F = \frac{1}{c^2} \mu_r \frac{2I_1I_2}{a}\,l \quad \text{(in Gaussian units)}.$$

Conductors carrying currents $I_1$ and $I_2$ that flow in the same
direction attract each other. If the currents flow in opposite
directions, the conductors repel each other (see also Sect.
24.1.4).
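
A short Python sketch of the SI formula for the force between parallel conductors is given below. The function name and the sign convention (positive result for currents in the same direction, i.e. attraction) are assumptions made for the example. The case of two 1 A currents 1 m apart in vacuum gives $2 \times 10^{-7}$ N per metre of length.

```python
import math

MU0 = 4.0e-7 * math.pi  # magnetic constant, H/m


def force_between_wires(i1, i2, a, length, mu_r=1.0):
    """Force (N) on a length of one of two long parallel conductors (SI).

    Assumed sign convention: positive for attraction (currents in the
    same direction), negative for repulsion (opposite directions).
    """
    return MU0 * mu_r / (4.0 * math.pi) * 2.0 * i1 * i2 / a * length


f_same = force_between_wires(1.0, 1.0, 1.0, 1.0)       # currents parallel
f_opposite = force_between_wires(1.0, -1.0, 1.0, 1.0)  # currents antiparallel
print(f_same, f_opposite)
```

The function applies only under the assumptions stated in 23.4.1: conductors much longer than their separation, and a homogeneous isotropic medium.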

23.4.2  A  closed  flat  current-carrying  loop,  of  arbitrary  geometric 
shape,  placed  in  a  uniform  magnetic  held  (23.1.3),  is  subject 
to  a  moment  of  force  (t  orque)  M  equal  to 

M  =  [PmB]> 

where  pm  is  the  magnetic  moment  vector  of  a  current-carrying 
loop  (23.3.4),  and  B  is  the  vector  of  magnetic  induction  (23.1.2) 
of  the  field.  Torque  M  is  perpendicular  to  the  plane  passing 
through  vectors  pm  and  B  in  such  a  manner  that  the  shortest 
rotation  from  pm  to  B  is  seen  from  the  head  of  vector  M  to  be 
counterclockwise.  The  torque  tends  to  turn  the  loop  to  a  position 
of  stable  equilibrium,  in  which  vectors  pm  and  B  are  parallel 
to  each  other. 

The  preceding  formula  provides  for  a  definition  of  the  magnetic 
induction  B  (23.1.2):  the  magnitude  of  the  magnetic  induction 
vector  at  a  given  point  of  a  uniform  magnetic  field  is  equal  to 
the maximum value of the torque $M_{\max}$ acting in the vicinity
of the point on a small flat closed current-carrying loop, having
a magnetic moment $p_m$ of unit magnitude:

$$B = \frac{M_{\max}}{p_m} \quad \text{(in SI units)},$$

$$B = \frac{M_{\max}}{p_m} = \frac{M_{\max}}{\frac{1}{c}\,I S} \quad \text{(in Gaussian units)}.$$


23.4.3  When  a  closed  current-carrying  loop  is  placed  into  a 
nonuniform  magnetic  field  (23.1.3)  in  which  the  magnetic 
induction  B  of  the  field  varies  over  distances  comparable  to  the 
linear  dimensions  of  the  loop,  the  loop  is  subject  to  the  force  F 
equal  to 


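$$\mathbf{F} = p_{mx}\,\frac{\partial \mathbf{B}}{\partial x} + p_{my}\,\frac{\partial \mathbf{B}}{\partial y} + p_{mz}\,\frac{\partial \mathbf{B}}{\partial z},$$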


where $p_{mx}$, $p_{my}$ and $p_{mz}$ are the projections of vector $\mathbf{p}_m$ on the
axes of Cartesian coordinates. In particular, when vector B
is along the OX axis and depends only on coordinate x [i.e.
$B_x = B(x)$, $B_y = B_z = 0$ and $\mathbf{B} = B_x \mathbf{i}$] then

$$\mathbf{F} = p_{mx}\,\frac{dB}{dx}\,\mathbf{i}.$$

By  the  action  of  force  F  an  unfastened  current-carrying  loop 
is  attracted  into  a  region  with  a  stronger  magnetic  field. 


23.5  Total  Current  Law.  Magnetic  Circuits 

23.5.1  The  circulation  of  the  magnetic  induction  vector  B  along 
a  closed  circuit  L  is  an  integral  of  the  form: 

$$\oint_L \mathbf{B}\,d\mathbf{l} = \oint_L B\,dl\,\cos\alpha,$$

where L is the closed loop of arbitrary shape, $\alpha$ is the angle
between vectors B and dl, and dl is the vector of an element of
length of the loop in the direction it is circuited.

23.5.2  The  total  current  law  for  a  magnetic  field  in  vacuum  states: 
the  circulation  of  the  magnetic  induction  vector  of  a  magnetic 
field  along  a  closed  loop  in  vacuum  is  proportional  to  the  alge¬ 
braic  sum  of  the  currents  bounded  by  the  loop.  Thus 

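$$\oint_L \mathbf{B}\,d\mathbf{l} = \mu_0 \sum_{k=1}^{n} I_k \quad \text{(in SI units)},$$

$$\oint_L \mathbf{B}\,d\mathbf{l} = \frac{4\pi}{c} \sum_{k=1}^{n} I_k \quad \text{(in Gaussian units)},$$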

where $\mu_0$ is the magnetic constant in SI units (23.2.2), c is the
electrodynamic  constant  (23.2.2)  and  n  is  the  number  of  cur¬ 
rent-carrying  conductors  bounded  by  loop  L  of  arbitrary  shape. 
This  law  is  valid  for  current-carrying  conductors  of  any  shape 
or  size. 

In  calculating  the  algebraic  sum  of  the  currents,  a  current  is 
considered  positive  if  from  the  head  of  the  current  density 
vector (20.2.3) the circuiting of loop L is seen to be counterclockwise.
Otherwise, the current is considered to be negative.
For a generalization of the total current law to cover magnetic
fields in arbitrary media, see Sect. 26.4.2.
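The total current law in vacuum can be verified numerically. The sketch below is only an illustration under stated assumptions: it takes the standard field of an infinitely long straight conductor, B = mu_0 I / (2 pi r), and integrates it along two hypothetical square loops with a simple midpoint rule. All names and values are chosen for the example.

```python
import math

MU_0 = 4 * math.pi * 1e-7  # magnetic constant, H/m


def wire_field(x, y, current):
    """(Bx, By) of an infinite straight wire along OZ through the origin.

    Assumes the standard result B = mu_0 * I / (2 * pi * r), tangential.
    """
    r2 = x * x + y * y
    k = MU_0 * current / (2 * math.pi * r2)
    return (-k * y, k * x)


def circulation(vertices, current, steps_per_side=2000):
    """Midpoint-rule estimate of the circulation of B along a polygonal loop."""
    total = 0.0
    n = len(vertices)
    for i in range(n):
        x0, y0 = vertices[i]
        x1, y1 = vertices[(i + 1) % n]
        dx = (x1 - x0) / steps_per_side
        dy = (y1 - y0) / steps_per_side
        for s in range(steps_per_side):
            bx, by = wire_field(x0 + (s + 0.5) * dx, y0 + (s + 0.5) * dy, current)
            total += bx * dx + by * dy
    return total


current = 3.0  # A, flowing along +OZ
around_wire = [(-1, -1), (1, -1), (1, 1), (-1, 1)]  # counterclockwise, encloses the wire
beside_wire = [(2, -1), (4, -1), (4, 1), (2, 1)]    # does not enclose the wire

# Ratios of the computed circulation to mu_0 * I for each loop.
print(circulation(around_wire, current) / (MU_0 * current))
print(circulation(beside_wire, current) / (MU_0 * current))
```

The first ratio should be close to one and the second close to zero, in agreement with the statement that only currents bounded by the loop contribute. The loop is traversed counterclockwise as seen from the head of the current vector, so the current counts as positive.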

23.5.3  In  contrast  to  an  electrostatic  conservative  field,  in 
which  the  circulation  of  the  strength  E  along  any  closed  path 
equals  zero  (16.1.4),  a  magnetic  field  has  a  curl  (it  is  called 
a  rotational,  vortical  or  circuital  field).  In  such  a  field  the  cir¬ 
culation  of  the  magnetic  induction  vector  B  of  the  field  along 
a closed path is not equal to zero. If no currents are bounded by
loop  L,  the  circulation  of  vector  B  along  this  loop  equals  zero. 
But  this  does  not  alter  the  rotational  nature  of  the  magnetic 
field. 

23.5.4  The  total  current  law  can  be  written  for  either  vacuum 
or  for  an  arbitrary  medium  in  the  form  of  the  circulation  of 
field  strength  vector  H  (23.2.3)  along  an  arbitrary  closed  loop  L 
that  bounds  the  currents: 


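$$\oint_L \mathbf{H}\,d\mathbf{l} = \sum_{k=1}^{n} I_k \quad \text{(in SI units)},$$

$$\oint_L \mathbf{H}\,d\mathbf{l} = \frac{4\pi}{c} \sum_{k=1}^{n} I_k \quad \text{(in Gaussian units)}.$$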


If  no  currents  are  bounded  by  loop  L ,  the  circulation  of  vector  H 
along  this  loop  equals  zero. 

23.5.5  The  flux  of  the  magnetic  induction  vector  B  (magnetic 
flux) through a surface element of area dS is the scalar physical
quantity equal to

$$d\Phi_m = \mathbf{B}\,d\mathbf{S} = B_n\,dS = B\,dS\,\cos\gamma,$$


where $d\mathbf{S} = \mathbf{n}\,dS$, n is the unit vector of the normal to the
surface of area dS, $B_n$ is the projection of vector B on the direction
of the normal (Fig. 23.9), and $\gamma$ is the angle between vectors
B and n. The magnetic flux $\Phi_m$ through an arbitrary surface S is

$$\Phi_m = \int_S \mathbf{B}\,d\mathbf{S} = \int_S B_n\,dS.$$

In calculating this integral, the vectors n of the normals to the
elements of surface dS should all be in the same direction with
respect to surface S. Thus, if S is a closed surface, vectors n
should be directed either all outward or all inward. When the
field is uniform and S is a plane surface perpendicular to vector
B, then $B_n = B = \text{const}$ and $\Phi_m = BS$.

23.5.6 The Ostrogradsky-Gauss theorem for a magnetic field states
that the magnetic flux through an arbitrary closed surface equals
zero. Thus

$$\oint_S \mathbf{B}\,d\mathbf{S} = \oint_S B_n\,dS = 0.$$


This  theorem  is  a  mathematical  expression  of  the  fact  that  in 
nature  there  are  no  magnetic  charges  at  which  lines  of  magnetic 
induction  begin  or  end  (23.1.3). 

The  differential  form  of  the  Ostrogradsky-Gauss  theorem  for 
a  magnetic  field  is  one  of  Maxwell’s  equations  for  an  electro¬ 
magnetic  field  (27.4.2). 

23.5.7  A  magnetic  circuit  is  a  set  of  bodies  or  regions  of  space 
to  which  a  magnetic  field  is  confined.  For  example,  the  internal 
region  of  a  toroid  (23.3.7)  and  that  of  an  infinitely  long  solenoid 
(23.3.9)  are  magnetic  circuits.  To  strengthen  their  magnetic 
fields,  magnetic  circuits  are  made  of  materials  having  a  high 
relative magnetic permeability $\mu_r$ (23.2.1), such as iron. The
design  of  magnetic  circuits,  which  are  the  essential  components 
of  electrical  machinery  and  such  electric  devices  as  transformers 
and  electromagnets,  is  based  on  the  laws  of  magnetic  circuits. 

23.5.8 Ohm's law for a closed magnetic circuit (the Hopkinson
formula) is

$$\Phi_m = \frac{F_m}{R_m},$$

where $\Phi_m$ is the magnetic flux, which is constant along each
portion of the magnetic circuit, $F_m = IN$ is the magnetomotive
force (mmf), or magnetizing force (in SI units), N is the number
of turns carrying the electric current I for magnetizing the
circuit, and $R_m$ is the total magnetic resistance (reluctance) of
the circuit. The magnetic resistance of a magnetic circuit of
length $l_i$ is

$$R_{mi} = \int_0^{l_i} \frac{dl}{\mu_0 \mu_r S} \quad \text{(in SI units)},$$

where $\mu_r$ is the relative magnetic permeability of the given
circuit, $\mu_0$ is the magnetic constant in SI units (23.2.2), and
S is the cross-sectional area of the circuit. When $S = \text{const}$,
$R_{mi} = l_i/(\mu_0 \mu_r S)$ (in SI units).

23.5.9  The  total  magnetic  resistance  Rm  of  the  parts  of  a  mag¬ 
netic  circuit  connected  in  series  is 

$$R_m = \sum_{i=1}^{n} R_{mi},$$

where  Rmi  is  the  magnetic  resistance  of  the  ith  part  of  the  mag¬ 
netic  circuit,  and  n  is  the  number  of  parts  that  make  up  the 
circuit. 

If  n  magnetic  resistances  are  connected  in  parallel,  the  total 
magnetic  resistance  Rm  of  the  circuit  is 


$$R_m = \frac{1}{\sum_{i=1}^{n} \dfrac{1}{R_{mi}}}.$$
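The rules for magnetic resistances lend themselves to a short calculation. The following sketch assumes a hypothetical iron core with an air gap; the dimensions, the permeability of 1000, the current and the number of turns are invented for illustration and do not come from the handbook.

```python
import math

MU_0 = 4 * math.pi * 1e-7  # magnetic constant, H/m


def reluctance(length, area, mu_r=1.0):
    """Magnetic resistance of a uniform portion: R = l / (mu_0 * mu_r * S)."""
    return length / (MU_0 * mu_r * area)


def series(*resistances):
    """Total magnetic resistance of portions connected in series."""
    return sum(resistances)


def parallel(*resistances):
    """Total magnetic resistance of portions connected in parallel."""
    return 1.0 / sum(1.0 / r for r in resistances)


def magnetic_flux(current, turns, total_resistance):
    """Hopkinson formula: flux = mmf / reluctance, with mmf = I * N."""
    return current * turns / total_resistance


# Hypothetical core: 0.5 m of iron (mu_r = 1000) in series with a 1 mm air gap.
area = 1.0e-3  # m^2
r_core = reluctance(0.5, area, mu_r=1000.0)
r_gap = reluctance(1.0e-3, area)
r_total = series(r_core, r_gap)
flux = magnetic_flux(current=2.0, turns=500, total_resistance=r_total)
print(r_core, r_gap, flux)
```

In this example the thin air gap has twice the magnetic resistance of the whole iron path, which illustrates why gaps dominate the design of magnetic circuits.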

23.5.10  A  branch  point  of  a  magnetic  circuit  is  a  region  of  space 
or  bodies  where  there  are  more  than  two  possible  directions  of 
the  lines  of  magnetic  induction  (23.1.3). 

Kirchhoff's  first  law  for  branched  magnetic  circuits  states  that 
the  algebraic  sum  of  the  magnetic  fluxes  in  the  branches  that 
join  in  a  branch  point  is  equal  to  zero.  Thus 

$$\sum_{i=1}^{n} \Phi_{mi} = 0,$$

where  n  is  the  number  of  branches  joined  at  the  branch  point 
(cf. Sect. 21.3.1).

A magnetic flux is considered positive if the lines of induction
enter the branch point, and negative if the lines emerge from it.

23.5.11 Kirchhoff's second law for magnetic circuits states that
in  any  closed  loop  (mesh  of  a  network),  arbitrarily  chosen  in 
a  branched  magnetic  circuit  (network),  the  algebraic  sum  of  the 
products  of  the  magnetic  fluxes  by  the  magnetic  resistances  of 
the  corresponding  portions  of  the  circuit  (23.5.8)  equals  the 
algebraic sum of the magnetomotive forces in this loop (23.5.8):

$$\sum_{i=1}^{k} \Phi_{mi} R_{mi} = \sum_{i=1}^{k} F_{mi},$$

where k is the number of portions that make up the closed loop
(cf. 21.3.2). The fluxes $\Phi_{mi}$ and mmf's $F_{mi}$ are considered
positive if the directions of the corresponding lines of magnetic
induction (23.1.3) coincide with the arbitrarily chosen direction
for circuiting the loop.




23.6  Work  Done  in  Moving  a  Current-Carrying 
Conductor  in  a  Magnetic  Field 

23.6.1  An  unfixed  current-carrying  conductor  is  moved  by 
Ampere’s  force  (23.1.4)  when  it  is  placed  in  a  magnetic  field. 
The element of work $\delta W^*$ done by the Ampere force in moving
an  element,  of  length  dl,  of  a  conductor  carrying  the  current 
I  is 

$$\delta W^* = I\,d\Phi_m^* \quad \text{(in SI units)},$$

$$\delta W^* = \frac{1}{c}\,I\,d\Phi_m^* \quad \text{(in Gaussian units)},$$

where $d\Phi_m^*$ is the element of magnetic flux (23.5.5) through the
element of surface dS generated by the element of the conductor,
of length dl, in its small displacement, and c is the electrodynamic
constant (23.2.2).

23.6.2  In  a  small  displacement  of  a  current-carrying  conductor 
of finite length, the work $\delta W$ done by the Ampere forces is the
sum of the elements of work for all the elements of length of the
conductor. It is equal to the integral of $\delta W^*$ calculated along
the whole length l of the conductor. Thus

$$\delta W = \int_l I\,d\Phi_m^* = I \int_l d\Phi_m^* = I\,d\Phi_m \quad \text{(in SI units)},$$

$$\delta W = \frac{1}{c}\,I\,d\Phi_m \quad \text{(in Gaussian units)},$$

where $d\Phi_m$ is the magnetic flux through the surface generated
by the conductor of length l in a small displacement.

23.6.3 If a conductor of finite length l carries a constant current I
and is moved a finite distance, the work done by the Ampere
forces in this motion is

$$W = I\,\Phi_m,$$

where $\Phi_m$ is the magnetic flux through the surface generated
by the conductor in its motion.

23.6.4  The  work  done  by  Ampere  forces  in  the  displacement 
of  a  closed  flat  loop  carrying  the  constant  current  I  in  a  magnetic 
field from initial position 1 to final position 2 is equal to

$$W_{12} = I\,(\Phi_{m2} - \Phi_{m1}),$$

where $\Phi_{m1}$ and $\Phi_{m2}$ are magnetic fluxes “linked to the loop”
in  positions  1  and  2,  i.e.  the  magnetic  fluxes  through  a  surface 
bounded  by  the  loop.  In  calculating  these  magnetic  fluxes,  the 
direction  of  normal  n  (23.5.5)  is  coordinated  with  the  direction 
of  the  current  in  the  loop  by  the  corkscrew  rule:  from  the  head 
of  the  vector  of  the  normal,  the  current  is  seen  flowing  counter¬ 
clockwise  in  the  loop. 

23.6.5  The  formulas  given  above  are  valid  for  a  current-carrying 
coil of N turns moved in a magnetic field by Ampere forces, or,
in  general,  for  a  loop  of  arbitrary  shape.  For  example,  the  work 
done  in  a  small  displacement  of  a  coil  carrying  the  current  I 
is  equal  to 

$$\delta W = I \sum_{i=1}^{N} d\Phi_{mi} = I\,d\Big(\sum_{i=1}^{N} \Phi_{mi}\Big) = I\,d\Psi,$$

where $\Psi = \sum_{i=1}^{N} \Phi_{mi}$ is the total magnetic flux through all N
turns of the coil; it is called the flux linkage of the circuit.

23.6.6 The work required to move a conductor or closed loop
carrying  a  constant  current  in  a  magnetic  field  is  done  at  the 
expense  of  the  energy  consumed  in  the  current  source  (21.1.3). 
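The work formulas of this section can be illustrated with a flat coil turned in a uniform field. The sketch below is an assumption-laden example: it supposes a uniform field, so that the flux through one turn is B S cos(gamma), and a coil whose N turns are all linked by the same flux, so that the flux linkage is N times the flux of one turn. The names and numbers are hypothetical.

```python
import math


def flux_through_turn(b, area, gamma_deg):
    """Flux of a uniform field through a flat turn: B * S * cos(gamma)."""
    return b * area * math.cos(math.radians(gamma_deg))


def work_of_ampere_forces(current, flux_initial, flux_final, turns=1):
    """W = I * (Psi_2 - Psi_1), with Psi = N * Phi assumed for identical turns."""
    return current * turns * (flux_final - flux_initial)


b_field = 0.5   # T
area = 0.01     # m^2
current = 2.0   # A

phi_1 = flux_through_turn(b_field, area, 90.0)  # normal perpendicular to B
phi_2 = flux_through_turn(b_field, area, 0.0)   # normal along B

print(work_of_ampere_forces(current, phi_1, phi_2))             # single loop
print(work_of_ampere_forces(current, phi_1, phi_2, turns=100))  # coil of 100 turns
```

The work is positive when the linked flux increases, which is the case when the loop turns toward its position of stable equilibrium.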


CHAPTER  24  MOTION  OF  CHARGED  PARTICLES 
IN  ELECTRIC  AND  MAGNETIC  FIELDS 

24.1  Lorentz  Force 

24.1.1  Not  only  current-carrying  conductors  (23.1.4)  are  sub¬ 
ject  to  the  action  of  a  magnetic  field.  This  is  also  true  of  separate 
charged particles moving in the field. The force $\mathbf{F}_L$, acting on
an electric charge q moving in a magnetic field at the velocity
v, is called the Lorentz force and is equal to (see also Sect. 24.1.5)

$$\mathbf{F}_L = q\,[\mathbf{v}\mathbf{B}] \quad \text{(in SI units)},$$

$$\mathbf{F}_L = \frac{q}{c}\,[\mathbf{v}\mathbf{B}] \quad \text{(in Gaussian units)},$$

where q is the algebraic value of the moving charge, B is the
induction of the magnetic field in which the charge is moving
(23.1.2), and c is the electrodynamic constant (23.2.2). The
relative arrangement of vectors v, B and $\mathbf{F}_L$ is shown in
Fig. 24.1 for positive (q > 0) and negative (q < 0) charges.
The magnitude of the Lorentz force is

$$F_L = |q|\,v B \sin\alpha,$$

where $\alpha$ is the angle between vectors v and B.

24.1.2  The  Lorentz  force  is  always  perpendicular  to  the  velocity 
of  the  charged  particle  and  imparts  normal  acceleration  (1.4.6) 
to  it.  Since  the  Lorentz  force  does  not  change  the  magnitude  of 
the  velocity,  but  only  its  direction,  this  force  does  no  work  and 
the  kinetic  energy  of  the  charged  particle  remains  constant 
when  it  moves  in  a  magnetic  field. 
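That the Lorentz force does no work can be seen directly from the vector product: the force is perpendicular to the velocity, so the scalar product of the two vanishes. A minimal sketch in SI units, with hypothetical values for the charge, velocity and induction:

```python
def cross(a, b):
    """Vector product [a b] of two 3-component vectors."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def dot(a, b):
    """Scalar product of two 3-component vectors."""
    return sum(x * y for x, y in zip(a, b))


def lorentz_force(q, v, b):
    """Magnetic Lorentz force in SI units: F_L = q [v B]."""
    return tuple(q * c for c in cross(v, b))


q = -1.602e-19               # C, charge of an electron (rounded)
v = (2.0e5, -1.0e5, 3.0e4)   # m/s
b = (0.0, 0.2, 0.5)          # T

force = lorentz_force(q, v, b)
power = dot(force, v)        # rate at which the force does work, W
print(force, power)
```

Up to rounding error the computed power is zero, whatever the direction of the velocity relative to the field.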


24.1.3 Making use of the Lorentz force, magnetic induction B
(23.1.2) can be defined as follows: the magnitude of the magnetic
induction vector at a given point of the magnetic field is
equal to the maximum Lorentz force $F_{L\,\max}$ acting on a unit
positive charge, which, at the given point, travels at unit velocity.
Thus

$$B = \frac{F_{L\,\max}}{q v}.$$

Here $F_L$ is a maximum under the condition that $\alpha = \pi/2$ (24.1.2).
See also Sects. 23.1.2 and 23.4.2.

Fig. 24.1

24.1.4 The force exerted on moving charge $q_2$ by moving charge
$q_1$ is called the force of their magnetic interaction (magnetic force).
For the special case of two positive charges $q_1$ and $q_2$, travelling
in vacuum at the same velocity $v_1 = v_2 = v$ along the OX
axis, the force $F_{int}$ of magnetic interaction is a force of attraction
and is numerically equal to

$$F_{int} = \frac{\mu_0}{4\pi}\,\frac{q_1 q_2}{r^2}\,v^2 \quad \text{(in SI units)},$$

where r is the distance between the charges, and $\mu_0$ is the magnetic
constant (23.2.2).

The force $F_{int}$ of magnetic interaction can be presented in the
form

$$F_{int} = q_2 v\,\frac{\mu_0 q_1 v}{4\pi r^2} = q_2 v B,$$

where

$$B = \frac{\mu_0 q_1 v}{4\pi r^2}.$$

In this form the force $F_{int}$ coincides with the Lorentz force at
$\sin\alpha = 1$ (24.1.1) if we assume that B is the induction of the
magnetic field (23.1.2) set up by moving charge $q_1$. In its turn, this
field acts on moving charge $q_2$. Magnetic forces and a magnetic
field are the result of the way in which forces are transformed
in going over from a fixed to a moving frame of reference, and
follow from the formulas of the special theory of relativity
(5.1.1).

Comparing force $F_{int}$ with the force $F_e$ of electrostatic repulsion
between mutually fixed charges $q_1$ and $q_2$ in vacuum ($\varepsilon_r = 1$)
located at the same distance r from each other (14.2.6)

$$F_e = \frac{1}{4\pi\varepsilon_0}\,\frac{q_1 q_2}{r^2} \quad \text{(in SI units)}$$

(where $\varepsilon_0$ is the electric constant in SI units), we readily find
that

$$\frac{F_{int}}{F_e} = \frac{v^2}{c^2},$$

where c is the electrodynamic constant (23.2.2) and is related
to $\varepsilon_0$ and $\mu_0$ by the equation $\varepsilon_0 \mu_0 = 1/c^2$ (Appendix I). When
the velocity of the charges is small compared to the velocity of
light in free space ($v \ll c$), the magnetic interaction between
the  moving  charges  is  considerably  less  than  their  electrostatic 
interaction.  But  in  the  case  when  the  charges  are  moving  in  an 
electrically  neutral  conductor,  the  electric  forces  are  compen¬ 
sated  (16.4.2),  leaving  only  magnetic  interaction.  This  explains 
the  magnetic  interaction  of  current-carrying  conductors  (23.4.1). 
Though  the  force  of  magnetic  interaction  between  each  pair 
of  electrons  is  small,  the  number  of  pairs  is  so  immense  that  the 
resultant  force  of  magnetic  interaction  of  parallel  current- 
carrying  conductors  is  an  appreciable  amount  (23.4.1). 
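The smallness of the magnetic interaction relative to the electrostatic one can be made concrete. The sketch below evaluates both SI formulas of this subsection for two elementary charges; the separation, the velocities and the function names are illustrative assumptions.

```python
import math

MU_0 = 4 * math.pi * 1e-7            # magnetic constant, H/m
C = 299_792_458.0                    # electrodynamic constant (speed of light), m/s
EPS_0 = 1.0 / (MU_0 * C ** 2)        # electric constant from eps_0 * mu_0 = 1 / c^2


def magnetic_force(q1, q2, v, r):
    """F_int = (mu_0 / 4 pi) * q1 * q2 * v^2 / r^2 (SI units)."""
    return MU_0 / (4 * math.pi) * q1 * q2 * v ** 2 / r ** 2


def electrostatic_force(q1, q2, r):
    """F_e = q1 * q2 / (4 pi eps_0 r^2) (SI units)."""
    return q1 * q2 / (4 * math.pi * EPS_0 * r ** 2)


e = 1.602e-19   # C, elementary charge (rounded)
r = 1.0e-9      # m, hypothetical separation

for v in (1.0e-4, 1.0e3, 1.0e6):  # m/s; the first value is typical of drift velocities
    ratio = magnetic_force(e, e, v, r) / electrostatic_force(e, e, r)
    print(v, ratio, (v / C) ** 2)
```

The two printed ratios agree for every velocity, and for drift velocities of the order of 1e-4 m/s the magnetic force per pair is smaller than the electrostatic one by roughly 25 orders of magnitude.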

24.1.5  If,  in  addition  to  a  magnetic  field  of  induction  B,  a 
moving  electric  charge  is  also  subject  to  the  action  of  an  electric 
field  of  strength  E  (15.1.2),  the  resultant  force  F  applied  to  the 
charge is equal to the vector sum of the force $\mathbf{F}_e = q\mathbf{E}$, exerted
on the charge by the electric field, and the Lorentz force (24.1.1):

$$\mathbf{F} = q\mathbf{E} + q\,[\mathbf{v}\mathbf{B}] \quad \text{(in SI units)},$$

$$\mathbf{F} = q\mathbf{E} + \frac{q}{c}\,[\mathbf{v}\mathbf{B}] \quad \text{(in Gaussian units)}.$$

The last expression is also called the Lorentz force and,
sometimes, the generalized Lorentz force or the Lorentz formula.

24.1.6 Under the action of the Lorentz force (24.1.1), a charged
particle  travels  in  a  circular  path  of  constant  radius  r  in  a  uni¬ 
form  magnetic  field  perpendicular  to  the  particle’s  velocity. 
This  circular  path  is  in  a  plane  perpendicular  to  vector  B, 
and  the  Lorentz  force  is  a  centripetal  one  (2.4.3).  The  radius 
of  the  circular  path  is 


$$r = \frac{m v}{|q|\,B} \quad \text{(in SI units)},$$

$$r = \frac{m c v}{|q|\,B} \quad \text{(in Gaussian units)},$$


where $|q|$ is the absolute value of the particle’s charge, m is
the mass of the particle, v is its velocity, B is the magnetic field
induction  and  c  is  the  electrodynamic  constant  (23.2.2).  If 
the  particle  travels  in  the  plane  of  the  drawing  (Fig.  24.2),  its 
deflection  in  a  field  perpendicular  to  the  velocity  and  directed 
from  behind  the  drawing  depends  upon  the  sign  of  the  charge. 
This  is  the  basis  for  determining  the  sign  of  the  charge  of  a 
particle  travelling  in  a  magnetic  field. 

The  period  T  of  revolution  (1.5.5)  of  a  charged  particle  in  a 
uniform  magnetic  field  (23.1.3)  is  independent  of  its  velocity 
(when $v \ll c$):

$$T = \frac{2\pi m}{B\,|q|} \quad \text{(in SI units)},$$

$$T = \frac{2\pi m c}{B\,|q|} \quad \text{(in Gaussian units)}.$$

The principle of cyclic orbit accelerators of charged particles
(24.4.6) is based on the aforesaid.
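The independence of the period from the velocity is easy to confirm numerically. The sketch below uses the SI formulas of this subsection for a proton; the rounded constants, the field of 1 T and the two speeds are illustrative assumptions.

```python
import math


def orbit_radius(m, v, q, b):
    """Radius of the circular path: r = m v / (|q| B), SI units, v << c."""
    return m * v / (abs(q) * b)


def revolution_period(m, q, b):
    """Period of revolution: T = 2 pi m / (B |q|), independent of v."""
    return 2 * math.pi * m / (abs(q) * b)


M_PROTON = 1.6726e-27  # kg (rounded)
E_CHARGE = 1.602e-19   # C (rounded)
B_FIELD = 1.0          # T, hypothetical

period = revolution_period(M_PROTON, E_CHARGE, B_FIELD)
for speed in (1.0e5, 1.0e6):
    radius = orbit_radius(M_PROTON, speed, E_CHARGE, B_FIELD)
    # The circumference divided by the speed reproduces the same period.
    print(speed, radius, 2 * math.pi * radius / speed, period)
```

A tenfold increase in speed gives a tenfold larger radius, so the time for one revolution stays the same.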

24.1.7 If the velocity vector v of a charged particle makes
the angle $\alpha$ with the direction of vector B of a uniform magnetic
field, the particle travels along a helix (Fig. 24.3) of radius r
and pitch h equal to

$$r = \frac{m v \sin\alpha}{|q|\,B}, \qquad h = \frac{2\pi m}{B\,|q|}\,v\cos\alpha \quad \text{(in SI units)},$$

$$r = \frac{m c v \sin\alpha}{|q|\,B}, \qquad h = \frac{2\pi m c}{B\,|q|}\,v\cos\alpha \quad \text{(in Gaussian units)}.$$


If  this  motion  occurs  in  a  nonuniform  magnetic  field  (23.1.3) 
whose  magnetic  induction  increases  in  the  direction  of  motion 
of  the  particle,  then  r  and  h  decrease  with  an  increase  in  B. 
This  phenomenon  is  used  for  focusing  charged  particles  in  a 
magnetic  field. 
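The helix parameters and their dependence on B can be sketched as follows. The example uses the SI formulas of this subsection for an electron; the rounded constants, the speed, the angle and the field values are hypothetical.

```python
import math


def helix_radius(m, v, q, b, alpha):
    """r = m v sin(alpha) / (|q| B), SI units; alpha in radians."""
    return m * v * math.sin(alpha) / (abs(q) * b)


def helix_pitch(m, v, q, b, alpha):
    """h = (2 pi m / (B |q|)) * v cos(alpha), SI units; alpha in radians."""
    return 2 * math.pi * m / (abs(q) * b) * v * math.cos(alpha)


M_ELECTRON = 9.109e-31  # kg (rounded)
E_CHARGE = 1.602e-19    # C (rounded)
speed = 1.0e7           # m/s
alpha = math.radians(30.0)

for b_field in (0.01, 0.02, 0.04):  # T, increasing induction
    r = helix_radius(M_ELECTRON, speed, E_CHARGE, b_field, alpha)
    h = helix_pitch(M_ELECTRON, speed, E_CHARGE, b_field, alpha)
    print(b_field, r, h)
```

Both r and h are inversely proportional to B at a fixed angle, so each doubling of the induction halves the radius and the pitch, which is the trend behind magnetic focusing.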


24.2 Hall Effect

24.2.1  The  Hall  effect  refers  to  the  setting  up  of  a  transverse 
electric  field  and  potential  difference  in  a  current-carrying  metal 
or  semiconductor  placed  in  a  magnetic  field  which  is  perpen¬ 
dicular  to  the  direction  of  the  current  density  vector  (20.2.3). 
The  Hall  effect  results  from  the  deflection  of  electrons  moving 
in  a  magnetic  field  by  the  action  of  the  Lorentz  force  (24.1.1). 
Shown in Fig. 24.4a are the directions of the magnetic induction
B, current density j, electron velocity v, Lorentz force $\mathbf{F}_L$ and
the signs of the charges accumulated on the opposite, upper and
lower, sides in the case of a metal or electron (n-type) semiconductor
(41.10.2). In a hole (p-type) semiconductor (41.10.3)
the signs of the charges on the surfaces are the opposite of those
in the preceding case (Fig. 24.4b). The charges are deflected by
the magnetic field until the action of the force in the transverse
electric field counterbalances the Lorentz force.

Fig. 24.4 (a), (b)

24.2.2  The  equilibrium  potential  difference  in  the  Hall  effect  is 

$$\Delta V = V_1 - V_2 = R_H\,\frac{I B}{d},$$

where I is the current, B is the magnetic field induction (23.1.2),
d is the linear dimension of the metal or semiconductor in the
direction of vector B, and $R_H$ is the Hall coefficient.

The strength $\mathbf{E}_H$ of the transverse electric field in the Hall
effect is

$$\mathbf{E}_H = R_H\,[\mathbf{B}\mathbf{j}],$$

where j is the current density vector.

24.2.3 For metals and extrinsic semiconductors (41.10.5),
having a single kind of conduction, the Hall coefficient equals

$$R_H = \frac{A}{n_0 q} \quad \text{(in SI units)},$$

$$R_H = \frac{A}{c\,n_0 q} \quad \text{(in Gaussian units)},$$


where  c  is  the  electrodynamic  constant  (23.2.2),  q  and  n0  are 
the charge and concentration of the current carriers, and $A \approx 1$
is a dimensionless factor depending upon how the current-carrier
velocities are statistically distributed. The sign of the
Hall coefficient coincides with that of the current-carrier charge
q. A measurement of the Hall coefficient of a semiconductor
indicates its type of electrical conduction. For an n-type semiconductor
(electron conduction) (41.10.2), $q = -e$ and $R_H < 0$;
for a p-type semiconductor (hole conduction) (41.10.3), $q = e$
and $R_H > 0$.

If both types of electrical conduction are observed in a semiconductor,
the predominant conduction can be determined from
the sign of the Hall coefficient. The formula for $R_H$ given in
this subsection is inapplicable in such cases.
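The single-carrier formulas for the Hall coefficient and the Hall potential difference can be tried out numerically. This is a sketch in SI units with A taken equal to 1; the electron concentration used for copper (about 8.5e28 per cubic metre) is an approximate literature value assumed here, and the strip dimensions, current and field are hypothetical.

```python
E_CHARGE = 1.602e-19  # C, elementary charge (rounded)


def hall_coefficient(n0, q, a=1.0):
    """R_H = A / (n0 * q), SI units; valid for a single kind of carrier."""
    return a / (n0 * q)


def hall_voltage(r_h, current, b, d):
    """Delta V = R_H * I * B / d, with d measured along the direction of B."""
    return r_h * current * b / d


def carrier_concentration(r_h, q, a=1.0):
    """Inverts the Hall coefficient formula: n0 = A / (R_H * q)."""
    return a / (r_h * q)


n_copper = 8.5e28                            # m^-3, approximate assumed value
r_h = hall_coefficient(n_copper, -E_CHARGE)  # electrons: q = -e
delta_v = hall_voltage(r_h, current=10.0, b=1.0, d=1.0e-4)

print(r_h, delta_v)
print(carrier_concentration(r_h, -E_CHARGE))
```

The negative sign of the coefficient reflects electron conduction. Even with a current of 10 A and a field of 1 T the potential difference in a metal strip is only of the order of microvolts, because the carrier concentration is so high.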

24.2.4 Provided the type of electrical conduction is known,
a measurement of the Hall coefficient enables the concentration
$n_0$ of the current carriers to be determined. For example,
the concentration of conduction electrons equals the concentration
of atoms in monovalent metals. This means that there
is one free electron per atom in the electron gas (20.3.1) of the
metal. A known concentration of current carriers enables the
mean free path $\langle\lambda\rangle$ of the electron in the metal to be assessed.
From the formula of (20.3.4),

.  _  2 xm  ( u ) 


we find that $\langle\lambda\rangle \approx 10^{-8}$ m, which exceeds the lattice constants
of the metal by two orders of magnitude (see also Sects. 20.3.7
and 41.5.5).


24.3 Charge-to-Mass Ratio of Particles. Mass
Spectroscopy

24.3.1 The charge-to-mass ratio of a particle, q/m, is a characteristic
of charged particles. Experimental determination of this
charge-to-mass ratio is based on measuring the deflection of the
particles in electric and magnetic fields acting jointly on them.
The mass of the particles is determined from the charge-to-mass
ratio q/m and the known charge q.

24.3.2 The mass spectrum of particles is the set of their mass
values. Applying mass spectrometers and mass spectrographs,
special instruments used in mass spectrometry, the relative
concentrations of isotopes of chemical elements (42.1.3) and
their masses can be measured with exceptionally high accuracy.

Fig. 24.5

24.3.3 In the Aston mass spectrograph (Fig. 24.5), which was
employed to discover the isotopes of various chemical elements,
the particles are deflected in the uniform electric field of capacitor
C  and  the  magnetic  field  of  coil  M,  the  fields  being  perpen¬ 
dicular  to  each  other.  The  lower  the  velocity  of  the  particles 
and  the  greater  their  charge-to-mass  ratio,  the  more  strongly 
they  are  deflected  toward  the  capacitor  plates.  In  a  uniform 
magnetic  field  with  the  induction  B  directed  into  the  drawing, 
the  particles  travel  along  circular  arcs  (24.1.6).  The  higher  the 
velocity  of  the  particles  and  the  lower  their  charge-to-mass  ratio, 
the  greater  the  radii  of  these  arcs.  In  a  magnetic  field  a  beam 
of  particles  is  split  into  several  beams,  each  corresponding  to 
a  definite  charge-to-mass  ratio.  The  magnetic  field  focuses  par¬ 
ticles  having  different  velocities,  but  the  same  charge-to-mass 
ratio,  on  a  single  point.  The  magnetic  induction  B  is  chosen  so 
that  the  charged  particles  are  focused  on  a  photographic  plate 
AD,  which  is  perpendicular  to  the  plane  of  the  drawing.  A 
series  of  narrow  parallel  lines  obtained  on  the  plate  corresponds 
to  the  various  values  of  the  particle  charge-to-mass  ratio.  In 
Fig. 24.5, line $S_1$ corresponds to particles with a higher charge-to-mass
ratio, and line $S_2$ to particles with a lower one. If
the charge-to-mass ratio of the particles of line $S_1$ is known, as
well as the distance between lines $S_1$ and $S_2$ and the parameters
of the instrument, the charge-to-mass ratio of the particles
corresponding to line $S_2$ can be determined.

If  the  parallel-plate  capacitors  are  replaced  by  tubular  ones 
and  special  electromagnets  are  used  that  produce  a  plane-paral¬ 
lel  beam  of  particles  entering  the  magnetic  field,  we  can  achieve 
double  focusing  of  the  particles  with  respect  to  both  energy  and 
direction.  With  sufficient  intensity  of  the  lines  on  the  photo¬ 
graphic  plate,  this  technique  ensures  high  precision  in  particle 
mass measurements. For instance, the masses of the ions of
light-weight elements can be measured to an accuracy within
$10^{-4}$ per cent.

24.3.4 The relative concentrations of the isotopes of chemical
elements  in  their  natural  mixtures  are  measured  by  instruments 
called  mass  spectrometers  that  electrically  record  the  ion  currents. 
Used  in  mass  spectrometers  are  ion  beams  in  which  the  kinetic 
energies of the ions are close in value. These are called monochromatic
(monoenergetic) beams and are produced by ion
sources  of  special  design.  Even  when  they  are  strongly  divergent 
and  contain  a  great  number  of  ions,  such  beams  focus  well  in 
a  transverse  magnetic  field.  This  raises  the  precision  with  which 
the  concentrations  of  various  isotopes  can  be  measured. 


24.4  Charged  Particle  Accelerators 

24.4.1  Special  installations  for  obtaining  directed  beams  of 
charged  particles  (electrons,  protons,  atomic  nuclei  and  ions 
of  chemical  elements)  having  an  extremely  high  kinetic  energy, 
under  laboratory  conditions  are  called  accelerators.  According 
to  the  shape  of  the  path  of  the  particles  and  the  mechanism  for 
accelerating  them,  distinction  is  made  between  linear,  cyclic 
and induction accelerators. In linear accelerators (also called linacs)
the paths of motion of the particles are close to straight
lines;  in  the  cyclic  and  induction  accelerators  the  paths  of  the 
particles  are  circles  or  unwinding  spirals. 

24.4.2  The  energy  of  the  accelerated  particles  is  increased 
by  the  electric  field  set  up  in  the  accelerator.  Depending  on  the 
type  of  accelerator,  this  field  can  be  electrostatic  (15.1.2), 
induced  (25.1.2)  or  high-frequency  alternating.  The  principle 
of  the  betatron,  based  on  the  action  of  an  induced  electric  field, 
is discussed in Sect. 27.2.3. In a linear electrostatic accelerator,
a charged particle passes only once through an accelerating
electric field with a potential difference $(V_1 - V_2)$. If q is the
charge of the particle, the energy acquired by the particle in
the accelerator (15.2.7) is equal to

$$E = q\,(V_1 - V_2).$$

In such an accelerator the electric field is set up, for example,
by a Van de Graaff high-voltage electrostatic generator, operating
by the principle of multiple transfer of charges to a hollow
conductor (16.4.3). This raises the potential of the conductor
to values limited only by the leakage of the charge from the
conductor.

24.4.3 In linear resonant accelerators, the energy of the
charged particles is increased by an alternating electric
field of super-high frequency, which varies in synchronism
with the motion of the particles. Using this type of
accelerator, electrons can be accelerated to an energy of
the order of tens of GeV over a path length of several
kilometres.

Fig. 24.6

24.4.4  Protons,  deuterons  and  other,  heavier,  charged  par¬ 
ticles  are  accelerated  by  resonant  cyclic  accelerators,  in  which  a 
particle  repeatedly  passes  through  an  electric  field  synchronized 
with  its  motion,  increasing  its  energy  each  time.  A  strong  trans¬ 
verse  magnetic  field  controls  the  motion  of  the  particles  and 
returns  them  periodically  to  the  region  of  the  accelerating  elec¬ 
tric  field.  The  particles  pass  definite  points  of  the  alternating 
electric  field  approximately  when  the  field  is  in  the  same  phase 
(“in  resonance”). 

24.4.5  In  the  cyclotron,  the  simplest  resonant  cyclic  accelerator, 
an  accelerating  alternating  electric  field  is  set  up  in  the  gap 
between the two halves (dees) N and M (Fig. 24.6) of a cylindrical
box. The dees are placed in a flat closed chamber located
between the poles of a strong electromagnet, whose magnetic
field  is  perpendicular  to  the  plane  of  the  drawing.  The  alternat¬ 
ing  electric  field  in  the  gap  between  the  dees  is  set  up  by  an 
electric  generator  whose  terminals  are  connected  to  electrodes 
m  and  n. 

24.4.6  A  particle  is  accelerated  in  the  gap  between  the  dees  M 
and  N  each  time  it  crosses  this  gap  after  describing  semicircles 
of  continuously  increasing  radii  due  to  the  action  of  the  magnet¬ 
ic  field.  The  same  time  is  required  for  each  semicircle  (24.1.6). 
To  continuously  accelerate  the  particles  it  is  necessary  to  com¬ 
ply  with  the  condition  of  synchronism  (condition  of  “resonance”): 
$T = T_0$, where T is the period of revolution of the particle in
the magnetic field (24.1.6), and $T_0$ is the period of oscillation
of the accelerating electric field (28.1.2). This condition is violated
at relativistic velocities v of the particle, commensurate with the
velocity  of  light  c  in  free  space.  At  such  high  velocities,  the 
mass  m  of  the  particle  increases  with  the  velocity  (5.6.1)  as 
does  the  period  T  (24.1.6). 
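The loss of synchronism can be illustrated numerically. The sketch below is only an illustration under stated assumptions: it takes the period of revolution in the form $T = 2\pi m/(|q|B) = 2\pi E/(|q|Bc^2)$ with $E$ the total energy, uses rounded constants for the proton, and picks an arbitrary field of 1.5 T. The function name `revolution_period` is hypothetical.

```python
import math

E_CHARGE = 1.602176634e-19   # elementary charge, C
M_PROTON = 1.67262192e-27    # proton rest mass, kg (rounded)
C_LIGHT = 2.99792458e8       # velocity of light in free space, m/s

def revolution_period(kinetic_energy_mev, b_tesla,
                      rest_mass=M_PROTON, charge=E_CHARGE):
    """Period T = 2*pi*E / (|q| B c^2), with E = rest energy + kinetic energy."""
    rest_energy = rest_mass * C_LIGHT**2                           # J
    total_energy = rest_energy + kinetic_energy_mev * 1e6 * E_CHARGE  # J
    return 2.0 * math.pi * total_energy / (abs(charge) * b_tesla * C_LIGHT**2)

B_FIELD = 1.5  # T, assumed value for illustration only
t_rest = revolution_period(0.0, B_FIELD)
for e_k in (1, 10, 100, 680):
    t = revolution_period(e_k, B_FIELD)
    print(f"E_k = {e_k:4d} MeV  T = {t:.3e} s  T/T(0) = {t / t_rest:.3f}")
```

At kinetic energies of a few MeV the period is nearly constant, whereas at hundreds of MeV it grows appreciably, so a generator of fixed period $T_0$ can no longer stay in step with the proton.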

24.4.7 The feasibility of accelerating charged particles travelling
at relativistic velocities in cyclic accelerators follows from the
principle of phase stability (autophasing), which states that each
deviation of the period $T$ from the resonance value $T_0$ (24.4.6)
leads to such a change in the energy $E$ of the particle at each
acceleration that $T$ varies in the neighborhood of $T_0$ and, on
an average, remains equal to $T_0$. Thus

$$T_0 = \frac{2\pi m}{|q|B} = \frac{2\pi E}{|q|Bc^2},$$

where $E = mc^2$ (5.7.2), $m = m_0/\sqrt{1 - v^2/c^2}$ (5.6.1), $m_0$
is the rest mass (5.6.1) of the particle, and $c$ is the velocity of
light in free space; the remaining notation is the same as in Sect.
24.1.6.

If, for example, as a result of the increase in mass $m$ and the
period $T$, the particle enters the gap between the dees (Fig. 24.6)
when the electric field is decelerating instead of accelerating,
the reduction in particle velocity leads to a reduction in $T$, and
the equality $T = T_0$ is reached again.

24.4.8 It follows from the principle of phase stability (24.4.7)
that at a sufficiently slow increase in the period $T_0$ of the
electric field, the period $T$ of revolution of the particle in the
magnetic field of the accelerator increases correspondingly. This
leads to an increase in the average energy $\langle E \rangle$ of all
the particles, because at constant induction of the magnetic field
any increase in $T$ is possible only owing to the increase in mass
as the velocity of the particles increases.

This principle has been applied in an accelerator called the
synchrocyclotron (also frequency-modulated cyclotron and phasotron).
This device has a constant magnetic field, whereas the frequency
$\nu_0 = 1/T_0$ (28.1.2) of the alternating electric field varies
slowly with the period $\tau \gg T_0$. The radius of the orbit of the
particles in the synchrocyclotron increases with their velocity
(24.1.6). This requires a corresponding increase in the size of the
synchrocyclotron to reach the maximum energy of the charged
particles. The synchrocyclotron built and operated in the USSR,
accelerating protons to an energy of 680 MeV, has a magnet with
poles 6 m in diameter and a mass of $7 \times 10^3$ metric tons.

24.4.9 The synchrotron is an accelerator in which the frequency
of the accelerating electric field is constant, and the induction
$B$ of the magnetic field varies with time. The period of revolution
of a particle in the magnetic field of a synchrotron (24.4.6) is

$$T = \frac{2\pi E}{|e|c^2 B},$$

where $e$ is the charge of the electron, and $E$ is its energy. The
condition of synchronism (24.4.6) is complied with in a synchrotron
at $T_0 = \text{const}$, provided that the induction of the magnetic
field increases proportionally to the particle’s energy. Thus

$$B = \frac{2\pi E}{|e|c^2 T_0},$$

where $T_0$ is the period of the high-frequency accelerating electric
field.

The following condition is complied with in a synchrotron:

$$\frac{m}{B} = \frac{|e|T_0}{2\pi} = \text{const}.$$

The particles travel along orbits close to circular ones (24.1.6)
and, consequently, a synchrotron is equipped with annular magnets
that set up a magnetic field in a comparatively narrow region of
the circular orbit.

24.4.10 A proton synchrotron, the most powerful accelerator of
protons, combines the principles applied in the synchrocyclotron
(24.4.8) and in the synchrotron (24.4.9). In this accelerator, the
frequency $\nu_0$ of the accelerating electric field is reduced and
the induction $B$ of the magnetic field is increased simultaneously
and in coordination. The protons being accelerated travel along a
circular orbit of constant radius. Therefore, the magnetic field is
set up by an annular electromagnet as in the synchrotron.

24.4.11 The condition for simultaneous maintenance of vertical
(axial) and radial stability of the design circular orbit in the
synchrotron and proton synchrotron is the variation of magnetic
induction $B$ near the design orbit according to the law

$$B = \frac{\text{const}}{r^n},$$

where $r$ is the distance from the centre of the orbit, and $n$ varies
within the limits $0 < n < 1$ (condition of weak focusing in the
accelerator). With an increase in the maximum energy $E_{\max}$
acquired by the particles in an accelerator having weak focusing,
the required mass of the electromagnet increases approximately in
proportion to $E_{\max}$.

24.4.12 Accelerators having strong focusing are resorted to for
increasing the maximum energy $E_{\max}$ of particles accelerated
in synchrotrons and proton synchrotrons. In these accelerators,
two kinds of sector-shaped magnets are used alternately along
the almost circular orbit of particle travel. In one kind of sector
the magnetic field varies according to the law of Sect. 24.4.11,
in which $n$ is much less than zero (for example, $n = -100$),
and in the other kind $n \gg 1$. Sectors of the first kind provide
for radial focusing of the beam of particles being accelerated;
sectors of the second kind, for vertical focusing. The application
of strong focusing enables substantial reductions to be obtained
in the overall size of the accelerator, the mass of the
electromagnet and the total cost of the installation. This method
is also called alternating-gradient focusing.

24.4.13 To increase the share of energy utilized by accelerated
particles for various nuclear reactions (42.9.1), the bombardment
of stationary targets by high-energy particles is replaced by the
colliding beam technique. It follows from the laws of conservation
of energy (3.4.3) and momentum (2.7.1) that in bombarding a
stationary target, the share of the kinetic energy $E_k$ of the
incident particle, utilized in the nuclear reaction, decreases as
$E_k$ is increased. When the technique of colliding beams is used,
the total momentum of the particles after collision is reduced and
the share of their useful energy is increased. Assume, for example,
that in an accelerator with colliding beams of protons the energy
of the protons in each beam is equal to 20 GeV. The total momentum
of two protons colliding at equal and opposing velocities equals
zero. But the collision energy of such two particles reaches 50 GeV.
To obtain the same collision energy in bombarding a stationary
hydrogen target, a beam of protons with energy of the order of
1400 GeV is required.
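The order of magnitude quoted for the stationary target can be checked with the relativistic invariant $s = (E_1 + E_2)^2 - (\mathbf{p}_1 + \mathbf{p}_2)^2$ (units with $c = 1$). This invariant is not written out in the text above and is brought in here as an assumption, together with a rounded proton rest energy of 0.938 GeV; the function names are hypothetical. For a proton of total energy $E$ striking a proton at rest, $s = 2m^2 + 2mE$.

```python
M_PROTON_GEV = 0.938272  # proton rest energy, GeV (rounded)

def fixed_target_total_energy(collision_energy_gev, m_gev=M_PROTON_GEV):
    """Total beam energy giving the same sqrt(s) on a proton at rest.

    Uses s = 2 m^2 + 2 m E, solved for E, with c = 1.
    """
    s = collision_energy_gev**2
    return (s - 2.0 * m_gev**2) / (2.0 * m_gev)

def fixed_target_kinetic_energy(collision_energy_gev, m_gev=M_PROTON_GEV):
    """Kinetic energy of the beam proton for the same collision energy."""
    return fixed_target_total_energy(collision_energy_gev, m_gev) - m_gev

e_beam = fixed_target_kinetic_energy(50.0)
print(f"Equivalent fixed-target kinetic energy: {e_beam:.0f} GeV")
```

For a collision energy of 50 GeV this gives roughly $1.3 \times 10^3$ GeV, which agrees in order of magnitude with the figure quoted above.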


CHAPTER  25  ELECTROMAGNETIC  INDUCTION  * 


25.1 Basic Law of Electromagnetic Induction

25.1.1 The phenomenon of electromagnetic induction is the setting
up of an induced electric field in a conducting loop, or circuit,
located in a varying magnetic field. The energy measure of this
induced field is the electromotive force $\mathcal{E}_i$ of
electromagnetic induction. If the loop is closed, the induced
electric field causes ordered motion of electrons in the loop, i.e.
a current, called the induced current, is produced.

25.1.2 Faraday's law of electromagnetic induction states that the
emf of electromagnetic induction in a loop, or circuit, is
proportional to the rate of change of the magnetic flux $\Phi_m$
(23.5.5) through the surface area bounded by this loop. Thus

$$\mathcal{E}_i = -\frac{d\Phi_m}{dt} \quad \text{(in SI units)},$$

$$\mathcal{E}_i = -\frac{1}{c}\frac{d\Phi_m}{dt} \quad \text{(in Gaussian units)}.$$


*  In  all  the  formulas  written  in  Gaussian  units,  c  is  the  electro¬ 
dynamic  constant  (23.2.2). 



The cause of the change in the magnetic flux is immaterial. It may
be due to distortion of the loop or its motion in the external
magnetic field or to any other reasons for the change in the
magnetic field with time. When a closed conducting circuit (or
loop) moves in a magnetic field, emf’s are induced in all of the
parts of the loop that cut the lines of magnetic induction. The
algebraic sum of these emf’s (25.1.3) is equal to the total emf of
the loop. The work that must be done to move a closed loop, or
circuit, in a magnetic field is equal to the work of the current
induced in the loop.

In calculating $\Phi_m$ and $\mathcal{E}_i$, the direction around
the loop (21.3.2) and that of the outward normal $\mathbf{n}$ (23.5.5
and 23.5.8) are to be coordinated so that from the head of vector
$\mathbf{n}$ the direction around the circuit (or loop) is seen to be
counterclockwise. If a closed circuit consists of $N$ turns
connected in series (for example, the solenoid in Sect. 23.3.7),
then the magnetic flux $\Phi_m$ in Faraday’s law is to be replaced
by the flux, or magnetic, linkage $\Psi$ of the circuit (23.6.5):

$$\mathcal{E}_i = -\frac{d\Psi}{dt} \quad \text{(in SI units)},$$

$$\mathcal{E}_i = -\frac{1}{c}\frac{d\Psi}{dt} \quad \text{(in Gaussian units)}.$$
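As an illustration of Faraday's law with flux linkage, the sketch below takes a hypothetical coil of $N$ turns of area $S$ in a uniform field varying as $B(t) = B_0 \sin \omega t$, so that $\Psi = N B_0 S \sin \omega t$ (SI units). The coil parameters and the function names are assumptions made for the example only. The emf obtained by numerical differentiation of $\Psi$ is compared with the analytic derivative.

```python
import math

def flux_linkage(t, n_turns, b0, area, omega):
    """Flux linkage Psi = N * B(t) * S for a field normal to the turns."""
    return n_turns * b0 * area * math.sin(omega * t)

def emf_numeric(t, n_turns, b0, area, omega, dt=1e-7):
    """emf = -dPsi/dt estimated with a central difference."""
    psi_plus = flux_linkage(t + dt, n_turns, b0, area, omega)
    psi_minus = flux_linkage(t - dt, n_turns, b0, area, omega)
    return -(psi_plus - psi_minus) / (2.0 * dt)

def emf_analytic(t, n_turns, b0, area, omega):
    """emf = -N * B0 * S * omega * cos(omega * t)."""
    return -n_turns * b0 * area * omega * math.cos(omega * t)

# Assumed illustrative values: 200 turns, 0.1 T, 100 cm^2, 50 Hz.
N_TURNS, B0, AREA, OMEGA = 200, 0.1, 1e-2, 2.0 * math.pi * 50.0
for t in (0.0, 2.5e-3, 5.0e-3):
    print(t, emf_numeric(t, N_TURNS, B0, AREA, OMEGA),
          emf_analytic(t, N_TURNS, B0, AREA, OMEGA))
```

The minus sign shows up directly: while the flux linkage is growing (near $t = 0$), the emf is negative.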

25.1.3 The emf of electromagnetic induction in a loop (or circuit)
is said to be positive if the magnetic moment vector $\mathbf{p}_m$
of the corresponding induced current (23.3.4) makes an acute angle
with the lines of magnetic induction of the field that induces this
current. Otherwise, $\mathcal{E}_i$ is said to be negative. In
Fig. 25.1a, the emf $\mathcal{E}_i < 0$, whereas in Fig. 25.1b,
$\mathcal{E}_i > 0$.

25.1.4 The minus sign in the formula for $\mathcal{E}_i$ complies
with the Lenz law, which states that upon any change in the magnetic
flux through a surface bounded by a closed loop, the current induced
in the loop is always of a direction in which the magnetic field set
up by the induced current opposes the change in the magnetic flux
that induced the current.

25.1.5 An emf of electromagnetic induction is developed in a part
of a conductor which, in its motion, cuts lines of induction of a
magnetic field (23.1.3). In the case illustrated in Fig. 25.2, the
conduction electrons of the metal (20.3.1) are subject to the
Lorentz force (24.1.1). Thus
$\mathbf{F}_L = -e[(\mathbf{v} + \mathbf{v}') \times \mathbf{B}]$,
where $\mathbf{v}$ is the velocity of the part AC of the conductor
in the magnetic field, whose induction vector $\mathbf{B}$ is
perpendicular to the plane passing through the part of the conductor
and the velocity vector of its motion. The electrons drift along the
conductor at the velocity $\mathbf{v}'$ under the action of the
component of the Lorentz force tangent to the conductor in the
direction from A to C. The motion of the electrons ceases when the
electrostatic field set up in the conductor and acting on the
electrons with the force $e\mathbf{E}$ counterbalances the Lorentz
force.

Fig. 25.2

According to Ohm’s law for an open circuit (at $I = 0$) (21.2.5),
the equilibrium potential difference $\Delta V = V_A - V_C$ between
points A and C that is established at $v' = 0$ is equal to

$$\Delta V = V_A - V_C = -\mathcal{E}_i,$$

where $\mathcal{E}_i$ is the electromotive force of induction,
because there are no sources of electrical energy in part AC of the
conductor. The emf of electromagnetic induction in a piece of
conductor of length $l$, moving at the velocity $v$, is

$$\mathcal{E}_i = -Bvl,$$

or

$$\mathcal{E}_i = -\frac{d\Phi_m}{dt} \quad \text{(in SI units)},$$

where $d\Phi_m/dt$ is the ratio of the magnetic flux through the
surface generated by the conductor in its motion during an
infinitesimal length of time to the value $dt$ of this length of
time or, in other words, the rate at which the conductor cuts
the lines of induction of the magnetic field (cf. Sect. 25.1.2).
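The equivalence of the two expressions for the emf in a moving conductor can be seen in a short numerical sketch. The values of $B$, $v$ and $l$ below are arbitrary assumptions, and the function names are hypothetical. In the time $dt$ the conductor sweeps the area $l\,v\,dt$, so the flux through the swept surface is $d\Phi_m = B\,l\,v\,dt$.

```python
def motional_emf(b, v, length):
    """emf = -B v l for a straight conductor moving across the field (SI)."""
    return -b * v * length

def emf_from_swept_flux(b, v, length, dt=1e-6):
    """emf = -dPhi/dt, where dPhi is the flux through the area swept in dt."""
    swept_area = length * v * dt      # area generated by the conductor
    d_phi = b * swept_area            # flux through that area
    return -d_phi / dt

# Assumed values: B = 0.5 T, v = 2 m/s, l = 0.1 m.
print(motional_emf(0.5, 2.0, 0.1))
print(emf_from_swept_flux(0.5, 2.0, 0.1))
```

Both calls give the same emf of magnitude 0.1 V, as expected from the formulas above.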

25.1.6 The phenomenon of electromagnetic induction in stationary
closed conducting circuits, located in an external varying magnetic
field, cannot be explained by means of the Lorentz force, which has
no effect on fixed charges (24.1.1).

The phenomenon of electromagnetic induction in stationary
conductors is due to the fact that the varying magnetic field
excites a rotational electric field. The circulation of the strength
vector of this field along the closed path $L$ of the conductor
(16.1.4) is the emf of electromagnetic induction. Thus

$$\mathcal{E}_i = \oint_L \mathbf{E}\,d\mathbf{l} = -\frac{\partial \Phi_m}{\partial t} \quad \text{(in SI units)},$$

$$\mathcal{E}_i = \oint_L \mathbf{E}\,d\mathbf{l} = -\frac{1}{c}\frac{\partial \Phi_m}{\partial t} \quad \text{(in Gaussian units)},$$

where the partial derivative $\partial \Phi_m/\partial t$ takes into
account the dependence of the magnetic induction flux only on time.
The choice of the direction of normal $\mathbf{n}$ in calculating the
magnetic flux is discussed in Sect. 25.1.2.

25.1.7 The magnitude $q$ of the electric charge passing through
the cross section of the wire of the turn in which the current is
induced is

$$q = \frac{\Phi'_m - \Phi''_m}{R} \quad \text{(in SI units)},$$

$$q = \frac{1}{c}\,\frac{\Phi'_m - \Phi''_m}{R} \quad \text{(in Gaussian units)},$$

where $\Phi'_m$ and $\Phi''_m$ are the values of the magnetic flux
through the area of the turn in its initial and final positions, and
$R$ is the electrical resistance of the turn.
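The formula for $q$ implies that the charge depends only on the initial and final flux, not on how quickly the flux changes. The sketch below illustrates this under assumed values: a single turn is flipped through 180° in a uniform field, so that $\Phi'_m = BS$ and $\Phi''_m = -BS$, and the current $I = \mathcal{E}_i/R$ is integrated step by step for two different flip durations. All names and numbers are hypothetical.

```python
import math

def induced_charge(phi_initial, phi_final, resistance):
    """q = (Phi' - Phi'') / R in SI units."""
    return (phi_initial - phi_final) / resistance

def charge_by_integration(flux_of_t, resistance, t_end, steps=10_000):
    """Sum I dt with I = emf / R and emf = -dPhi/dt over each small step."""
    dt = t_end / steps
    charge = 0.0
    for k in range(steps):
        t0, t1 = k * dt, (k + 1) * dt
        emf = -(flux_of_t(t1) - flux_of_t(t0)) / dt
        charge += emf / resistance * dt
    return charge

B, S, R = 0.2, 1e-3, 2.0          # assumed: 0.2 T, 10 cm^2, 2 ohm
PHI0 = B * S
for duration in (0.01, 1.0):       # fast and slow flip of the turn
    flux = lambda t, T=duration: PHI0 * math.cos(math.pi * t / T)
    print(duration, charge_by_integration(flux, R, duration))
print(induced_charge(PHI0, -PHI0, R))
```

Both flip durations yield the same charge, $2BS/R$, in agreement with the closed-form expression.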


25.2  Phenomenon  of  Self-Induction 

25.2.1 Self-induction is the production of an induced electric
field in a circuit as a result of the variations of current in the
same circuit. The energy characteristic of the induced field is
the emf of self-induction $\mathcal{E}_s$.

25.2.2 The intrinsic magnetic field of the current in the circuit
produces the magnetic flux $\Phi_{ms}$ through the surface $S$
bounded by the current-carrying loop. Thus

$$\Phi_{ms} = \int_S B_n\,dS,$$

where $B_n$ is the projection of the induction vector $\mathbf{B}$ of
the magnetic field (23.1.2) on the normal $\mathbf{n}$ to the element
of surface $dS$. The quantity $\Phi_{ms}$ is called the magnetic flux
of self-induction of the loop. Making use of the Biot-Savart-Laplace
law (23.2.2), $\Phi_{ms}$ can be calculated for the case in which the
loop is in a nonferromagnetic medium (26.5.2):

$$\Phi_{ms} = I\,\frac{\mu_0\mu_r}{4\pi}\int_S dS \oint_L \frac{[d\mathbf{l} \times \mathbf{r}]_n}{r^3} \quad \text{(in SI units)},$$

$$\Phi_{ms} = \frac{\mu_r I}{c}\int_S dS \oint_L \frac{[d\mathbf{l} \times \mathbf{r}]_n}{r^3} \quad \text{(in Gaussian units)},$$

or

$$\Phi_{ms} = LI \quad \text{(in SI units)},$$

$$\Phi_{ms} = \frac{1}{c}\,LI \quad \text{(in Gaussian units)},$$

where $L$ is called the self-inductance of the loop and is
determined by the formulas

$$L = \frac{\mu_0\mu_r}{4\pi}\int_S dS \oint_L \frac{[d\mathbf{l} \times \mathbf{r}]_n}{r^3} \quad \text{(in SI units)},$$

$$L = \mu_r\int_S dS \oint_L \frac{[d\mathbf{l} \times \mathbf{r}]_n}{r^3} \quad \text{(in Gaussian units)}.$$

Here $\mu_0$ is the magnetic constant in SI units (23.2.2), $\mu_r$
is the relative magnetic permeability of the medium, $\mathbf{r}$ is
the radius vector from element $d\mathbf{l}$ of the loop to element
$dS$ of surface $S$, bounded by the loop, and subscript $n$ indicates
the projection on the normal to the element $dS$.

25.2.3 The self-inductance $L$ of the loop is numerically equal
to the self-induction flux at a current of $I = 1$ A (in SI units)
or $I = c$ (in Gaussian units). Self-inductance $L$ depends only
upon the geometric shape and dimensions of the loop, as well
as on the magnetic properties of the medium it is located in.
The self-inductance of a sufficiently long solenoid (23.4.8) is

$$L = \frac{\mu_0\mu_r N^2 S}{l} = \mu_0\mu_r n^2 V \quad \text{(in SI units)},$$

$$L = \frac{4\pi\mu_r N^2 S}{l} = 4\pi\mu_r n^2 V \quad \text{(in Gaussian units)},$$

where $l$ and $S$ are the length and cross-sectional area of the
solenoid, $N$ is the total number of turns, $n = N/l$ is the number
of turns per unit length, and $V = Sl$ is the volume of the solenoid.

25.2.4 The expression for the emf of self-induction follows from
Faraday’s law (25.1.2):

$$\mathcal{E}_s = -\frac{d\Phi_{ms}}{dt} = -\frac{d}{dt}(LI) \quad \text{(in SI units)},$$

$$\mathcal{E}_s = -\frac{1}{c}\frac{d\Phi_{ms}}{dt} = -\frac{1}{c^2}\frac{d}{dt}(LI) \quad \text{(in Gaussian units)}.$$

For a rigid (undeformable) loop in a nonferromagnetic medium
(26.5.2) $L = \text{const}$. Hence

$$\mathcal{E}_s = -L\frac{dI}{dt} \quad \text{(in SI units)},$$

$$\mathcal{E}_s = -\frac{L}{c^2}\frac{dI}{dt} \quad \text{(in Gaussian units)}.$$

25.2.5 The electromotive force of self-induction causes a
self-induction current to flow in the loop. According to the Lenz
law (25.1.4), this current counteracts the change in the main
current in the circuit, impeding its increase or decrease. The
self-inductance of the loop (25.2.3) is a measure of its inertia
to changes of the current.

Fig. 25.3  Fig. 25.4

25.2.6 The law for the change of the current in closing and opening
a circuit, i.e. with a nonsteady state in the circuit, is

$$I = I_0 e^{-Rt/L} + \frac{\mathcal{E}}{R}\left(1 - e^{-Rt/L}\right),$$

where $I_0$ is the current at the initial instant of time ($t = 0$),
$R$ is the electrical resistance of the circuit, $L$ is its
self-inductance, and $\mathcal{E}$ is the algebraic sum of the emf’s
of the sources of electrical energy connected into the circuit
(21.2.2).

When the source of emf is switched in there is no initial current
in the circuit ($I_0 = 0$):

$$I = \frac{\mathcal{E}}{R}\left(1 - e^{-Rt/L}\right).$$

The current in the circuit increases from zero to the value
$\mathcal{E}/R$, corresponding to that of the constant current. The
higher the ratio $R/L$ (Fig. 25.3), the more rapidly the current
increases. When the source of emf is switched off, $\mathcal{E} = 0$:

$$I = I_0 e^{-Rt/L}.$$

The current decreases exponentially from the initial value $I_0$.
The higher the ratio $R/L$, the more rapidly the current decreases
(Fig. 25.4).
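The transient law can be cross-checked numerically. The sketch below assumes that the law is the solution of the circuit equation $L\,dI/dt = \mathcal{E} - RI$ (Ohm's law with the emf of self-induction included; this equation is not written out in the text above) and integrates it with simple Euler steps. The circuit values and function names are hypothetical.

```python
import math

def rl_current(t, i0, emf, resistance, inductance):
    """Closed-form current I(t) = I0 e^{-Rt/L} + (emf/R)(1 - e^{-Rt/L})."""
    decay = math.exp(-resistance * t / inductance)
    return i0 * decay + (emf / resistance) * (1.0 - decay)

def rl_current_euler(t_end, i0, emf, resistance, inductance, steps=100_000):
    """Integrate L dI/dt = emf - R I with explicit Euler steps (cross-check)."""
    dt = t_end / steps
    current = i0
    for _ in range(steps):
        current += (emf - resistance * current) / inductance * dt
    return current

R, L, EMF = 10.0, 0.5, 12.0       # assumed: 10 ohm, 0.5 H, 12 V
TAU = L / R                        # time in which the transient changes e-fold
for t in (TAU, 3 * TAU, 10 * TAU):
    print(f"t = {t:.3f} s  closing: {rl_current(t, 0.0, EMF, R, L):.4f} A"
          f"  opening: {rl_current(t, EMF / R, 0.0, R, L):.4f} A")
```

After a time of several $L/R$ the current on closing is practically equal to $\mathcal{E}/R$ and the current on opening is practically zero, which is why a larger ratio $R/L$ means a faster transient.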

25.3  Mutual  Induction 

25.3.1 An emf is induced in all conductors in the vicinity of a
circuit in which the current varies. This phenomenon is called
mutual induction. Upon a change in the current $I_1$ in circuit 1
(Fig. 25.5), for example, an emf $\mathcal{E}_{21}$ of mutual
induction is induced in circuit 2. It equals

$$\mathcal{E}_{21} = -\frac{d\Phi_{m21}}{dt} \quad \text{(in SI units)},$$

$$\mathcal{E}_{21} = -\frac{1}{c}\frac{d\Phi_{m21}}{dt} \quad \text{(in Gaussian units)},$$

where $\Phi_{m21}$ is the magnetic flux through the surface bounded
by the second circuit. This flux is due to the magnetic field
set up by the current in the first circuit (magnetic flux of mutual
induction).

25.3.2 By analogy with Sect. 25.2.2, the magnetic flux $\Phi_{m21}$
is proportional to the current $I_1$. Thus

$$\Phi_{m21} = M_{21}I_1 \quad \text{(in SI units)},$$

$$\Phi_{m21} = \frac{1}{c}\,M_{21}I_1 \quad \text{(in Gaussian units)},$$

where $M_{21}$ is called the mutual inductance of the second and
first circuits. The quantity $M_{21}$ depends upon the shape,
dimensions and relative arrangement of the circuits, as well as
the relative magnetic permeability of the medium surrounding them.
The magnetic flux $\Phi_{m12}$ through the surface bounded by the
first circuit is due to the magnetic field set up by current $I_2$
in the second circuit. Thus

$$\Phi_{m12} = M_{12}I_2 \quad \text{(in SI units)},$$

$$\Phi_{m12} = \frac{1}{c}\,M_{12}I_2 \quad \text{(in Gaussian units)},$$

where $M_{12}$ is the mutual inductance of the first and second
circuits.

If the circuits are in a nonferromagnetic medium (26.5.2), then
$M_{21} = M_{12}$. For ferromagnetic media $M_{21}$ and $M_{12}$ are
not equal and depend, in addition to the above-mentioned factors,
on the currents in the two circuits and the way in which they
vary.

25.3.3 The expressions for the emf of mutual induction, under
the condition that $M_{21}$ and $M_{12}$ are both constant values, are

$$\mathcal{E}_{21} = -M_{21}\frac{dI_1}{dt} \quad \text{(in SI units)},$$

$$\mathcal{E}_{21} = -\frac{1}{c^2}\,M_{21}\frac{dI_1}{dt} \quad \text{(in Gaussian units)},$$

$$\mathcal{E}_{12} = -M_{12}\frac{dI_2}{dt} \quad \text{(in SI units)},$$

$$\mathcal{E}_{12} = -\frac{1}{c^2}\,M_{12}\frac{dI_2}{dt} \quad \text{(in Gaussian units)}.$$

An example of the application of mutual induction is the principle
of the transformer, which serves to raise or lower the voltage
of alternating current.

25.4  Energy  of  a  Magnetic  Field  Set  up  by  an 
Electric  Current  * 

25.4.1 To produce an electric current in a closed circuit (or loop)
and to increase this current from zero to $I$ it is necessary to do
work $W$ to overcome the emf of self-induction that impedes
the increase in current (25.2.6). Thus

$$W = \frac{\Phi_{ms}I}{2} = \frac{LI^2}{2} \quad \text{(in SI units)},$$

$$W = \frac{\Phi_{ms}I}{2c} = \frac{LI^2}{2c^2} \quad \text{(in Gaussian units)},$$

where $\Phi_{ms}$ is the magnetic flux of self-induction of the loop
(25.2.2) and $L$ is the self-inductance of the loop (25.2.2).
According to the energy conservation law, $W$ determines the
intrinsic energy $E_m$ of current $I$ in a circuit. Thus

$$E_m = \frac{LI^2}{2} \quad \text{(in SI units)},$$

$$E_m = \frac{LI^2}{2c^2} \quad \text{(in Gaussian units)}.$$

25.4.2 The magnetic field set up by the current in a circuit
increases together with this current. Hence, the intrinsic energy
of a current is dealt with as energy of this magnetic field. In
a long solenoid (23.3.8), for instance, having a uniform magnetic
field (23.1.2),

$$E_m = \frac{1}{2}\,\mu_0\mu_r n^2 I^2 V \quad \text{(in SI units)},$$

$$E_m = \frac{2\pi\mu_r n^2 I^2 V}{c^2} \quad \text{(in Gaussian units)},$$

where $V$ is the volume of the solenoid, $n$ is the number of turns
per unit length, $\mu_0$ is the magnetic constant (23.2.2), and
$\mu_r$ is the relative permeability of the medium.

*  It is assumed in this section that the conductors carrying
currents are in a nonferromagnetic, homogeneous and isotropic
medium.

25.4.3 The volume energy density $E_m^d$ of a magnetic field is the
energy per unit volume of the field. Thus

$$E_m^d = \frac{dE_m}{dV}.$$

For a uniform magnetic field (23.1.3), $E_m^d = E_m/V$.

The volume energy density of any magnetic field, including a
nonuniform one, is expressed by the formula

$$E_m^d = \frac{1}{2}BH = \frac{1}{2}\mu_0\mu_r H^2 = \frac{B^2}{2\mu_0\mu_r} \quad \text{(in SI units)},$$

$$E_m^d = \frac{BH}{8\pi} = \frac{\mu_r H^2}{8\pi} = \frac{B^2}{8\pi\mu_r} \quad \text{(in Gaussian units)}.$$

Here $B$ and $H$ are the magnitudes of the vectors of magnetic
induction (23.1.2) and strength (23.2.3) at the point being
considered in the magnetic field. The remaining notation is
defined in Sect. 25.4.2.
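The three ways of arriving at the energy of a long solenoid (through $LI^2/2$, through $\frac{1}{2}\mu_0\mu_r n^2 I^2 V$, and through the energy density times the volume) can be compared numerically in SI units. The sketch assumes the standard field of a long solenoid, $B = \mu_0\mu_r n I$, which is taken from outside this section, and uses arbitrary dimensions; all function names are hypothetical.

```python
import math

MU_0 = 4.0e-7 * math.pi   # magnetic constant, H/m (classical defined value)

def solenoid_inductance(n_turns, length, area, mu_r=1.0):
    """L = mu0 mu_r N^2 S / l for a sufficiently long solenoid."""
    return MU_0 * mu_r * n_turns**2 * area / length

def energy_from_inductance(inductance, current):
    """E_m = L I^2 / 2."""
    return 0.5 * inductance * current**2

def energy_density(b, mu_r=1.0):
    """Volume energy density B^2 / (2 mu0 mu_r)."""
    return b**2 / (2.0 * MU_0 * mu_r)

# Assumed solenoid: 1000 turns, 0.5 m long, 10 cm^2 cross section, 2 A.
N, LENGTH, AREA, CURRENT = 1000, 0.5, 1e-3, 2.0
n = N / LENGTH                         # turns per unit length
volume = AREA * LENGTH
b_inside = MU_0 * n * CURRENT          # assumed long-solenoid field

e1 = energy_from_inductance(solenoid_inductance(N, LENGTH, AREA), CURRENT)
e2 = 0.5 * MU_0 * n**2 * CURRENT**2 * volume
e3 = energy_density(b_inside) * volume
print(e1, e2, e3)
```

The three values coincide to rounding error, which is what treating the intrinsic energy of the current as the energy of its magnetic field requires.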

25.4.4 The energy $E_m$ of a magnetic field, localized in the
volume $V$, is

$$E_m = \int_V \frac{BH}{2}\,dV \quad \text{(in SI units)},$$

$$E_m = \int_V \frac{BH}{8\pi}\,dV \quad \text{(in Gaussian units)}.$$


25.4.5 If the magnetic field is set up by an arbitrary system of
$n$ circuits, or loops, with the currents $I_1, I_2, I_3, \ldots$, the
energy of the field can be expressed by the formula of Sect.
25.4.4, in which $B$ and $H$ are understood to be the magnitudes
of vectors $\mathbf{B}$ and $\mathbf{H}$ of the resultant magnetic
field, in accordance with the principle of superposition of fields
(15.2.2). Moreover, the energy of the magnetic field in this case is

$$E_m = \sum_{k=1}^{n} \frac{\Phi_{mk} I_k}{2},$$

where $\Phi_{mk}$ is the total magnetic flux linked to the $k$th loop
(23.6.5). In calculating this flux, the normal $\mathbf{n}_k$ to the
surface bounded by the loop is such that from the head of vector
$\mathbf{n}_k$ the current is seen to be counterclockwise. The
magnetic flux equals

$$\Phi_{mk} = \Phi_{mk}^{(s)} + \Phi_{mk}^{(m)},$$

where $\Phi_{mk}^{(s)}$ is the magnetic flux of self-induction of the
$k$th loop (25.2.2) and $\Phi_{mk}^{(m)}$ is the magnetic flux of its
mutual induction, produced by all the other loops carrying current
(25.3.1). Hence the energy $E_m$ of the magnetic field is equal to

$$E_m = \sum_{k=1}^{n} \frac{L_k I_k^2}{2} + \frac{1}{2}\sum_{k=1}^{n}\ \sum_{l=1\,(l \ne k)}^{n} M_{kl} I_k I_l \quad \text{(in SI units)},$$

$$E_m = \frac{1}{c^2}\sum_{k=1}^{n} \frac{L_k I_k^2}{2} + \frac{1}{2c^2}\sum_{k=1}^{n}\ \sum_{l=1\,(l \ne k)}^{n} M_{kl} I_k I_l \quad \text{(in Gaussian units)}.$$

The first term is the sum of the intrinsic energies of all the
currents (25.4.1). The second term is called the mutual energy
of the currents. Here $M_{kl}$ is the mutual inductance of the
$k$th and $l$th loops, or circuits, (25.3.2) with the currents
$I_k$ and $I_l$.


CHAPTER 26 MAGNETIC MATERIALS IN A MAGNETIC FIELD

26.1  Magnetic  Moments  of  Electrons  and  Atoms 

26.1.1 Magnetic materials are all substances capable of acquiring
magnetic properties, i.e. of being magnetized, in an external
magnetic field. This means that they can set up their own magnetic
field. The magnetic properties of substances are determined
by the magnetic properties of their electrons and atoms.*
Magnetic materials are divided according to their magnetic
properties into three main groups: diamagnetic (26.3.2),
paramagnetic (26.3.5) and ferromagnetic (26.5.1) materials.

26.1.2 The motion of an electron along its orbit in an atom is
equivalent to a certain closed loop carrying a current (orbital
current). According to Sect. 23.3.4, the orbital magnetic moment
$\mathbf{p}_m$ of an electron equals

$$\mathbf{p}_m = IS\mathbf{n} \quad \text{(in SI units)},$$

$$\mathbf{p}_m = \frac{1}{c}\,IS\mathbf{n} \quad \text{(in Gaussian units)},$$

where $I = e\nu$ is the current, $e$ is the magnitude of the charge
of the electron, $\nu$ is the number of revolutions of the electron
along its orbit in unit time, $S$ is the area of the electron orbit,
$\mathbf{n}$ is a unit vector of the normal to area $S$, and $c$ is
the electrodynamic constant (23.2.2).

An electron moving along its orbit has an orbital angular momentum
$\mathbf{L}_e$ (4.1.4). The orbital magnetic moment is proportional
to the orbital angular momentum:

$$\mathbf{p}_m = \gamma\mathbf{L}_e,$$

where

$$\gamma = -\frac{e}{2m} \quad \text{(in SI units)},$$

$$\gamma = -\frac{e}{2mc} \quad \text{(in Gaussian units)}.$$

The quantity $\gamma$ is called the gyromagnetic ratio of orbital
moments.

In the last formulas $m$ is the mass of the electron, $e$ is the
magnitude of its charge, and $c$ is the electrodynamic constant
(23.2.2).

Vectors $\mathbf{p}_m$ and $\mathbf{L}_e$ are opposite in direction
and are perpendicular to the orbital plane of the electron
(Fig. 26.1).

Fig. 26.1

*  The magnetic properties of atomic nuclei are dealt with in
Sect. 42.1.6.

26.1.3 An electron has an intrinsic angular momentum $\mathbf{s}$,
which is called the spin of the electron. The magnitude of the
electron spin is

$$s = \frac{\sqrt{3}}{2}\,\frac{h}{2\pi} = \frac{\sqrt{3}}{2}\,\hbar,$$

where $h$ is Planck’s constant (Appendix II), and $\hbar = h/2\pi$.
The most important feature of electron spin is that it has only two
projections on the direction of vector $\mathbf{B}$ of magnetic
induction.* Thus

$$s_B = \pm\frac{\hbar}{2}.$$

26.1.4 Electron spin $\mathbf{s}$ corresponds to the spin magnetic
moment $\mathbf{p}_{ms}$, which is proportional to the spin and
opposite in direction. Thus

$$\mathbf{p}_{ms} = \gamma_s\mathbf{s}.$$

The quantity $\gamma_s$ is called the gyromagnetic ratio of spin
moments:

$$\gamma_s = -\frac{e}{m} \quad \text{(in SI units)},$$

$$\gamma_s = -\frac{e}{mc} \quad \text{(in Gaussian units)},$$

where the notation is the same as in Sect. 26.1.2.

*  Regardless of whether it is an external magnetic field set up,
for instance, by current-carrying conductors, or the internal
magnetic field of the substance itself (26.4.2).

The projection of the electron spin magnetic moment on the
direction of the field (26.1.3) equals

$$p_{msB} = \pm\frac{e\hbar}{2m} = \pm\mu_B \quad \text{(in SI units)},$$

$$p_{msB} = \pm\frac{e\hbar}{2mc} = \pm\mu_B \quad \text{(in Gaussian units)},$$

where $\mu_B$ is called the Bohr magneton, and is a unit for
measuring magnetic moments (Appendix II).
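For orientation, the Bohr magneton and the related spin quantities can be evaluated in SI units. The sketch below uses rounded values of the fundamental constants, which are supplied here as assumptions rather than quoted from Appendix II; the variable names are hypothetical.

```python
import math

E_CHARGE = 1.602176634e-19    # magnitude of the electron charge, C
H_BAR = 1.054571817e-34       # Planck's constant divided by 2*pi, J*s
M_ELECTRON = 9.1093837e-31    # electron mass, kg (rounded)

gamma_orbital = -E_CHARGE / (2.0 * M_ELECTRON)   # gyromagnetic ratio, orbital
gamma_spin = -E_CHARGE / M_ELECTRON              # gyromagnetic ratio, spin

spin_magnitude = math.sqrt(3.0) / 2.0 * H_BAR    # |s| = (sqrt(3)/2) h-bar
spin_projection = H_BAR / 2.0                    # |s_B| = h-bar / 2

# Projection of the spin magnetic moment on the field: |gamma_s| * h-bar / 2.
bohr_magneton = abs(gamma_spin) * spin_projection

print(f"mu_B = {bohr_magneton:.4e} J/T")
print(f"gamma_s / gamma = {gamma_spin / gamma_orbital:.1f}")
```

The result is about $9.27 \times 10^{-24}$ J/T, and the spin gyromagnetic ratio is twice the orbital one.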

26.1.5 The information presented in Sects. 26.1.1 through
26.1.4 is valid for each of the $Z$ electrons in an atom. The number
$Z$ coincides with the atomic number in Mendeleev’s periodic
table (39.3.5).

The orbital magnetic moment $\mathbf{P}_m$ of an atom is the vector
sum of the orbital magnetic moments $\mathbf{p}_m$ of all its
electrons. Thus

$$\mathbf{P}_m = \sum_{i=1}^{Z} \mathbf{p}_{mi}.$$

The orbital angular momentum $\mathbf{L}$ of the atom is the vector
sum of the orbital angular momenta $\mathbf{L}_e$ of all $Z$
electrons. Thus

$$\mathbf{L} = \sum_{i=1}^{Z} \mathbf{L}_{ei}.$$

The atomic orbital magnetic moment $\mathbf{P}_m$ and angular
momentum $\mathbf{L}$ have the following relationship:

$$\mathbf{P}_m = \gamma\mathbf{L},$$

where $\gamma$ is the gyromagnetic ratio (26.1.2).


26.2  An  Atom  in  a  Magnetic  Field 

26.2.1 If a substance is in an external magnetic field, this field
can be assumed uniform (23.1.3) within the limits of an atom.
This follows from the smallness of atomic dimensions. Let us
assume that an electron in the atom travels along a circular
orbit whose plane is perpendicular to the induction vector
$\mathbf{B}$ of the magnetic field. The action of the Lorentz force
$F_L$ (24.1.1) reduces the force of attraction of the electron to
the nucleus. The centripetal force (2.4.3) is found to equal the
difference $F_e - F_L$, where $F_e$ is the Coulomb force (14.2.2) of
attraction of an electron to the nucleus (Fig. 26.2). This results
in a change in the angular velocity $\omega$ (1.5.3) of the electron
along its circular orbit.

26.2.2 The change in the angular velocity of the electron occurs
in the process of growth of the magnetic field into which the
atom is placed. This growth takes place during a finite length
of time. During this time an induced rotational electric field
(25.1.1) is developed that acts on the electron in the atom. The
strength $\mathbf{E}$ of this field is directed along a tangent to
the electron’s orbit and the force acting on the electron is
$F = eE$ (15.1.2).

26.2.3 For an arbitrary position of the electron's orbit with
respect to vector $\mathbf{B}$, the orbital magnetic moment $\mathbf{p}_m$ of the
electron (26.1.2) makes an angle $\alpha$ with the direction of the
magnetic field (Fig. 26.3). In this case there will be a precessional
motion of the orbit about the direction of vector $\mathbf{B}$ (4.3.2).
This means that vector $\mathbf{p}_m$, perpendicular to the orbit and
maintaining its angle $\alpha$ with the field constant, rotates about the
direction of vector $\mathbf{B}$ with the angular velocity $\omega_L$. Thus

$$\omega_L = \frac{\mu_0 e H}{2m} \quad \text{(in SI units)},$$

$$\omega_L = \frac{eH}{2mc} \quad \text{(in Gaussian units)}.$$

Here $e$ is the magnitude of the charge of the electron, $m$ is its
mass, $H$ is the magnetic field strength, $\mu_0$ is the magnetic
constant (23.2.2), $c$ is the electrodynamic constant (23.2.2), and
$\omega_L$ is called the angular velocity of Larmor precession.

Larmor's theorem states that the only result of the effect of a
magnetic field on an electron's orbit in an atom is the precession
of the orbit and vector $\mathbf{p}_m$ with the angular velocity $\omega_L$
about an axis through the nucleus of the atom and parallel to
the induction vector $\mathbf{B}$ of the magnetic field.
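As a rough numerical illustration of the Larmor angular velocity, the
following sketch evaluates it in SI units. It assumes the field is
expressed through the induction $B = \mu_0 H$ (valid for the applied
field in vacuum), so that $\omega_L = eB/2m$. The function name and the
rounded constants are illustrative choices and are not taken from the
handbook.

```python
# Larmor precession angular velocity in SI units (illustrative sketch).
E_CHARGE = 1.602176634e-19   # magnitude of the electron charge, C
E_MASS = 9.1093837015e-31    # electron mass, kg


def larmor_angular_velocity(b_tesla: float) -> float:
    """Return omega_L = e * B / (2 * m) in rad/s for an induction B in tesla."""
    return E_CHARGE * b_tesla / (2.0 * E_MASS)


# Example: a field with induction 1 T gives omega_L of the order of 1e11 rad/s.
omega_l = larmor_angular_velocity(1.0)
print(f"omega_L at B = 1 T: {omega_l:.3e} rad/s")
```

The value is several orders of magnitude below typical orbital angular
velocities of atomic electrons, which is why the precession can be
treated as a small correction to the orbital motion.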

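26.2.4 The precessional motion of the electron's orbit results
in an additional orbital current $\Delta I_{orb}$ (Fig. 26.3) and the
corresponding induced orbital magnetic moment $\Delta p_m$ (26.1.2)
with the magnitude

$$\Delta p_m = \Delta I_{orb} S_\perp = \frac{e^2 S_\perp}{4\pi m} B \quad \text{(in SI units)},$$

$$\Delta p_m = \frac{1}{c}\,\Delta I_{orb} S_\perp = \frac{e^2 S_\perp}{4\pi m c^2} B \quad \text{(in Gaussian units)},$$

where $S_\perp$ is the area of the projection of the orbit on a plane
perpendicular to the direction of vector $\mathbf{B}$. Vector $\Delta\mathbf{p}_m$ has
the direction opposite to the magnetic induction vector $\mathbf{B}$.
Thus

$$\Delta\mathbf{p}_m = -\frac{e^2 S_\perp}{4\pi m}\,\mathbf{B} \quad \text{(in SI units)},$$

$$\Delta\mathbf{p}_m = -\frac{e^2 S_\perp}{4\pi m c^2}\,\mathbf{B} \quad \text{(in Gaussian units)}.$$

The notation is the same as in Sect. 26.2.3.

Fig. 26.3

The total induced orbital moment $\Delta\mathbf{P}_m$ of the atom (26.1.5)
is

$$\Delta\mathbf{P}_m = -\frac{e^2 Z \langle S_\perp\rangle}{4\pi m}\,\mathbf{B} \quad \text{(in SI units)},$$

$$\Delta\mathbf{P}_m = -\frac{e^2 Z \langle S_\perp\rangle}{4\pi m c^2}\,\mathbf{B} \quad \text{(in Gaussian units)},$$

where $Z$ is the number of electrons in the atom, and
$\langle S_\perp\rangle = \sum_{i=1}^{Z} S_{\perp i} / Z$ is the average area of the
projections of the electron orbits on a plane perpendicular to the
direction of vector $\mathbf{B}$.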


26.3  Diamagnetic  and  Paramagnetic  Materials  in  a 
Uniform  Magnetic  Field 

26.3.1 The magnetization of matter is characterized by the
magnetization vector (magnetization intensity) $\mathbf{M}$, which is the
ratio of the magnetic moment of a small volume $\Delta V$ of the
substance to the magnitude of this volume. Thus

$$\mathbf{M} = \frac{1}{\Delta V}\sum_{i=1}^{N}\mathbf{P}_{mi},$$

where $\mathbf{P}_{mi}$ is the magnetic moment of the $i$th atom (or molecule),
and $N$ is the total number of atoms (or molecules) in the small
volume $\Delta V$. Within the limits of the volume $\Delta V$ the magnetic
field is assumed uniform. At the same time, this volume should
contain a sufficiently large number of particles $N$ (i.e. $N \gg 1$)
for the physical quantities characterizing the system of
particles to be meaningfully averaged.
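The definition of the magnetization vector can be sketched in a few
lines of code: sum the moments of the particles in the volume
component by component and divide by the volume. The sample values
below are hypothetical and far too few for a physically meaningful
average; they only show the bookkeeping.

```python
from typing import Iterable, Tuple

Vector = Tuple[float, float, float]


def magnetization(moments: Iterable[Vector], volume: float) -> Vector:
    """Return M = (1 / dV) * sum(P_mi) in A/m for moments in J/T and dV in m^3."""
    mx = my = mz = 0.0
    for px, py, pz in moments:
        mx += px
        my += py
        mz += pz
    return (mx / volume, my / volume, mz / volume)


# Hypothetical sample: three moments along z and one along x,
# each of magnitude 1e-23 J/T, inside a volume of 1e-27 m^3.
sample = [(0.0, 0.0, 1e-23)] * 3 + [(1e-23, 0.0, 0.0)]
m_vec = magnetization(sample, 1e-27)
print(m_vec)
```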

26.3.2  Diamagnetic  materials  are  substances  in  which  the  mag¬ 
netic  moments  of  the  atoms  (or  molecules)  equal  zero  in  the 
absence  of  an  external  magnetic  field.  This  means  that  in  dia¬ 
magnetic  materials  the  vector  sum  of  the  orbital  magnetic 
moments  of  all  the  electrons  in  an  atom  equals  zero  (26.1.5) 
and  that  induced  magnetic  moments  (26.2.4)  exist  only  in  a 
magnetic  field. 

26.3.3 In the volume $\Delta V$ of an isotropic diamagnetic material,
the induced magnetic moments $\Delta\mathbf{P}_m$ of all the atoms (or
molecules) are the same and are opposite in direction to vector $\mathbf{B}$
(26.2.4).

The magnetization vector $\mathbf{M}$ is equal to

$$\mathbf{M} = \frac{N\,\Delta\mathbf{P}_m}{\Delta V} = n_0\,\Delta\mathbf{P}_m,$$

and, taking Sect. 26.2.4 into account:

$$\mathbf{M} = -\frac{n_0 e^2 Z \langle S_\perp\rangle}{4\pi m}\,\mathbf{B} \quad \text{(in SI units)},$$

$$\mathbf{M} = -\frac{n_0 e^2 Z \langle S_\perp\rangle}{4\pi m c^2}\,\mathbf{B} \quad \text{(in Gaussian units)},$$

where $n_0$ is the number of atoms (or molecules) in unit volume.
The other notation is the same as in Sect. 26.2.4. Introducing
the notation:

$$\chi'_m = -\frac{n_0 e^2 Z \langle S_\perp\rangle \mu_0}{4\pi m} \quad \text{(in SI units)},$$

$$\chi'_m = -\frac{n_0 e^2 Z \langle S_\perp\rangle}{4\pi m c^2} \quad \text{(in Gaussian units)},$$

we have

$$\mathbf{M} = \chi'_m\,\frac{\mathbf{B}}{\mu_0} \quad \text{(in SI units)},$$

$$\mathbf{M} = \chi'_m\,\mathbf{B} \quad \text{(in Gaussian units)},$$

where $\chi'_m$ is a dimensionless quantity that characterizes the
magnetic properties of magnetic materials. For all diamagnetic
materials $\chi'_m < 0$.
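To get a feel for the size of the diamagnetic effect, the sketch below
evaluates the SI expression
$\chi'_m = -n_0 e^2 Z \langle S_\perp\rangle \mu_0 / (4\pi m)$ for a
helium-like gas. The number density, the electron count and the orbit
radius are assumed, order-of-magnitude inputs chosen for illustration
only; they do not come from the handbook.

```python
import math

E_CHARGE = 1.602176634e-19   # magnitude of the electron charge, C
E_MASS = 9.1093837015e-31    # electron mass, kg
MU0 = 4.0e-7 * math.pi       # magnetic constant, H/m (classical value)


def diamagnetic_chi_prime(n0: float, z: int, mean_area: float) -> float:
    """Return chi'_m = -n0 * e^2 * Z * <S_perp> * mu0 / (4 * pi * m) (SI)."""
    return -n0 * E_CHARGE**2 * z * mean_area * MU0 / (4.0 * math.pi * E_MASS)


# Assumed inputs: gas at about normal conditions (n0 ~ 2.7e25 m^-3),
# Z = 2 electrons, circular orbit projections of radius 0.5e-10 m.
mean_area = math.pi * (0.5e-10) ** 2
chi_prime = diamagnetic_chi_prime(2.7e25, 2, mean_area)
print(f"chi'_m ~ {chi_prime:.1e}")
```

The result is negative and of the order of $10^{-9}$ for a gas. For a
condensed substance, where $n_0$ is about a thousand times larger, the
same expression gives values of the order of $10^{-6}$ to $10^{-5}$.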

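26.3.4 The magnetic susceptibility $\chi_m$ is a quantity that is
related to $\chi'_m$ as follows:

$$1 + \chi_m = \frac{1}{1 - \chi'_m} \quad \text{(in SI units)},$$

$$1 + 4\pi\chi_m = \frac{1}{1 - 4\pi\chi'_m} \quad \text{(in Gaussian units)},$$

from which

$$\chi_m = \frac{\chi'_m}{1 - \chi'_m} \quad \text{(in SI units)},$$

$$\chi_m = \frac{\chi'_m}{1 - 4\pi\chi'_m} \quad \text{(in Gaussian units)}.$$

Practically, $\chi_m = \chi'_m$ for diamagnetic materials, because in
absolute value $\chi'_m$ is very small: $|\chi'_m| \approx 10^{-6}$.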

26.3.5 Paramagnetic materials are substances in which the
atoms (or molecules) have a certain constant magnetic moment
$\mathbf{P}_m$ in the absence of an external magnetic field. This means
that the vector sum of the orbital magnetic moments of all
the electrons of an atom (or molecule) is not equal to zero
(26.1.5).

26.3.6 When a paramagnetic material is placed into a uniform
magnetic field (23.1.3), the constant magnetic moments of the
atoms (or molecules) precess about the direction of the induction
vector $\mathbf{B}$ of the magnetic field at Larmor's angular velocity
$\omega_L$ (26.2.3).

Thermal motion and collisions of the atoms (or molecules) of
a paramagnetic material lead to a gradual damping of the
precession of the magnetic moments and to a reduction in the
angles between the directions of the magnetic moment vectors
and vector $\mathbf{B}$. The combined actions of the interatomic collisions
and the magnetic field lead to a predominant orientation of the
magnetic moments of the atoms in alignment with the external
field. Though the constant magnetic moment $P_m$ of an atom
(or molecule) is only of the order of $10^{-23}$ joule/tesla
($10^{-20}$ erg/gauss), the magnetic moments of all the particles in
unit volume produce a magnetization that considerably exceeds
diamagnetic phenomena (26.3.3). A paramagnetic material in an
external magnetic field has an intrinsic magnetic field aligned
with the external magnetic field.

26.3.7 In the classical theory of paramagnetism the magnitude
of the magnetization vector (26.3.1) is expressed by the formula

$$M = n_0 P_m L(a),$$

where $n_0$ is the number of atoms (or molecules) in unit volume,
and $L(a)$ is the classical Langevin function:

$$L(a) = \frac{e^{a} + e^{-a}}{e^{a} - e^{-a}} - \frac{1}{a}.$$

The parameter $a$ is of the form $a = P_m B / kT$, where $B$ is the
magnetic induction of the field, $k$ is Boltzmann's constant
(8.4.5), and $T$ is the absolute temperature. At room temperature
and in external fields that are not especially strong, $a \ll 1$ and
the function $L(a)$ is simplified after expansion into a series,
giving $L(a) \approx a/3$. Hence, the magnetization vector is
equal to

$$\mathbf{M} = \chi'_m\,\frac{\mathbf{B}}{\mu_0} \quad \text{(in SI units)},$$

$$\mathbf{M} = \chi'_m\,\mathbf{B} \quad \text{(in Gaussian units)},$$

where $\chi'_m$ is determined by the formula

$$\chi'_m = \frac{n_0 P_m^2 \mu_0}{3kT} \quad \text{(in SI units)},$$

$$\chi'_m = \frac{n_0 P_m^2}{3kT} \quad \text{(in Gaussian units)}.$$

The quantity $\chi'_m$ is related to the magnetic susceptibility $\chi_m$
of paramagnetic materials by the formulas given in Sect. 26.3.4.
The value of the quantity $\chi'_m$ is positive for a paramagnetic
material and ranges from $10^{-6}$ to $10^{-3}$; therefore, $\chi_m = \chi'_m$
with a high degree of accuracy.

The Curie law states that the paramagnetic susceptibility of
a substance is inversely proportional to the absolute temperature.

26.3.8 In very strong external magnetic fields, saturation
magnetization is reached; here $a \gg 1$ and the Langevin function
$L(a) \to 1$. This means that the magnetic moments of all the
atoms (or molecules) are aligned with the external magnetic
field and $M = n_0 P_m$.
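The two limits of the classical Langevin function, $L(a) \approx a/3$
for $a \ll 1$ and $L(a) \to 1$ for $a \gg 1$, can be checked with a
short sketch. The function names, the series cut-off at $|a| < 10^{-4}$
and the sample moment of $10^{-23}$ J/T are illustrative assumptions.

```python
import math

K_B = 1.380649e-23   # Boltzmann's constant, J/K


def langevin(a: float) -> float:
    """Classical Langevin function L(a) = coth(a) - 1/a."""
    if abs(a) < 1e-4:
        # Series expansion near zero avoids loss of precision:
        # L(a) = a/3 - a^3/45 + ...
        return a / 3.0 - a**3 / 45.0
    return 1.0 / math.tanh(a) - 1.0 / a


def langevin_parameter(p_m: float, b: float, temperature: float) -> float:
    """Return a = P_m * B / (k * T) in SI units."""
    return p_m * b / (K_B * temperature)


# Assumed example: P_m = 1e-23 J/T, B = 1 T, T = 300 K gives a << 1,
# so L(a) is very close to a/3 (the Curie-law regime).
a_room = langevin_parameter(1e-23, 1.0, 300.0)
print(a_room, langevin(a_room), a_room / 3.0)

# For a >> 1 the function approaches 1 (saturation).
print(langevin(100.0))
```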


26.4  Magnetic  Field  in  Magnetic  Materials 

26.4.1 Two types of currents that set up magnetic fields,
macrocurrents and microcurrents, can be distinguished in matter.
What we have called macrocurrents here are conduction currents
(20.1.2) and convection currents (20.1.2). Microcurrents
(molecular currents) are those due to the motion of electrons in
atoms, molecules and ions.

The magnetic field in a substance is the vector sum of two fields
(15.2.2): the external magnetic field, set up by the macrocurrents,
and the internal, or intrinsic, magnetic field, set up by the
microcurrents. The magnetic induction vector $\mathbf{B}$ (23.1.2) of the
magnetic field in a substance characterizes the resultant magnetic
field and is equal to the vector sum of the magnetic inductions
of the external $\mathbf{B}_0$ and intrinsic $\mathbf{B}_{intr}$ magnetic fields. Thus

$$\mathbf{B} = \mathbf{B}_0 + \mathbf{B}_{intr}.$$

The primary sources of the magnetic field in magnetic materials
are the macrocurrents. It is their magnetic fields that lead
to the magnetization of substances placed in an external
magnetic field.

26.4.2 The total current law for a magnetic field in a substance
is a generalization of the law formulated in Sect. 23.5.2:

$$\oint_L \mathbf{B}\cdot d\mathbf{l} = \mu_0\,(I_{mac} + I_{mic}) \quad \text{(in SI units)},$$

$$\oint_L \mathbf{B}\cdot d\mathbf{l} = \frac{4\pi}{c}\,(I_{mac} + I_{mic}) \quad \text{(in Gaussian units)},$$

where $I_{mac}$ and $I_{mic}$ are the algebraic sums of the macro- and
microcurrents passing through a surface bounded by the loop $L$.

26.4.3 The algebraic sum of the microcurrents is related to
the magnetization vector $\mathbf{M}$ as follows:

$$I_{mic} = \oint_L \mathbf{M}\cdot d\mathbf{l} \quad \text{(in SI units)},$$

$$I_{mic} = c\oint_L \mathbf{M}\cdot d\mathbf{l} \quad \text{(in Gaussian units)},$$

where $\oint_L \mathbf{M}\cdot d\mathbf{l}$ is the circulation of the magnetization
vector $\mathbf{M}$ (26.3.1) along the closed loop $L$ through which the
microcurrents are threaded.

The final form of the total current law (26.4.2) is

$$\oint_L \left(\frac{\mathbf{B}}{\mu_0} - \mathbf{M}\right)\cdot d\mathbf{l} = I_{mac} \quad \text{(in SI units)},$$

$$\oint_L (\mathbf{B} - 4\pi\mathbf{M})\cdot d\mathbf{l} = \frac{4\pi}{c}\,I_{mac} \quad \text{(in Gaussian units)}.$$

26.4.4 The vector

$$\mathbf{H} = \frac{\mathbf{B}}{\mu_0} - \mathbf{M} \quad \text{(in SI units)},$$

$$\mathbf{H} = \mathbf{B} - 4\pi\mathbf{M} \quad \text{(in Gaussian units)}$$

is called the strength of the magnetic field existing in an
arbitrary medium (cf. Sect. 23.2.3). The total current law for a
magnetic field in an arbitrary medium is written in a form
identical to that in Sect. 23.5.4:

$$\oint_L \mathbf{H}\cdot d\mathbf{l} = I_{mac} \quad \text{(in SI units)},$$

$$\oint_L \mathbf{H}\cdot d\mathbf{l} = \frac{4\pi}{c}\,I_{mac} \quad \text{(in Gaussian units)}.$$

The circulation of the magnetic field strength vector along an
arbitrary closed loop is equal (or proportional) to the algebraic
sum of the macrocurrents through the surface bounded by this
loop.
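A common use of the total current law in SI form is a toroidal coil
with a core. Assuming (this example is not in the handbook) a coil of
$N$ turns carrying current $I$, symmetry makes $H$ constant along a
circle of radius $r$ inside the core, and the macrocurrent through the
surface bounded by that circle is $NI$, so $H \cdot 2\pi r = NI$. The
core permeability used below is a hypothetical value.

```python
import math

MU0 = 4.0e-7 * math.pi   # magnetic constant, H/m (classical value)


def toroid_field_strength(turns: int, current: float, radius: float) -> float:
    """Return H = N * I / (2 * pi * r) in A/m from the circulation of H."""
    return turns * current / (2.0 * math.pi * radius)


# Assumed example: 500 turns, 2 A, mean radius 0.1 m.
h_field = toroid_field_strength(500, 2.0, 0.1)

# With a hypothetical relative permeability of 1000 the induction is
# B = mu0 * mu_r * H.
b_field = MU0 * 1000.0 * h_field
print(f"H = {h_field:.1f} A/m, B = {b_field:.3f} T")
```

Note that $H$ depends only on the macrocurrent, whereas $B$ also
reflects the contribution of the microcurrents in the core.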

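26.4.5 For an isotropic medium, the relation between the
magnetic induction vector $\mathbf{B}$ and the magnetization vector $\mathbf{M}$
(26.3.1) leads to the following result for vector $\mathbf{H}$ (26.4.4):

$$\mathbf{H} = (1 - \chi'_m)\,\frac{\mathbf{B}}{\mu_0} \quad \text{(in SI units)},$$

$$\mathbf{H} = (1 - 4\pi\chi'_m)\,\mathbf{B} \quad \text{(in Gaussian units)}.$$

Substituting for $(1 - \chi'_m)$ or $(1 - 4\pi\chi'_m)$ on the basis of Sect.
26.3.4, we have

$$\mathbf{H} = \frac{\mathbf{B}}{\mu_0\mu_r} \quad \text{(in SI units)},$$

$$\mathbf{H} = \frac{\mathbf{B}}{\mu_r} \quad \text{(in Gaussian units)},$$

where

$$\mu_r = 1 + \chi_m \quad \text{(in SI units)},$$

$$\mu_r = 1 + 4\pi\chi_m \quad \text{(in Gaussian units)}.$$

The quantity $\mu_r$ thus introduced is called the relative magnetic
permeability of the substance (see also Sect. 23.2.1). In these
formulas $\chi_m$ is the magnetic susceptibility (26.3.4).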
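The chain of SI relations $\chi'_m \to \chi_m \to \mu_r$ and the two
equivalent expressions for $H$ can be verified numerically. This is a
minimal sketch; the function names and the sample value
$\chi'_m = 10^{-3}$ are assumptions made for illustration.

```python
import math

MU0 = 4.0e-7 * math.pi   # magnetic constant, H/m (classical value)


def chi_from_chi_prime(chi_prime: float) -> float:
    """Magnetic susceptibility chi_m = chi'_m / (1 - chi'_m) (SI)."""
    return chi_prime / (1.0 - chi_prime)


def relative_permeability(chi_prime: float) -> float:
    """Relative permeability mu_r = 1 + chi_m (SI)."""
    return 1.0 + chi_from_chi_prime(chi_prime)


def h_from_b(b: float, chi_prime: float) -> float:
    """Field strength H = (1 - chi'_m) * B / mu0 (SI)."""
    return (1.0 - chi_prime) * b / MU0


chi_prime = 1.0e-3          # assumed paramagnetic value
b = 0.5                     # induction, T
h_direct = h_from_b(b, chi_prime)
h_via_mu_r = b / (MU0 * relative_permeability(chi_prime))
print(chi_from_chi_prime(chi_prime), h_direct, h_via_mu_r)
```

For values this small, $\chi_m$ and $\chi'_m$ differ only in the sixth
decimal place, in line with the statement that they practically
coincide.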


26.5  Ferromagnetic  Materials 

26.5.1 Ferromagnetic materials are substances in which the
intrinsic (internal) magnetic field (26.4.1) is hundreds or even
thousands of times stronger than the external magnetic field
applied to the substance.

Ferromagnetism is observed in the crystals of the transition
metals (39.3.8), iron, cobalt and nickel, and of certain alloys,
provided that $d/a > 1.5$, where $d$ is the diameter of the atom
and $a$ is the diameter of the unfilled electronic shell of the atom
(39.3.6).

26.5.2 The principal properties of ferromagnetic materials,
distinguishing them from other magnetic materials, are:

(a) The dependence of the magnetization (26.3.1) on the strength
$H$ of the external magnetic field is characterized by magnetic
saturation $M_s$, which begins at $H_s$ (Fig. 26.4).

(b) The dependence of the magnetic induction $B$ on $H$ is
characterized by an increase according to a linear law at $H \geq H_s$
(Fig. 26.5).


Fig.  26.4 


Fig.  26.5 


(c) The dependence of the relative magnetic permeability $\mu_r$
on the field strength $H$ is of a complex nature (Fig. 26.6).

(d) Magnetic hysteresis exists in ferromagnetic materials. This
refers to the lag of the variation in magnetization behind the
variation in the strength of the external magnetizing field,
which is variable in magnitude and direction. This lag is due
to the dependence of $M$ on its previous values (past history)
in magnetizing the substance.

(e) The properties of ferromagnetic materials enumerated
above* are observed at temperatures below the Curie point $\Theta_C$
(18.4.4). At temperatures $T \geq \Theta_C$, thermal motion breaks up
the regions of spontaneous magnetization (26.5.4) and the
ferromagnetic material, losing its special features, is converted
into a paramagnetic substance (26.3.5). The Curie points for
iron, nickel, cobalt and Permalloy are 1043, 631, 1403 and
823 K, respectively.

* As well as many others that are beyond the scope of this
handbook.

Fig. 26.6

26.5.3 The hysteresis loop is a curve showing the change in the
magnetization of a ferromagnetic material located in an external
magnetic field as the strength of this field is varied from $+H_s$
to $-H_s$ and back again, where $H_s$ is the field strength
corresponding to magnetic saturation (Fig. 26.7). The magnetization
$\pm M_s$ at $H = \pm H_s$ is called saturation magnetization. The
magnetization $\pm M_r$ that remains in a ferromagnetic material
in the absence of an external field ($H = 0$) is called the residual,
or remanent, magnetization. Residual magnetization is the basis
for making permanent magnets. The strength $\pm H_c$ of the external
field that completely demagnetizes a substance is called the coercive
force. The coercive force characterizes the ability of a ferromagnetic
material to retain its residual magnetization. Hard, or retentive,
magnetic materials have a high coercive force. They have a wide
hysteresis loop and are used to make permanent magnets. Soft,
or non-retentive, magnetic materials have a low coercive force.
They have a narrow hysteresis loop and are used to make
transformer cores.

The reversal of magnetization is associated with a change in
the orientation of the regions of spontaneous magnetization
(26.5.4) and requires that work be done at the expense of the
energy of the external magnetic field. The amount of heat evolved
in reversing magnetization is proportional to the area of the
hysteresis loop.
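The proportionality between the heat evolved and the loop area can be
made concrete with a small sketch. It assumes the loop is drawn in
$B$ versus $H$ coordinates in SI units, in which case the enclosed area
$\oint H\,dB$ is the energy dissipated per unit volume per cycle (a
standard result that the handbook does not state explicitly). The
four-point parallelogram below is a hypothetical, highly idealized
loop, and the area is found with the shoelace formula.

```python
from typing import List, Tuple


def loop_area(points: List[Tuple[float, float]]) -> float:
    """Area of a closed polygon given as (H, B) vertices (shoelace formula)."""
    n = len(points)
    twice_area = 0.0
    for i in range(n):
        h1, b1 = points[i]
        h2, b2 = points[(i + 1) % n]
        twice_area += h1 * b2 - h2 * b1
    return abs(twice_area) / 2.0


# Hypothetical loop: saturation at H = +/-300 A/m, B = +/-1 T,
# coercive force H_c = 100 A/m. Vertices are listed in traversal order.
loop = [(-300.0, -1.0), (100.0, 0.0), (300.0, 1.0), (-100.0, 0.0)]
loss_per_cycle = loop_area(loop)   # J/m^3 per magnetization cycle
print(loss_per_cycle)
```

A material with a larger coercive force gives a wider loop and hence a
larger loss per cycle, which is why soft materials are preferred for
transformer cores.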

26.5.4 At temperatures below the Curie point (26.5.2) a
ferromagnetic material is made up of domains, which are small
regions of uniform spontaneous magnetization. The linear
dimensions of these domains range from $10^{-5}$ to $10^{-4}$ m. Within each
domain the substance is magnetized to saturation (26.5.2).
In the absence of an external magnetic field, the magnetic
moments of the various domains are oriented in space so that the
resultant magnetic moment of the whole ferromagnetic body
equals zero.

Owing to the effect of the external magnetic field, the magnetic
moments of whole domains are oriented in a ferromagnetic
material, rather than those of the separate particles as in the
case of paramagnetic materials (26.3.6). As a result, the
substance is magnetized.

Ferromagnetic properties can only exist in the crystalline state
of substances in which the interaction between the neighbouring
atoms of the crystal lattice gives the system of electrons a total
energy that satisfies the conditions for the existence of
ferromagnetism (26.5.1).

26.5.5 Measurement of the gyromagnetic ratio (26.1.4) for
ferromagnetic bodies indicates that the elementary carriers
of magnetism in ferromagnetic materials are the spin magnetic
moments of the electrons (26.1.4). The modern quantum mechanical
theory of ferromagnetism explains the nature of the spontaneous
magnetization of ferromagnetic materials (26.5.4) and
the reasons for the development of a strong intrinsic magnetic
field (26.4.1).

Ferromagnetic properties are possessed by crystals of substances
whose atoms have inner shells and subshells (39.3.6) unfilled
by electrons and having a nonzero value of the projections of
the resultant spin moment on the direction of the magnetic
field (26.1.3). A specific kind of quantum mechanical interaction,
which is not of magnetic origin, develops between the spins of
such electrons. It is called exchange interaction (39.4.5). As a
result, the state of the system of electrons in ferromagnetic
materials is found to be stable upon parallel alignment of the
spins, and spontaneous magnetization (26.5.4) is developed along
with a strong intrinsic magnetic field.

CHAPTER 27 FUNDAMENTALS OF MAXWELL'S THEORY


27.1  General  Features  of  Maxwell's  Theory 

27.1.1  Maxwell's  theory  is  a  systematic  theory  of  a  unified 
electromagnetic  field  (15.1.2)  set  up  by  an  arbitrary  system  of 
charges  and  currents.  Maxwell’s  theory  enables  the  principal 
problem  of  electrodynamics  to  be  solved:  determining  the  char¬ 
acteristics  of  electric  and  magnetic  fields  on  the  basis  of  a  given 
distribution  of  charges  and  currents  that  set  up  these  fields. 
Maxwell’s  theory  is  a  generalization  of  fundamental  laws  that 
describe  electrical  and  electromagnetic  phenomena.  These 
include  the  Ostrogradsky-Gauss  theorem  (18.3.3),  the  total  cur¬ 
rent  law  (26.4.2),  and  the  law  of  electromagnetic  induction 
(25.1.2). 

27.1.2  Maxwell’s  theory  is  of  a  phenomenological  nature. 
This  is  manifested  in  the  fact  that  it  does  not  deal  with  the 
internal  mechanism  of  the  phenomena  that  occur  in  a  medium 
and  lead  to  the  production  of  electric  and  magnetic  fields.  The 
medium  is  described  in  this  theory  by  three  quantities  that 
specify  its  electrical  and  magnetic  properties.  These  are  the 
relative  permittivity  (dielectric  constant)  er  (18. 3M),  relative 
magnetic permeability $\mu_r$ (26.4.5) and the electrical
conductivity $\kappa$ (20.3.4).

27.1.3.  Maxwell’s  theory  deals  with  macroscopic  fields  set 
up  by  macroscopic  charges  and  currents  that  are  concentrated 
in  volumes  incommensurably  greater  than  those  of  atoms  (or 
molecules).  It  is  assumed  that  the  distances  from  the  sources 
of  the  fields  to  the  points  in  space  being  considered  substan¬ 
tially  exceed  the  linear  dimensions  of  atoms  (or  molecules). 
Therefore,  macroscopic  fields  change  appreciably  only  at  dis¬ 
tances  that  are  huge  compared  to  the  linear  dimensions  of  atoms 
(or  molecules).  Moreover,  the  periods  of  time  required  for  changes 
in  variable  electric  and  magnetic  fields  are  considered  to  be 
considerably  longer  than  those  required  for  intramolecular 
processes. 

27.1.4 The macroscopic charges and currents are the sets of
microscopic charges and currents that set up their microfields
(electric and magnetic), varying continuously with time at
each point in space.

The macroscopic fields dealt with in Maxwell's theory are
averaged microfields. Averaging is carried out over time
intervals considerably longer than those typical of interatomic
processes, and over volumes of fields many times greater than
the volumes of atoms and molecules (27.4.5).

27.1.5  Maxwell’s  theory  is  one  of  short-range  action,  according 
to  which  electrical  and  magnetic  interactions  take  place  in 
electric  and  magnetic  fields  and  propagate  at  a  finite  velocity 
equal  to  that  of  light  in  the  given  medium.  This  important 
consequence  is  taken  into  account  in  the  electromagnetic  theory 
of  light  developed  by  Maxwell. 


27.2  Maxwell's  First  Equation 

27.2.1 Maxwell's first equation in integral form is a
generalization of Faraday's law of electromagnetic induction in
the form (25.1.2):

$$\oint_L \mathbf{E}\cdot d\mathbf{l} = -\frac{d\Phi_m}{dt} \quad \text{(in SI units)},$$

$$\oint_L \mathbf{E}\cdot d\mathbf{l} = -\frac{1}{c}\,\frac{d\Phi_m}{dt} \quad \text{(in Gaussian units)}.$$

According to Maxwell, this law is valid for any closed circuit,
or loop, conceivably selected in a variable magnetic field, and
not only for current-conducting ones. This means that a variable
magnetic field sets up a rotational electric field at any point in
space, regardless of whether or not there is a conductor at that
point.

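27.2.2 Using the following expression for the magnetic flux
(23.5.5):

$$\Phi_m = \int_S \mathbf{B}\cdot d\mathbf{S} = \int_S B_n\,dS$$

and the Stokes theorem from vector analysis:

$$\oint_L \mathbf{E}\cdot d\mathbf{l} = \int_S \operatorname{curl}\mathbf{E}\cdot d\mathbf{S},$$

where $d\mathbf{S} = \mathbf{n}\,dS$ ($\mathbf{n}$ being the unit vector of the normal to the
element $dS$ of surface), we can convert Maxwell's first equation
as given in Sect. 27.2.1 to Maxwell's first equation in differential
form:

$$\operatorname{curl}\mathbf{E} = -\frac{\partial\mathbf{B}}{\partial t} \quad \text{(in SI units)},$$

$$\operatorname{curl}\mathbf{E} = -\frac{1}{c}\,\frac{\partial\mathbf{B}}{\partial t} \quad \text{(in Gaussian units)}.$$

Here curl $\mathbf{E}$ is expressed in Cartesian coordinates by the
determinant

$$\operatorname{curl}\mathbf{E} = \begin{vmatrix} \mathbf{i} & \mathbf{j} & \mathbf{k} \\ \dfrac{\partial}{\partial x} & \dfrac{\partial}{\partial y} & \dfrac{\partial}{\partial z} \\ E_x & E_y & E_z \end{vmatrix}.$$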

27.2.3 An inductive type of electron accelerator, the betatron
(24.4.2), is based on setting up a rotational electric field in
space by the action of an alternating magnetic field.

In the alternating magnetic field produced by an electromagnet
with conical pole pieces A and C (Fig. 27.1), a rotational electric
field is set up in accelerating chamber D, having the form of a
closed ring. The lines of force (15.1.6) of the rotational electric
field lie in plane MN, which is perpendicular to the axis OO'
of symmetry of the pole pieces. These lines are circles with
their centre at point K. At all points of any circle the
field strength vector $\mathbf{E}$ has a constant numerical value and is
tangent to the circle. Electrons in the accelerating chamber travel
along circular paths, and after repeated travel along the stable
circular orbit an electron is accelerated to considerable energy.

Fig. 27.1

27.2.4 The strength $E$ of the rotational electric field in a
betatron is equal in magnitude to

$$E = \frac{r}{2}\,\frac{d\langle B\rangle}{dt},$$

where $\langle B\rangle$ is the average value of magnetic induction at the
instant of time $t$ within the limits of the circular electron orbit
of radius $r$.

A condition for a stable orbit of the electron in the betatron is
$B = \langle B\rangle / 2$, where $B$ is the instantaneous value of the magnetic
induction on the orbit.

The orbit of an electron in the betatron is stable if:

(a) The whole orbit lies in a single plane. This condition, called
axial focusing, is achieved due to the special shape of the pole
pieces of the electromagnet. This shape provides for gradual
weakening of the magnetic field induction in the direction
from the centre of the orbit to its periphery.

(b) Provision is made to return electrons that have accidentally
left the stable orbit back to it again. This is called radial
focusing and is achieved by a space distribution of the magnetic
field in which the magnetic induction decreases more slowly
from the axis to the periphery than $1/r$, where $r$ is the distance
from a given point in the field to the axis of symmetry OO'.
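The betatron relations can be tried out numerically. The sketch below
takes the magnitude of the rotational field as
$E = (r/2)\,d\langle B\rangle/dt$ in SI units, which follows from
applying Faraday's law to the circular orbit, and uses the stable-orbit
condition $B = \langle B\rangle/2$. The orbit radius and the rate of
change of the field are assumed example values.

```python
import math


def betatron_field_strength(radius: float, d_b_avg_dt: float) -> float:
    """Magnitude of the rotational field, E = (r / 2) * d<B>/dt, in V/m."""
    return 0.5 * radius * d_b_avg_dt


def energy_gain_per_turn_ev(radius: float, d_b_avg_dt: float) -> float:
    """Energy gained by an electron in one revolution, in electronvolts.

    The work of the field over one turn is e * E * 2 * pi * r, which in eV
    is numerically equal to the EMF around the orbit in volts.
    """
    return betatron_field_strength(radius, d_b_avg_dt) * 2.0 * math.pi * radius


def orbit_induction(b_avg: float) -> float:
    """Induction required on the stable orbit, B = <B> / 2."""
    return 0.5 * b_avg


# Assumed example: orbit radius 0.5 m, average induction rising at 100 T/s.
e_field = betatron_field_strength(0.5, 100.0)
gain = energy_gain_per_turn_ev(0.5, 100.0)
print(f"E = {e_field:.1f} V/m, gain per turn = {gain:.2f} eV")
```

The gain per revolution is small, so the considerable final energy
comes from the very large number of revolutions made during the rise
of the field.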


27.3  Displacement  Current.  Maxwell's  Second 
Equation 

27.3.1  Maxwell  generalized  the  total  current  law,  (26.4.2)  and 
(26.4.4),  by  assuming  that  a  variable  electric  field,  in  the  same 
way  as  an  electric  current,  is  a  source  of  a  magnetic  field.  The 
quantitative  measure  of  the  magnetic  effect  of  a  variable  electric 
field  is  the  displacement  current. 

27.3.2 The displacement current density (20.2.3) is

$$\mathbf{j}_{dis} = \frac{\partial\mathbf{D}}{\partial t} \quad \text{(in SI units)},$$

$$\mathbf{j}_{dis} = \frac{1}{4\pi}\,\frac{\partial\mathbf{D}}{\partial t} \quad \text{(in Gaussian units)}.$$

The displacement current through the arbitrary surface $S$ is
a physical quantity numerically equal to the flux of the
displacement current density vector through this surface.
Thus

$$I_{dis} = \int_S \frac{\partial\mathbf{D}}{\partial t}\cdot d\mathbf{S} = \frac{d\Phi_e}{dt} \quad \text{(in SI units)},$$

$$I_{dis} = \frac{1}{4\pi}\int_S \frac{\partial\mathbf{D}}{\partial t}\cdot d\mathbf{S} = \frac{1}{4\pi}\,\frac{d\Phi_e}{dt} \quad \text{(in Gaussian units)},$$

where $\Phi_e = \int_S \mathbf{D}\cdot d\mathbf{S}$ is the flux of the electric displacement
vector through the surface $S$ (15.3.2).

Taking the displacement currents into account, any variable
current has a closed circuit. Displacement currents "flow" through
a part of a circuit where there is no conductor, for example,
between the plates of a capacitor being charged or discharged.
Shown in Fig. 27.2 are the vectors $\mathbf{j}_{dis}$ and the lines of induction
of the magnetic fields of displacement currents in charging
(Fig. 27.2a) and in discharging (Fig. 27.2b) a capacitor.
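The statement that the displacement current closes the circuit can be
checked for a charging parallel-plate capacitor. The sketch assumes an
ideal capacitor in vacuum with a uniform field between the plates (edge
effects neglected), charged through a resistor from a constant source;
all component values are hypothetical. Between the plates $D$ equals
the surface charge density $q/S$, so $I_{dis} = S\,\partial D/\partial t$
should equal the conduction current in the wires.

```python
import math

VOLTAGE = 10.0         # source voltage, V (assumed)
RESISTANCE = 1.0e3     # series resistance, ohm (assumed)
CAPACITANCE = 1.0e-9   # capacitance, F (assumed)
PLATE_AREA = 1.0e-2    # plate area, m^2 (assumed)
TAU = RESISTANCE * CAPACITANCE   # time constant of the RC circuit, s


def charge(t: float) -> float:
    """Charge on the plates while charging through the resistor, C."""
    return CAPACITANCE * VOLTAGE * (1.0 - math.exp(-t / TAU))


def conduction_current(t: float) -> float:
    """Conduction current in the connecting wires, A."""
    return VOLTAGE / RESISTANCE * math.exp(-t / TAU)


def displacement(t: float) -> float:
    """Electric displacement between the plates, D = q / S, in C/m^2."""
    return charge(t) / PLATE_AREA


def displacement_current(t: float, dt: float = 1.0e-10) -> float:
    """I_dis = S * dD/dt, with dD/dt taken by a central difference."""
    d_d_dt = (displacement(t + dt) - displacement(t - dt)) / (2.0 * dt)
    return d_d_dt * PLATE_AREA


t_sample = TAU
i_cond = conduction_current(t_sample)
i_dis = displacement_current(t_sample)
print(i_cond, i_dis)
```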

27.3.3 According to Sect. 18.3.4, the displacement vector in
any dielectric is equal to

$$\mathbf{D} = \varepsilon_0\mathbf{E} + \mathbf{P}_e \quad \text{(in SI units)},$$

$$\mathbf{D} = \mathbf{E} + 4\pi\mathbf{P}_e \quad \text{(in cgse units)},$$

where $\mathbf{P}_e$ is the polarization vector (18.2.3).

The displacement current density in a dielectric is

$$\mathbf{j}_{dis} = \varepsilon_0\,\frac{\partial\mathbf{E}}{\partial t} + \frac{\partial\mathbf{P}_e}{\partial t} \quad \text{(in SI units)},$$

$$\mathbf{j}_{dis} = \frac{1}{4\pi}\,\frac{\partial\mathbf{E}}{\partial t} + \frac{\partial\mathbf{P}_e}{\partial t} \quad \text{(in cgse units)}.$$

In these last formulas, the first term $\varepsilon_0\,\partial\mathbf{E}/\partial t$ (or,
correspondingly, $\frac{1}{4\pi}\,\partial\mathbf{E}/\partial t$) is called the displacement
current density in vacuum, whereas the second term $\partial\mathbf{P}_e/\partial t$ is
called the polarization current density. The second term is the
density of the current due to the ordered motion of charges in the
dielectric: either the displacement of charges in a molecule of a
nonpolar dielectric (18.1.3) or the turning of the dipoles in polar
dielectrics (18.1.5). The displacement current does not evolve
Joule heat (21.2.6) in vacuum or in metals. In this it differs from
conduction currents.

Fig. 27.2

27.3.4 Maxwell added the displacement current to the right-hand
side of the total current law in the form given in Sect.
26.4.4, writing this law in the form

$$\oint_L \mathbf{H}\cdot d\mathbf{l} = I_{mac} + I_{dis} \quad \text{(in SI units)},$$

$$\oint_L \mathbf{H}\cdot d\mathbf{l} = \frac{4\pi}{c}\,(I_{mac} + I_{dis}) \quad \text{(in Gaussian units)}.$$

This equation is called Maxwell's second equation in integral
form. It shows that the circulation of the magnetic field strength
vector along an arbitrary closed circuit, or loop, $L$ is equal (or
proportional) to the algebraic sum of the macrocurrents and the
displacement current through a surface bounded by this loop.

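27.3.5 Making use of the Stokes theorem from vector analysis

$$\oint_L \mathbf{H}\cdot d\mathbf{l} = \int_S \operatorname{curl}\mathbf{H}\cdot d\mathbf{S},$$

where $d\mathbf{S} = \mathbf{n}\,dS$, and $\mathbf{n}$ is a unit vector of the normal to the
element $dS$ of surface, and of the expression for the total current

$$I = I_{mac} + I_{dis} = \int_S (\mathbf{j} + \mathbf{j}_{dis})\cdot d\mathbf{S},$$

Maxwell's second equation can be written in differential form:

$$\operatorname{curl}\mathbf{H} = \mathbf{j} + \frac{\partial\mathbf{D}}{\partial t} \quad \text{(in SI units)},$$

$$\operatorname{curl}\mathbf{H} = \frac{4\pi}{c}\,\mathbf{j} + \frac{1}{c}\,\frac{\partial\mathbf{D}}{\partial t} \quad \text{(in Gaussian units)}.$$

In these equations, curl $\mathbf{H}$ has the same meaning as curl $\mathbf{E}$ in
Sect. 27.2.2.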

27.3.6 In the absence of conduction currents ($\mathbf{j} = 0$), Maxwell's
first and second equations are of symmetrical form except for
the sign in their right-hand sides:

$$\operatorname{curl}\mathbf{E} = -\frac{\partial\mathbf{B}}{\partial t}, \qquad \operatorname{curl}\mathbf{H} = \frac{\partial\mathbf{D}}{\partial t} \quad \text{(in SI units)},$$

$$\operatorname{curl}\mathbf{E} = -\frac{1}{c}\,\frac{\partial\mathbf{B}}{\partial t}, \qquad \operatorname{curl}\mathbf{H} = \frac{1}{c}\,\frac{\partial\mathbf{D}}{\partial t} \quad \text{(in Gaussian units)}.$$

Comparing Maxwell's equations*, we reach the following
conclusions:

(a) Electric and magnetic fields are interrelated: a change of
an electric field with time sets up a magnetic field**. In its
turn, a variable magnetic field is the source of a rotational
electric field.

(b) The different signs at the right-hand sides of Maxwell's
equations comply with the law of conservation of energy and
with the Lenz law (25.1.4). If the signs of $\partial\mathbf{B}/\partial t$ and $\partial\mathbf{D}/\partial t$
were the same, an infinitesimal increase in one of the fields would
lead to an increase without limit of both fields, whereas an
infinitesimal decrease in one of the fields would lead to the
disappearance of both fields. The above-mentioned difference in
the signs of the right-hand sides of Maxwell's equations is a
necessary condition for the existence of a stable electromagnetic
field.

The different signs in the right-hand sides of Maxwell's
equations correspond to the fact that the directions of $\partial\mathbf{D}/\partial t$
and $\mathbf{H}$ form a "right-hand screw" system (Fig. 27.3a), whereas the
directions of $\partial\mathbf{B}/\partial t$ and $\mathbf{E}$ form a "left-hand screw" system
(Fig. 27.3b).

Fig. 27.3

* The numbering of Maxwell's equations is a matter of convention
and is frequently the opposite of that adopted in this handbook.

** The magnetic field is always a rotational one (23.5.3).


27.4 Complete Set of Maxwell's Equations for an
Electromagnetic Field

27.4.1 A complete set of Maxwell's equations for an
electromagnetic field includes, in addition to the equations dealt
with in Sects. 27.2.1, 27.2.2, 27.3.4 and 27.3.5, the
Ostrogradsky-Gauss theorem for an electric field (18.3.3):

$$\oint_S \mathbf{D}\cdot d\mathbf{S} = q_{free} \quad \text{(in SI units)},$$

$$\oint_S \mathbf{D}\cdot d\mathbf{S} = 4\pi q_{free} \quad \text{(in cgse units)}$$

and the same theorem for a magnetic field (23.5.6):

$$\oint_S \mathbf{B}\cdot d\mathbf{S} = 0.$$

Maxwell proposed that the theorem for the flux of the electric
displacement vector is valid, not only for a steady-state
electrostatic field, but for a variable electric field as well.
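
27.4.2 Making use of the Gauss theorem from vector analysis,

$$\oint_S \mathbf{A}\cdot d\mathbf{S} = \int_V \operatorname{div}\mathbf{A}\,dV,$$

and introducing the volume density $\rho$ of free charges, $\rho = dq_{\text{free}}/dV$ (where $dV$ is an element of volume), it is possible to obtain Maxwell's third equation in differential form:

$$\operatorname{div}\mathbf{D} = \rho \quad \text{(in SI units)},$$

$$\operatorname{div}\mathbf{D} = 4\pi\rho \quad \text{(in Gaussian units)}.$$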

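In these equations, $\operatorname{div}\mathbf{A}$ (where $\mathbf{A}$ is an arbitrary vector) is defined in Cartesian coordinates as follows:

$$\operatorname{div}\mathbf{A} = \frac{\partial A_x}{\partial x} + \frac{\partial A_y}{\partial y} + \frac{\partial A_z}{\partial z},$$

where $\mathbf{A} = A_x\mathbf{i} + A_y\mathbf{j} + A_z\mathbf{k}$, and $\mathbf{i}$, $\mathbf{j}$ and $\mathbf{k}$ are unit vectors along the coordinate axes.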
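
As a rough numerical illustration of the Cartesian definition of div A, the following Python sketch approximates the three partial derivatives by central differences. The function names `divergence`, `radial_field` and `swirl_field`, the step `h` and the sample fields are assumptions made for this example; they do not come from the handbook.

```python
def divergence(field, point, h=1e-5):
    """Approximate div A at `point` by central differences.

    `field` maps (x, y, z) to a tuple (Ax, Ay, Az).
    """
    total = 0.0
    for axis in range(3):
        forward = list(point)
        backward = list(point)
        forward[axis] += h
        backward[axis] -= h
        # Partial derivative of the axis-th component along the axis-th coordinate
        total += (field(*forward)[axis] - field(*backward)[axis]) / (2.0 * h)
    return total


def radial_field(x, y, z):
    # A = x i + y j + z k, whose divergence equals 3 at every point
    return (x, y, z)


def swirl_field(x, y, z):
    # A = -y i + x j, a purely rotational field with zero divergence
    return (-y, x, 0.0)


print(divergence(radial_field, (0.3, -1.2, 2.0)))
print(divergence(swirl_field, (0.3, -1.2, 2.0)))
```

The second sample field has closed field lines and zero divergence, which is the property expressed for the magnetic field by div B = 0.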

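27.4.3 The complete set of Maxwell's equations includes four equations:

$$\operatorname{curl}\mathbf{E} = -\frac{\partial\mathbf{B}}{\partial t}, \qquad \operatorname{div}\mathbf{D} = \rho,$$

$$\operatorname{curl}\mathbf{H} = \mathbf{j} + \frac{\partial\mathbf{D}}{\partial t}, \qquad \operatorname{div}\mathbf{B} = 0 \quad \text{(in SI units)},$$

$$\operatorname{curl}\mathbf{E} = -\frac{1}{c}\frac{\partial\mathbf{B}}{\partial t}, \qquad \operatorname{div}\mathbf{D} = 4\pi\rho,$$

$$\operatorname{curl}\mathbf{H} = \frac{4\pi}{c}\mathbf{j} + \frac{1}{c}\frac{\partial\mathbf{D}}{\partial t}, \qquad \operatorname{div}\mathbf{B} = 0 \quad \text{(in Gaussian units)}.$$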

27.4.4 The system of Maxwell’s equations is supplemented by equations that specify the electric and magnetic properties of the medium. For macrocurrents that comply with Ohm’s law (20.3.4) in an isotropic medium, these equations are of the form

$$\mathbf{D} = \varepsilon_0\varepsilon_r\mathbf{E}, \quad \mathbf{B} = \mu_0\mu_r\mathbf{H}, \quad \mathbf{j}_{\text{mac}} = \kappa\mathbf{E} \quad \text{(in SI units)},$$

$$\mathbf{D} = \varepsilon_r\mathbf{E}, \quad \mathbf{B} = \mu_r\mathbf{H}, \quad \mathbf{j}_{\text{mac}} = \kappa\mathbf{E} \quad \text{(in Gaussian units)}.$$

Here $\varepsilon_0$ and $\mu_0$ are the electric constant (14.2.7) and the magnetic constant (23.2.2) in SI units, $\varepsilon_r$ and $\mu_r$ are the relative dielectric constant and relative magnetic permeability, and $\kappa$ is the electrical conductivity.

To solve a system of Maxwell’s equations, it is also necessary to specify the boundary conditions for the vectors characterizing the electromagnetic field. Thus

$$E_{\tau 1} = E_{\tau 2}, \quad B_{n1} = B_{n2}, \quad H_{\tau 1} - H_{\tau 2} = j_{\text{sur}}, \quad D_{n1} - D_{n2} = \sigma \quad \text{(in SI units)},$$

$$E_{\tau 1} = E_{\tau 2}, \quad B_{n1} = B_{n2}, \quad H_{\tau 1} - H_{\tau 2} = \frac{4\pi}{c}j_{\text{sur}}, \quad D_{n1} - D_{n2} = 4\pi\sigma \quad \text{(in Gaussian units)},$$

where $\sigma$ is the surface density of free charges, $\mathbf{n}$ is a unit vector normal to the interface (boundary) between the media and directed from medium 2 to medium 1, $\boldsymbol{\tau}$ is a unit vector tangent to the interface, and $j_{\text{sur}}$ is the projection of the density vector of surface conduction currents on the direction $[\boldsymbol{\tau}\mathbf{n}]$.
For given boundary and initial conditions, i.e. known values of vectors $\mathbf{E}$ and $\mathbf{H}$ at the initial instant of time ($t = 0$), the system of Maxwell’s equations has a unique solution.

27.4.5 Maxwell’s equations are invariant with respect to the Lorentz transformations (5.3.2).

In the special theory of relativity (5.1.1) it is shown that one and the same electromagnetic field can be different in different inertial frames of reference (2.1.2). In particular, one of the fields, either electric or magnetic, may be absent in one of the coordinate systems and be present in the other. The following formulas are used for the Lorentz transformations of the projections on the coordinate axes of the vectors $\mathbf{E}$, $\mathbf{H}$, $\mathbf{D}$ and $\mathbf{B}$ of the electric and magnetic fields, in going over from a fixed inertial frame of reference $K$ to a frame $K'$, which travels with respect to frame $K$ at the constant velocity $V$ in a straight line along the $OX$ axis:

$$E_x = E'_x, \quad E_y = \frac{E'_y + VB'_z}{\sqrt{1 - V^2/c^2}}, \quad E_z = \frac{E'_z - VB'_y}{\sqrt{1 - V^2/c^2}},$$

$$H_x = H'_x, \quad H_y = \frac{H'_y - VD'_z}{\sqrt{1 - V^2/c^2}}, \quad H_z = \frac{H'_z + VD'_y}{\sqrt{1 - V^2/c^2}},$$

$$D_x = D'_x, \quad D_y = \frac{D'_y + \dfrac{V}{c^2}H'_z}{\sqrt{1 - V^2/c^2}}, \quad D_z = \frac{D'_z - \dfrac{V}{c^2}H'_y}{\sqrt{1 - V^2/c^2}},$$

$$B_x = B'_x, \quad B_y = \frac{B'_y - \dfrac{V}{c^2}E'_z}{\sqrt{1 - V^2/c^2}}, \quad B_z = \frac{B'_z + \dfrac{V}{c^2}E'_y}{\sqrt{1 - V^2/c^2}} \quad \text{(in SI units)},$$

$$E_x = E'_x, \quad E_y = \frac{E'_y + \dfrac{V}{c}H'_z}{\sqrt{1 - V^2/c^2}}, \quad E_z = \frac{E'_z - \dfrac{V}{c}H'_y}{\sqrt{1 - V^2/c^2}},$$

$$H_x = H'_x, \quad H_y = \frac{H'_y - \dfrac{V}{c}E'_z}{\sqrt{1 - V^2/c^2}}, \quad H_z = \frac{H'_z + \dfrac{V}{c}E'_y}{\sqrt{1 - V^2/c^2}} \quad \text{(in Gaussian units)}.$$

27.4.6 The classical Lorentz theory of the electron is a further development of Maxwell’s theory of the electromagnetic field. The Lorentz theory is based on a definite model that implies certain conceptions of the structure of matter. It is assumed that atoms consist of negatively and positively charged particles, and that the great variety of electrical and magnetic phenomena can be explained by a definite spatial arrangement, motion and interaction of charges and microcurrents. At each point in space certain electric and magnetic microfields exist with the strengths $\mathbf{e}$ and $\mathbf{h}$. These fields result from the combined action of all the charges and microcurrents. The microfields comply with a system of equations similar to Maxwell’s equations (27.4.3). Maxwell’s equations for the macroscopic fields $\mathbf{E}$ and $\mathbf{B}$ (27.1.3) can be obtained by averaging the equations of the electron theory (27.1.4), and the macroscopic fields are found to be the averaged microfields $\mathbf{e}$ and $\mathbf{h}$. Thus

$$\mathbf{E} = \langle\mathbf{e}\rangle, \quad \mathbf{B} = \mu_0\langle\mathbf{h}\rangle \quad \text{(in SI units)}.$$

Vectors $\mathbf{D}$ and $\mathbf{H}$ are found to be related to $\langle\mathbf{e}\rangle$ and $\langle\mathbf{h}\rangle$ by the polarization vector $\mathbf{P}_e$ (18.2.3) and the magnetization vector $\mathbf{M}$ (26.3.1) as indicated in Sects. 18.3.4 and 26.4.4. Thus

$$\mathbf{D} = \varepsilon_0\mathbf{E} + \mathbf{P}_e, \quad \mathbf{H} = \frac{\mathbf{B}}{\mu_0} - \mathbf{M} \quad \text{(in SI units)},$$

$$\mathbf{D} = \mathbf{E} + 4\pi\mathbf{P}_e, \quad \mathbf{H} = \mathbf{B} - 4\pi\mathbf{M} \quad \text{(in Gaussian units)}.$$


PART  FOUR  OSCILLATIONS 
AND  WAVES 


CHAPTER  28  FREE  HARMONIC  OSCILLATIONS 


28.1  Harmonic  Oscillations 

28.1.1 Oscillations, including vibrations and variations, are processes (motions or changes of state) that are repeated more or less regularly with time. According to the physical nature of the oscillatory process and the “mechanism” by which it is initiated, distinction is made between mechanical oscillations, commonly called vibrations (swinging of pendulums; vibration of strings; reciprocation and oscillation of machine components; vibration of buildings, bridges and other structures; compression and rarefaction of air as sound is propagated through it; rolling, pitching and heaving of ships; sea waves; etc.), electromagnetic oscillations (oscillations of alternating electric current in a circuit, variations in the field strength and magnetic induction vectors, E and B, of an alternating electromagnetic field, etc.), electromechanical vibrations (vibration of the diaphragm of a telephone receiver, of the diaphragm in an electromagnetic loudspeaker, etc.) and others.

The system executing the oscillation (or vibration) is called the oscillatory (vibratory) system. Free (natural) oscillations are ones that occur in the absence of externally applied variable action on the system. They are due to some initial deviation of the system from a state of stable equilibrium. Forced oscillations (vibrations) are ones set up in a system as a result of variable external effects (for instance, current oscillation in an electric circuit caused by a varying emf, swinging of a pendulum due to varying external forces, etc.).

28.1.2 Oscillations (vibrations) are said to be periodic if all the physical quantities that characterize the oscillatory (vibratory) system and vary during its oscillation are repeated in successive equal time intervals. The minimum time interval $T$ complying with this condition is called the period of oscillation (or vibration). This is the time required for one complete oscillation (vibration) of the system. The frequency of periodic oscillations (vibrations) is the quantity $\nu = 1/T$, equal to the number of complete oscillations (vibrations) executed by the system in unit time. The angular, or circular, frequency is the quantity $\omega = 2\pi\nu = 2\pi/T$, which is equal to the number of complete oscillations accomplished in $2\pi$ units of time.

28.1.3 In periodic oscillations the dependence of a varying quantity $s$ on the time $t$ complies with the condition $s(t + T) = s(t)$.

The periodic oscillation of the quantity $s(t)$ is said to be harmonic if

$$s(t) = A\sin(\omega t + \varphi_0) \quad \text{or} \quad s(t) = A\cos(\omega t + \varphi_1),$$

where $\omega = 2\pi\nu = 2\pi/T = \text{const}$ is the angular, or circular, frequency of harmonic vibration, $A = s_{\max} = \text{const} > 0$ is the maximum value of the varying quantity $s$ and is called the amplitude of oscillation, and $\varphi_0$ and $\varphi_1 = \varphi_0 - \pi/2$ are constant quantities. The value of $s$ at some arbitrary instant of time $t$ is determined by the phase of the oscillation $\Phi(t) = \omega t + \varphi_0$ [and, respectively, $\Phi_1(t) = \omega t + \varphi_1$]. The quantities $\varphi_0$ and $\varphi_1$ are the initial phases of the oscillation, i.e. the values of $\Phi(t)$ and $\Phi_1(t)$ at the instant ($t = 0$) taken as the time reference: $\varphi_0 = \Phi(0)$ and $\varphi_1 = \Phi_1(0)$.

28.1.4 The first and second time derivatives of a harmonically varying quantity $s(t)$ also vary harmonically at the same angular frequency:

$$\frac{ds}{dt} = A\omega\cos(\omega t + \varphi_0) = A\omega\sin\left(\omega t + \varphi_0 + \frac{\pi}{2}\right),$$

$$\frac{d^2s}{dt^2} = -A\omega^2\sin(\omega t + \varphi_0) = A\omega^2\sin(\omega t + \varphi_0 + \pi),$$

where the amplitudes of $ds/dt$ and $d^2s/dt^2$ are equal, respectively, to $A\omega$ and $A\omega^2$. The initial phase of $ds/dt$ is equal to $(\varphi_0 + \pi/2)$, i.e. the phase difference of the variations of $ds/dt$ and $s$ is constant and equal to $\pi/2$ (the quantity $ds/dt$ has a phase lead of $\pi/2$ with respect to $s$). The initial phase of $d^2s/dt^2$ is equal to $(\varphi_0 + \pi)$, i.e. the phase difference of the variations of $d^2s/dt^2$ and $s$ is constant and equal to $\pi$ (the quantity $d^2s/dt^2$ has a phase lead of $\pi$ with respect to $s$). Curves of the dependence on the time $t$ of the quantities $s$, $ds/dt$ and $d^2s/dt^2$ for harmonic variations with $\varphi_0 = 0$ are shown in Fig. 28.1.


28.1.5 It is evident from the second equation in Sect. 28.1.4 that a harmonically varying quantity $s$ satisfies the differential equation

$$\frac{d^2s}{dt^2} + \omega^2s = 0.$$

The general solution of this equation is of the form

$$s = A_1\sin\omega t + A_2\cos\omega t,$$

where $A_1$ and $A_2$ are arbitrary constants of integration. The values of $A_1$ and $A_2$ can be found from the initial conditions, i.e. when the values of $s$ and $ds/dt$ are known for the initial instant of time ($t = 0$). Thus

$$A_1 = \frac{1}{\omega}\left(\frac{ds}{dt}\right)_{t=0} \quad \text{and} \quad A_2 = s(0).$$

The general solution can be transformed to the standard form for harmonic oscillations (vibrations) given in Sect. 28.1.3:

$$s = A\sin(\omega t + \varphi_0),$$

where $A = \sqrt{A_1^2 + A_2^2}$ and $\varphi_0 = \arctan(A_2/A_1)$.

Thus, the quantity $s$ has harmonic variations if and only if it satisfies the differential equation given at the beginning of this subsection. This equation is called the differential equation of harmonic variations (or vibrations or oscillations).
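
The step from initial conditions to amplitude and initial phase can be sketched in Python. Substituting t = 0 into the general solution and its derivative gives A2 = s(0) and A1 = (ds/dt)(0)/ω. The function name `harmonic_from_initial` and the sample numbers are assumptions of this example. `atan2` is used in place of a plain arc tangent so that the quadrant of the initial phase is chosen correctly when A1 is negative or zero.

```python
import math


def harmonic_from_initial(s0, v0, omega):
    """Return (A1, A2, A, phi0) for s'' + omega**2 * s = 0.

    s0 is s at t = 0 and v0 is ds/dt at t = 0.
    """
    a1 = v0 / omega              # coefficient of sin(omega * t)
    a2 = s0                      # coefficient of cos(omega * t)
    amplitude = math.hypot(a1, a2)
    phi0 = math.atan2(a2, a1)    # quadrant-aware form of arctan(A2 / A1)
    return a1, a2, amplitude, phi0


a1, a2, amplitude, phi0 = harmonic_from_initial(s0=0.03, v0=0.4, omega=10.0)
print(f"A1 = {a1:.3f}, A2 = {a2:.3f}, A = {amplitude:.3f}, phi0 = {phi0:.4f} rad")
```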

28.1.6 Harmonic oscillations (vibrations) can be represented graphically by means of a vector in a plane. For this purpose vector $\mathbf{A}$ is drawn from the origin of coordinates $O$ in the plane $XY$ (Fig. 28.2). The magnitude of vector $\mathbf{A}$ is equal to the amplitude $A$ of the oscillation being considered, and the vector makes the angle $\varphi = \omega t + \varphi_0$, equal to the phase of the oscillation at the given instant of time $t$, with the coordinate axis $OX$. In the course of time angle $\varphi$ increases so that vector $\mathbf{A}$ rotates about point $O$ at a constant angular velocity equal to the angular frequency of oscillation (vibration) $\omega$. The corresponding projection of vector $\mathbf{A}$ on the vertical axis $OY$ varies harmonically according to the relation

$$A_y = s = A\sin(\omega t + \varphi_0).$$

Fig. 28.2

The graphical representation of harmonic vibrations (or oscillations or variations) by means of a rotating amplitude vector is called the vector diagram method. It is extensively applied, for instance, in adding together harmonic oscillations having the same direction (28.4.2).

28.1.7 According to the Euler formula for complex numbers,

$$e^{i\varphi} = \cos\varphi + i\sin\varphi,$$

where $i = \sqrt{-1}$. Hence the harmonic oscillation (vibration) $s = A\sin(\omega t + \varphi_0) = A\cos(\omega t + \varphi_1)$, where $\varphi_1 = \varphi_0 - \pi/2$, can be written in the exponential form

$$\tilde{s} = \tilde{A}e^{i\omega t} = Ae^{i(\omega t + \varphi_1)},$$

where $\tilde{A} = Ae^{i\varphi_1}$ is the complex amplitude. Only the real part of the complex function $\tilde{s}$, denoted by $\operatorname{Re}\{\tilde{s}\}$, has a physical meaning:

$$\operatorname{Re}\{\tilde{s}\} = s = A\cos(\omega t + \varphi_1) = A\sin(\omega t + \varphi_0),$$

where $\varphi_0 = \varphi_1 + \pi/2$.
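
The exponential form can be checked numerically with Python's built-in complex arithmetic. The sketch below is illustrative only; the function name `complex_oscillation` and the sample values are assumptions of this example.

```python
import cmath
import math


def complex_oscillation(amplitude, omega, phi1, t):
    """Return the complex value A * exp(i * (omega * t + phi1))."""
    complex_amplitude = amplitude * cmath.exp(1j * phi1)  # A * exp(i * phi1)
    return complex_amplitude * cmath.exp(1j * omega * t)


A, omega, phi1, t = 2.0, 3.0, 0.4, 1.5
z = complex_oscillation(A, omega, phi1, t)

# The real part is the physical oscillation in both of its equivalent forms
print(z.real)
print(A * math.cos(omega * t + phi1))
print(A * math.sin(omega * t + phi1 + math.pi / 2))
```

The modulus of the complex value stays equal to the amplitude A at all times, which is the complex counterpart of the rotating amplitude vector of Sect. 28.1.6.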


28.2  Mechanical  Harmonic  Vibrations 

28.2.1 If a particle vibrates harmonically in a straight line along the coordinate axis $OX$ about the equilibrium position, taken as the origin of coordinates, the dependence of the coordinate $x$ of the particle on the time $t$ is of the form given in Sect. 28.1.3 (with $s = x$). Thus

$$x = A\sin(\omega t + \varphi_0).$$

The projections of the velocity $\mathbf{v}$ and acceleration $\mathbf{a}$ of the particle on the $OX$ axis are:

$$v_x = v_0\cos(\omega t + \varphi_0) \quad \text{and} \quad a_x = -a_0\sin(\omega t + \varphi_0),$$

where $v_0 = A\omega$ is the amplitude of the velocity, and $a_0 = A\omega^2 = v_0\omega$ is the amplitude of the acceleration.

The force $\mathbf{F}$ acting on the particle equals

$$\mathbf{F} = m\mathbf{a} \quad \text{and} \quad F_x = -m\omega^2x,$$

where $m$ is the mass of the particle. Consequently, force $\mathbf{F}$ is proportional to the displacement of the particle from the equilibrium position and opposite in direction:

$$\mathbf{F} = -m\omega^2x\,\mathbf{i},$$

where $\mathbf{i}$ is a unit vector of the $OX$ axis.

Such a dependence of the force on the displacement is typical of an elastic force (3.3.6). Hence, forces of other physical origin that satisfy the same kind of relationship are said to be quasi-elastic forces.

28.2.2 The kinetic energy of a particle vibrating harmonically in a straight line is equal to

$$E_k = \frac{mv_x^2}{2} = \frac{mv_0^2}{2}\cos^2(\omega t + \varphi_0) = \frac{m\omega^2A^2}{2}\cos^2(\omega t + \varphi_0),$$

or

$$E_k = \frac{m\omega^2A^2}{4}[1 + \cos(2\omega t + 2\varphi_0)].$$

The kinetic energy of the particle varies periodically from 0 to $m\omega^2A^2/2$, oscillating harmonically with the angular frequency $2\omega$ and the amplitude $m\omega^2A^2/4$ about its average value of $m\omega^2A^2/4$.

The potential energy of a particle vibrating harmonically in a straight line due to the effect of a quasi-elastic force is equal to

$$E_p = -\int_0^x F_x\,dx = \frac{m\omega^2x^2}{2} = \frac{m\omega^2A^2}{2}\sin^2(\omega t + \varphi_0),$$

or

$$E_p = \frac{m\omega^2A^2}{4}[1 - \cos(2\omega t + 2\varphi_0)] = \frac{m\omega^2A^2}{4}[1 + \cos(2\omega t + 2\varphi_0 + \pi)].$$

The potential energy of the particle varies periodically from 0 to $m\omega^2A^2/2$, oscillating harmonically with the angular frequency $2\omega$ and the amplitude $m\omega^2A^2/4$ about its average value of $m\omega^2A^2/4$. The variations of the potential and kinetic energies have a phase difference of $\pi$, so that the total mechanical energy of the particle remains constant in vibration:

$$E = E_k + E_p = \frac{m\omega^2A^2}{2} = \text{const}.$$

Curves of the dependence of $E_k$, $E_p$ and $E$ on the time $t$ for the case when $\varphi_0 = 0$ are shown in Fig. 28.3.
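
The constancy of the total mechanical energy can be verified numerically. The following Python sketch evaluates the kinetic and potential energies at several instants; the function name `energies` and the sample parameters are assumptions made for this illustration.

```python
import math


def energies(m, omega, A, phi0, t):
    """Return (E_k, E_p) for a particle with x = A * sin(omega * t + phi0)."""
    x = A * math.sin(omega * t + phi0)
    v = A * omega * math.cos(omega * t + phi0)
    kinetic = 0.5 * m * v**2
    potential = 0.5 * m * omega**2 * x**2   # quasi-elastic force F_x = -m * omega**2 * x
    return kinetic, potential


m, omega, A, phi0 = 0.2, 5.0, 0.1, 0.3
for t in (0.0, 0.13, 0.7, 2.0):
    e_k, e_p = energies(m, omega, A, phi0, t)
    print(f"t = {t:4.2f}  E_k = {e_k:.6f}  E_p = {e_p:.6f}  E = {e_k + e_p:.6f}")
```

Each printed sum equals m ω² A² / 2 to within rounding, while the two terms exchange energy at the doubled angular frequency 2ω.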

28.2.3 Example 1. A linear harmonic oscillator is a particle of mass $m$ that vibrates harmonically along a straight line under the action of the elastic force $\mathbf{F}_{\text{el}} = -kx\,\mathbf{i}$ (3.3.6). An example of such a system is the spring pendulum, in which a weight of mass $m$ is suspended by an absolutely elastic spring ($k$ is a factor specifying the elastic properties of the spring). The equation of motion is

$$m\frac{d^2x}{dt^2} = -kx \quad \text{or} \quad \frac{d^2x}{dt^2} + \frac{k}{m}x = 0.$$

It follows from Sect. 28.1.5 that an oscillator (spring pendulum) has harmonic vibration according to the equation $x = A\sin(\omega t + \varphi_0)$ with an angular frequency $\omega$ and period $T$ equal to

$$\omega = \sqrt{\frac{k}{m}} \quad \text{and} \quad T = 2\pi\sqrt{\frac{m}{k}}.$$

The potential energy of a linear harmonic oscillator is

$$E_p = \frac{kx^2}{2}.$$


28.2.4 Example 2. A compound, or physical, pendulum is a rigid body capable of oscillating under the action of the gravity force $m\mathbf{g}$ about a horizontal axis $O$, which does not pass through the centre of gravity of the body, and is called the axis of oscillation, or swing (Fig. 28.4). The centre of gravity of the pendulum coincides with its centre of mass $C$ (7.3.2). Point $O$, the intersection of the axis of oscillation with a vertical plane passing through the centre of gravity of the pendulum and perpendicular to its axis of oscillation, is called its point of suspension.

In the absence of friction forces in the suspension, the equation of motion of the pendulum is of the form (4.3.4)

$$I\frac{d^2\alpha}{dt^2} = -mgd\sin\alpha,$$

where $\alpha$ is the angle of deviation of the pendulum about its axis of oscillation from the equilibrium position, $d = OC$ is the distance from the centre of mass of the pendulum to its axis of oscillation, $I$ is the moment of inertia (4.2.1) about the same axis, $m$ is the mass of the pendulum, and $g$ is the free fall acceleration. For small oscillations of the pendulum $\sin\alpha \approx \alpha$ and the equation of motion of the pendulum is of the form

$$\frac{d^2\alpha}{dt^2} + \frac{mgd}{I}\alpha = 0,$$

i.e. angle $\alpha$ satisfies the differential equation of harmonic vibrations (28.1.5). Hence, in the absence of friction, small oscillations of a compound pendulum are harmonic:

$$\alpha = \alpha_0\sin(\omega t + \varphi_0),$$

where $\alpha_0$ is the amplitude of variation of angle $\alpha$, and

$$\omega = \sqrt{\frac{mgd}{I}} \quad \text{and} \quad T = 2\pi\sqrt{\frac{I}{mgd}}$$

are the angular frequency and period of small oscillations of a compound pendulum.

Fig. 28.4

28.2.5 Example 3. A simple pendulum is a particle suspended by a weightless, inextensible thread and oscillating in a vertical plane under the action of the gravity force. The simple pendulum is the limiting case of the compound (physical) pendulum whose whole mass is concentrated at the centre of mass, so that $d = l$, the length of the simple pendulum. The moment of inertia of a simple pendulum with respect to its axis of oscillation is $I = ml^2$. Correspondingly, the angular frequency and period of small oscillations of this pendulum are

$$\omega = \sqrt{\frac{g}{l}} \quad \text{and} \quad T = 2\pi\sqrt{\frac{l}{g}}.$$

Small oscillations of compound and simple pendulums are examples of isochronous vibrations, i.e. vibrations whose frequency and period are independent of their amplitude.

In the general case, the period of a compound pendulum depends upon its amplitude $\alpha_0$. Thus

$$T = 2\pi\sqrt{\frac{I}{mgd}}\left[1 + \left(\frac{1}{2}\right)^2\sin^2\frac{\alpha_0}{2} + \left(\frac{1\cdot 3}{2\cdot 4}\right)^2\sin^4\frac{\alpha_0}{2} + \dots\right].$$

The variation in the value of $T$ when $\alpha_0$ is increased to 15° does not exceed 0.5 per cent.
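
The statement about the 15° amplitude can be checked with a short Python sketch that sums the standard series for the period in powers of sin²(α0/2), with coefficients (1/2)², (1·3/(2·4))² and so on. The function name `period_ratio` and the number of retained terms are assumptions of this example.

```python
import math


def period_ratio(alpha0, terms=10):
    """Return T(alpha0) / T(small amplitude) from the series in sin^2(alpha0 / 2).

    alpha0 is the angular amplitude in radians.
    """
    k2 = math.sin(alpha0 / 2.0) ** 2
    total = 1.0
    coeff = 1.0
    power = 1.0
    for n in range(1, terms + 1):
        coeff *= (2 * n - 1) / (2 * n)   # builds 1*3*...*(2n-1) / (2*4*...*2n)
        power *= k2                      # sin^(2n)(alpha0 / 2)
        total += coeff**2 * power
    return total


for degrees in (5.0, 15.0, 30.0):
    excess = period_ratio(math.radians(degrees)) - 1.0
    print(f"alpha0 = {degrees:4.1f} deg: period longer by {100.0 * excess:.3f} per cent")
```

For an amplitude of 15° the correction comes out between 0.4 and 0.5 per cent, in agreement with the estimate quoted above.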

28.2.6 The equivalent length $l_{\text{eq}}$ of a compound pendulum is the length of a simple pendulum that has the same period of oscillation. Thus

$$l_{\text{eq}} = \frac{I}{md} = d + \frac{I_C}{md} > d,$$

where $I_C$ is the moment of inertia of the compound pendulum with respect to an axis passing through the centre of mass $C$ of the pendulum and parallel to its axis of oscillation. Point $O_1$, lying on the straight line $OC$ at the distance $l_{\text{eq}}$ from the point of suspension $O$ (Fig. 28.4), is called the centre of oscillation of the compound pendulum. The centre of oscillation $O_1$ and the point of suspension $O$ are interchangeable: if the pendulum is suspended so that its axis of oscillation passes through point $O_1$, then point $O$ will coincide with the new position of the centre of oscillation, i.e. the equivalent length and period of oscillation of the pendulum remain unchanged.
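
The interchangeability of the point of suspension and the centre of oscillation can be illustrated with a uniform thin rod, an example chosen here and not taken from the handbook. The sketch assumes the rod's central moment of inertia m L²/12; the function names `equivalent_length` and `small_oscillation_period` are likewise hypothetical.

```python
import math


def equivalent_length(i_c, m, d):
    """Equivalent length d + I_C / (m d) of a compound pendulum."""
    return d + i_c / (m * d)


def small_oscillation_period(l_eq, g=9.81):
    """Period of the simple pendulum of length l_eq (small oscillations)."""
    return 2.0 * math.pi * math.sqrt(l_eq / g)


m, rod_length = 2.0, 1.2                 # kg, m
i_c = m * rod_length**2 / 12.0           # uniform rod about its centre of mass

d = rod_length / 2.0                     # suspended at one end
l_eq = equivalent_length(i_c, m, d)

d_swapped = l_eq - d                     # suspended at the former centre of oscillation
l_eq_swapped = equivalent_length(i_c, m, d_swapped)

print(l_eq, l_eq_swapped)
print(small_oscillation_period(l_eq), small_oscillation_period(l_eq_swapped))
```

Both suspensions give the same equivalent length (two thirds of the rod length for this example) and therefore the same period.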

28.2.7 Example 4. Small free vibrations of electrons in a plasma (22.6.1) are called Langmuir oscillations of the plasma. They are caused by the forces of an electric field, which is set up in an electrically neutral plasma as the result of some random deviation of the spatial distribution of the electrons from their equilibrium distribution.

Assume that in a flat layer of plasma of thickness $l$ (Fig. 28.5) the electrons are displaced over the small distance $s$ along the positive direction of axis $OX$. This leads to the formation of a surplus positive charge at the left-hand part of the layer and a surplus negative charge at the right-hand part. Consequently, an electric field is set up, whose strength $\mathbf{E}$ is along the $OX$ axis and whose projection on this axis is $E_x = n_0es/\varepsilon_0$, where $n_0$ is the electron concentration in the plasma, $e$ is the magnitude of the charge of the electron, and $\varepsilon_0$ is the electric constant. According to Newton’s second law, the equation of motion for the electrons of the plasma in this electric field is of the form:

$$m\frac{d^2s}{dt^2} = -eE_x = -\frac{n_0e^2}{\varepsilon_0}s \quad \text{or} \quad \frac{d^2s}{dt^2} + \frac{e^2n_0}{m\varepsilon_0}s = 0,$$

where $m$ is the mass of the electron. Thus, the electrons of the plasma have free harmonic vibrations at the angular frequency

$$\omega = \sqrt{\frac{e^2n_0}{m\varepsilon_0}},$$

which is called the Langmuir plasma frequency, or simply, the plasma frequency.

Fig. 28.5
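
Comparing the equation of motion of the electrons with the differential equation of harmonic vibrations (28.1.5) gives ω² = e²n0/(mε0). The Python sketch below evaluates this for a sample electron concentration. The function name `plasma_frequency`, the chosen concentration and the rounded values of the constants are assumptions made for illustration.

```python
import math

ELEMENTARY_CHARGE = 1.602176634e-19    # C
ELECTRON_MASS = 9.1093837015e-31       # kg
ELECTRIC_CONSTANT = 8.8541878128e-12   # F/m


def plasma_frequency(n0):
    """Angular plasma frequency (rad/s) for an electron concentration n0 (m^-3)."""
    return math.sqrt(n0 * ELEMENTARY_CHARGE**2 / (ELECTRON_MASS * ELECTRIC_CONSTANT))


omega = plasma_frequency(1.0e18)
print(f"omega = {omega:.3e} rad/s, nu = {omega / (2.0 * math.pi):.3e} Hz")
```

Because the frequency is proportional to the square root of the concentration, a fourfold increase in n0 doubles the plasma frequency.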

28.3 Free Harmonic Oscillations in an Oscillatory Electric Circuit



28.3.1 An example of an electric circuit in which free electric oscillations may occur is the simple oscillatory circuit (Fig. 28.6). It consists of a capacitor of capacitance $C$ connected in series to an induction coil with the self-inductance $L$. When the previously charged capacitor is connected to the coil, free oscillations of the capacitor charge and the current in the coil are produced. This sets up a variable electromagnetic field that propagates through space at a velocity equal to that of light. Hence, if the linear dimensions $l$ of the circuit are not too large ($l \ll c/\nu$, where $c = 3 \times 10^8$ m/s is the velocity of light in vacuum, and $\nu$ is the frequency of oscillation in the circuit), it can be assumed that at each instant of time $t$ the current $I$ is the same at all parts of the circuit. Such a variable current is said to be quasi-stationary. According to Ohm’s law (21.2.3) for the portion 1-L-2 of the circuit (Fig. 28.6)

$$IR = V_1 - V_2 + \mathcal{E}_s \quad \text{or} \quad IR = -\frac{q}{C} - L\frac{dI}{dt}.$$

Here $q$ and $V_1 - V_2 = -q/C$ are the charge of the capacitor and the potential difference of its plates at the arbitrary instant $t$ of time being considered, $R$ is the electric resistance of the oscillatory circuit, i.e. of the part 1-L-2 of the circuit, and $\mathcal{E}_s = -L\,(dI/dt)$ is the emf of self-induction in the coil (25.2.4). It follows from the law of conservation of electric charge (14.1.3) that the quasi-stationary current in the circuit is $I = dq/dt$. Therefore, the differential equation for the variation of charge $q$ is of the form

$$\frac{d^2q}{dt^2} + \frac{R}{L}\frac{dq}{dt} + \frac{q}{LC} = 0.$$

Fig. 28.6

28.3.2 Free electric oscillations in an oscillatory circuit are harmonic if its electrical resistance $R = 0$. Thus

$$\frac{d^2q}{dt^2} + \frac{1}{LC}q = 0.$$

The angular frequency $\omega$ and the period $T$ of these oscillations satisfy the Thomson formula:

$$\omega = \frac{1}{\sqrt{LC}} \quad \text{and} \quad T = 2\pi\sqrt{LC}.$$

The charge $q$ of the capacitor and current $I$ in the circuit vary according to the equations

$$q = q_0\sin(\omega t + \varphi_0) \quad \text{and} \quad I = I_0\cos(\omega t + \varphi_0) = I_0\sin\left(\omega t + \varphi_0 + \frac{\pi}{2}\right),$$

where $q_0$ is the amplitude of the capacitor charge, $I_0 = \omega q_0 = q_0/\sqrt{LC}$ is the amplitude of the current, and $\varphi_0$ is the initial phase of capacitor charge oscillation. The current in the circuit has a phase lead of $\pi/2$ over the capacitor charge.

The potential difference $u = V_2 - V_1$ of the capacitor plates also varies according to a harmonic law and is in phase with charge $q$. Thus

$$u = \frac{q}{C} = U_0\sin(\omega t + \varphi_0),$$

where $U_0 = q_0/C$ is the amplitude of the potential difference. The amplitude of the current is

$$I_0 = U_0\sqrt{\frac{C}{L}}.$$

The quantity $\sqrt{L/C}$ is called the characteristic, or wave, impedance of the oscillatory circuit.

28.3.3 With free harmonic oscillations in an oscillatory circuit, there is a periodic transfer of the energy $E_e$ of the electric field of the capacitor into energy $E_m$ of the magnetic field of the inductance coil and back again. Thus

$$E_e = \frac{q^2}{2C} = \frac{q_0^2}{2C}\sin^2(\omega t + \varphi_0) = \frac{q_0^2}{4C}[1 - \cos(2\omega t + 2\varphi_0)],$$

$$E_m = \frac{LI^2}{2} = \frac{LI_0^2}{2}\cos^2(\omega t + \varphi_0) = \frac{LI_0^2}{4}[1 + \cos(2\omega t + 2\varphi_0)].$$

For this reason, the oscillations in an electrical oscillatory circuit are frequently said to be electromagnetic oscillations in the circuit.

The values of $E_e$ and $E_m$ vary in electromagnetic oscillations within the limits from 0 to their maximum values, equal, respectively, to $q_0^2/2C$ and $LI_0^2/2$, in which $q_0^2/2C = LI_0^2/2$. The variations of $E_e$ and $E_m$ are displaced in phase by $\pi$. At the instants of time when $E_e = 0$, $E_m = E_{m\,\max} = LI_0^2/2$. On the other hand, when $E_m = 0$, $E_e = E_{e\,\max} = q_0^2/2C$. The total energy of electromagnetic oscillations is constant with time. Thus

$$E = E_e + E_m = \frac{q_0^2}{2C} = \frac{LI_0^2}{2} = \text{const}.$$
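
The relations for an ideal oscillatory circuit (R = 0) can be collected in a short Python sketch. The function name `lc_circuit` and the component values are assumptions chosen only for illustration.

```python
import math


def lc_circuit(L, C, q0):
    """Return (omega, T, I0, U0, impedance) for an ideal LC circuit (R = 0)."""
    omega = 1.0 / math.sqrt(L * C)               # Thomson formula
    period = 2.0 * math.pi * math.sqrt(L * C)
    i0 = omega * q0                              # amplitude of the current
    u0 = q0 / C                                  # amplitude of the potential difference
    impedance = math.sqrt(L / C)                 # characteristic (wave) impedance
    return omega, period, i0, u0, impedance


L, C, q0 = 1.0e-3, 1.0e-9, 1.0e-8                # H, F, C
omega, period, i0, u0, impedance = lc_circuit(L, C, q0)
print(f"omega = {omega:.3e} rad/s, T = {period:.3e} s")
print(f"I0 = {i0:.3e} A, U0 = {u0:.3e} V, impedance = {impedance:.3e} ohm")

# Maximum electric and magnetic energies should coincide
print(q0**2 / (2.0 * C), L * i0**2 / 2.0)
```

The two printed energies agree, and the ratio U0 / I0 equals the characteristic impedance.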


28.4  Adding  Harmonic  Oscillations 

28.4.1 The addition of oscillations (vibrations or variations) consists in finding the equation for the resulting oscillation of a system when this system simultaneously participates in several oscillatory processes. Distinction is made between two limiting cases: the combining of oscillations having the same direction and the combining of mutually perpendicular oscillations. The first case corresponds, for example, to the vibration of weight 1 (Fig. 28.7), which vibrates with respect to weight 2 on spring a, and together with weight 2 on spring b. The same situation occurs in superposing the variations of scalar physical characteristics of an oscillatory or variable system (pressure, temperature, density, electric charge, current, etc.).

28.4.2 Two harmonic oscillations having the same direction, such as $s_1 = A_1\sin(\omega_1t + \varphi_1)$ and $s_2 = A_2\sin(\omega_2t + \varphi_2)$, can be added together by using the vector diagram method (28.1.6). Shown in Fig. 28.8 are the vectors $\mathbf{A}_1(t)$ and $\mathbf{A}_2(t)$ of the first and second oscillations at the arbitrary instant $t$ of time when the phases of these oscillations equal $\Phi_1(t) = \omega_1t + \varphi_1$ and $\Phi_2(t) = \omega_2t + \varphi_2$. The resultant oscillation $s = s_1 + s_2$ corresponds to the vector $\mathbf{A}(t) = \mathbf{A}_1(t) + \mathbf{A}_2(t)$ whose projection on the vertical axis $OY$ is equal to $s$. Thus

$$s = A(t)\sin\Phi(t).$$

According to the cosine law

$$[A(t)]^2 = A_1^2 + A_2^2 + 2A_1A_2\cos[\Phi_2(t) - \Phi_1(t)],$$

and

$$\tan\Phi(t) = \frac{A_1\sin\Phi_1(t) + A_2\sin\Phi_2(t)}{A_1\cos\Phi_1(t) + A_2\cos\Phi_2(t)}.$$

Fig. 28.7

Fig. 28.8


28.4.3 Two harmonic oscillations $s_1$ and $s_2$ are said to be coherent if their phase difference is independent of time. Thus

$$\frac{d}{dt}[\Phi_2(t) - \Phi_1(t)] = 0 \quad \text{and} \quad \Phi_2(t) - \Phi_1(t) = \text{const}.$$

Since $\Phi_2(t) - \Phi_1(t) = (\omega_2 - \omega_1)t + (\varphi_2 - \varphi_1)$, the angular frequencies of coherent oscillations should be the same, i.e. $\omega_2 = \omega_1 = \omega$. At any instant of time the phase difference of coherent oscillations is equal to the difference of their initial phases: $\Phi_2(t) - \Phi_1(t) = \varphi_2 - \varphi_1$. Consequently, the resultant oscillation is harmonic and has the same angular frequency $\omega$, i.e.

$$s = s_1 + s_2 = A\sin(\omega t + \varphi_0),$$

where

$$A^2 = A_1^2 + A_2^2 + 2A_1A_2\cos(\varphi_2 - \varphi_1)$$

and

$$\tan\varphi_0 = \frac{A_1\sin\varphi_1 + A_2\sin\varphi_2}{A_1\cos\varphi_1 + A_2\cos\varphi_2}.$$

Depending on the initial phase difference of the oscillations being added together, the amplitude $A$ of the resultant oscillation varies within the limits

from $A = |A_1 - A_2|$ at $\varphi_2 - \varphi_1 = \pm(2m + 1)\pi$

to $A = A_1 + A_2$ at $\varphi_2 - \varphi_1 = \pm 2m\pi$,

where $m = 0, 1, 2, \dots$ is any nonnegative integer. When $\varphi_2 - \varphi_1 = \pm 2m\pi$, the oscillations being added are said to be cophased (or simply, in phase), whereas at $\varphi_2 - \varphi_1 = \pm(2m + 1)\pi$ they are said to be antiphased (or in opposition).
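
The addition of two coherent oscillations can be sketched in Python using the cosine-law expression for the amplitude and the tangent expression for the initial phase. The function name `add_coherent` and the sample values are assumptions of this example; `atan2` replaces the plain arc tangent so that the correct quadrant is selected.

```python
import math


def add_coherent(a1, phi1, a2, phi2):
    """Return (A, phi0) of the sum of two coherent oscillations.

    The oscillations are a1 * sin(w t + phi1) and a2 * sin(w t + phi2).
    """
    squared = a1**2 + a2**2 + 2.0 * a1 * a2 * math.cos(phi2 - phi1)
    amplitude = math.sqrt(max(0.0, squared))   # guard against tiny negative rounding
    phi0 = math.atan2(a1 * math.sin(phi1) + a2 * math.sin(phi2),
                      a1 * math.cos(phi1) + a2 * math.cos(phi2))
    return amplitude, phi0


print(add_coherent(3.0, 0.2, 4.0, 0.2))           # in phase
print(add_coherent(3.0, 0.0, 4.0, math.pi))       # in opposition
print(add_coherent(3.0, 0.0, 4.0, math.pi / 2))   # phase difference of pi/2
```

The first two calls reproduce the limiting amplitudes A1 + A2 and |A1 − A2| described above.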

28.4.4 Harmonic oscillations of different frequency ($\omega_2 \neq \omega_1$) are incoherent because their phase difference, equal to $(\omega_2 - \omega_1)t + (\varphi_2 - \varphi_1)$, varies continuously with time. If such oscillations are superposed, an anharmonic resultant oscillation is obtained. The amplitude vectors $\mathbf{A}_1$ and $\mathbf{A}_2$ of the added oscillations (Fig. 28.8) rotate at different angular velocities, so that the parallelogram constructed on the vectors is continuously deformed and its diagonal, vector $\mathbf{A}$ of the resultant oscillation, varies in length and rotates at variable angular velocity.

Two harmonic oscillations having different angular frequencies $\omega_1$ and $\omega_2$ can be approximately considered to be coherent only during the time interval $\Delta t$ in which the phase difference of these oscillations varies only slightly: $|\omega_2 - \omega_1|\,\Delta t \ll 2\pi$ or $\Delta t \ll \tau_{\text{coh}}$, where $\tau_{\text{coh}} = 2\pi/|\omega_2 - \omega_1|$ is the coherence time of the oscillations being considered.

28.4.5 Anharmonic oscillation obtained as a result of superposing two harmonic oscillations having the same direction and slightly differing frequencies ($|\omega_2 - \omega_1| \ll \omega_1$) produces what are called beats. It proves convenient to take as the time reference ($t = 0$) the instant when the phases of the two oscillations being superposed, $s_1$ and $s_2$, coincide and are equal to $\varphi_0$. Then $s_1 = A_1 \sin(\omega_1 t + \varphi_0)$ and $s_2 = A_2 \sin(\omega_2 t + \varphi_0) = A_2 \sin[\omega_1 t + \varphi_0 + \varphi(t)]$, where $\varphi(t) = (\omega_2 - \omega_1) t$. The resultant oscillation satisfies the relation

$$s = A(t) \sin[\omega_1 t + \varphi_0 + \psi(t)],$$

where

$$[A(t)]^2 = A_1^2 + A_2^2 + 2 A_1 A_2 \cos\varphi(t)$$

and

$$\tan\psi(t) = \frac{A_2 \sin\varphi(t)}{A_1 + A_2 \cos\varphi(t)}.$$

In particular, when $A_1 = A_2 = A_0$,

$$A(t) = 2 A_0 \cos\left(\frac{\omega_2 - \omega_1}{2} t\right) \quad \text{and} \quad \psi(t) = \frac{\omega_2 - \omega_1}{2} t,$$

so that

$$s = 2 A_0 \cos\left(\frac{\omega_2 - \omega_1}{2} t\right) \sin\left(\frac{\omega_1 + \omega_2}{2} t + \varphi_0\right).$$

The quantity $|A(t)|$, characterizing the swing of the oscillations in beating, varies within the limits from $|A_1 - A_2|$ to $A_1 + A_2$ with the angular frequency $\Omega = |\omega_2 - \omega_1|$, called the angular beat frequency. Since the beat frequency is much lower than the frequency of oscillation ($\Omega \ll \omega_1$), the variable quantity $|A(t)|$ is conditionally called the beat amplitude. The beat period $T_b$ and the beat frequency $\nu_b$ are equal to

$$T_b = \frac{2\pi}{\Omega} = \frac{2\pi}{|\omega_2 - \omega_1|} = \frac{T_1 T_2}{|T_1 - T_2|}$$

and

$$\nu_b = \frac{1}{T_b} = |\nu_2 - \nu_1|,$$

where $T_1$, $\nu_1$, $T_2$ and $\nu_2$ are the periods and frequencies of the superposed oscillations. Curves of the dependence of $s$ on $t$ in beating are shown in Fig. 28.9 for the case when $A_1 = A_2 = A_0$.
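
The equal-amplitude case of beating lends itself to a short numerical check. The sketch below is illustrative only: the function names and the two tone frequencies (440 Hz and 444 Hz) are assumptions, not values from the handbook. It computes the beat period and confirms that the product of a slow envelope and a fast carrier equals the direct sum of the two oscillations.

```python
import math

def beat_parameters(omega1, omega2):
    """Angular beat frequency and beat period of two superposed oscillations."""
    big_omega = abs(omega2 - omega1)
    return big_omega, 2 * math.pi / big_omega

def beat_signal(a0, omega1, omega2, phi0, t):
    """Equal-amplitude case: slowly varying A(t) times a fast carrier."""
    envelope = 2 * a0 * math.cos(0.5 * (omega2 - omega1) * t)
    carrier = math.sin(0.5 * (omega1 + omega2) * t + phi0)
    return envelope * carrier

# Illustrative values: two tones at 440 Hz and 444 Hz, common initial phase 0.3 rad.
w1, w2 = 2 * math.pi * 440.0, 2 * math.pi * 444.0
Omega, T_b = beat_parameters(w1, w2)
t = 0.0123
direct = math.sin(w1 * t + 0.3) + math.sin(w2 * t + 0.3)
print(T_b, abs(direct - beat_signal(1.0, w1, w2, 0.3, t)) < 1e-9)
```

With these tones the beat frequency is 4 Hz, so the swing of the oscillation repeats every quarter of a second.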


28.4.6 As a result of the superposition of harmonic oscillations, coinciding in direction and having angular frequencies that are integral multiples of their lowest frequency, i.e. $\omega$, $2\omega$, $3\omega$, etc., a periodic nonharmonic oscillation with the period $T = 2\pi/\omega$ is obtained. On the other hand, any complex periodic oscillation $s = f(t)$ can be represented as the sum of simple harmonic oscillations having angular frequencies which are integral multiples of the basic angular frequency $\omega = 2\pi/T$, where $T$ is the period of the oscillation. Thus

$$s = f(t) = \frac{a_0}{2} + \sum_{n=1}^{\infty} (a_n \cos n\omega t + b_n \sin n\omega t) = \frac{a_0}{2} + \sum_{n=1}^{\infty} A_n \sin(n\omega t + \varphi_n),$$

where

$$a_n = \frac{2}{T} \int_{-T/2}^{T/2} f(t) \cos n\omega t \, dt \quad (n = 0, 1, 2, \ldots),$$

$$b_n = \frac{2}{T} \int_{-T/2}^{T/2} f(t) \sin n\omega t \, dt \quad (n = 1, 2, \ldots).$$

Such a representation of the periodic function $f(t)$ is called the expansion of this function into a Fourier series, or the harmonic analysis of a complex periodic oscillation. The terms of the Fourier series corresponding to the harmonic oscillations with the angular frequencies $\omega$, $2\omega$, $3\omega$, etc. are called the first, or fundamental, second, third, etc. harmonics of the complex periodic oscillation $s = f(t)$. The whole set of these harmonics forms the spectrum of the oscillation $s = f(t)$. The composition of the spectrum depends on the form of the periodic function $s = f(t)$. In the simplest cases, the spectrum can consist of a small number of harmonics.

The oscillation spectrum is often understood to be the frequency spectrum, i.e. the set of frequencies of simple harmonic oscillations that can be superimposed to obtain the complex oscillation being considered. Periodic oscillations have discrete (line) frequency spectra.
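
As an illustration of harmonic analysis, the coefficients $a_n$ and $b_n$ can be approximated by numerical integration. The sketch below is an assumption-laden example rather than a method from the handbook: the helper `fourier_coefficients`, the midpoint rule, the sample count and the square-wave test function are all chosen here for illustration.

```python
import math

def fourier_coefficients(f, period, n_max, samples=4096):
    """Approximate a_n, b_n by the midpoint rule over [-T/2, T/2]."""
    omega = 2 * math.pi / period
    dt = period / samples
    ts = [-period / 2 + (k + 0.5) * dt for k in range(samples)]
    a = [2 / period * sum(f(t) * math.cos(n * omega * t) for t in ts) * dt
         for n in range(n_max + 1)]
    b = [0.0] + [2 / period * sum(f(t) * math.sin(n * omega * t) for t in ts) * dt
                 for n in range(1, n_max + 1)]
    return a, b

def square_wave(t):
    """Odd square wave of period 2*pi: +1 where sin t > 0, -1 otherwise."""
    return 1.0 if math.sin(t) > 0 else -1.0

a, b = fourier_coefficients(square_wave, 2 * math.pi, 5)
# For this wave the exact values are b_n = 4/(n*pi) for odd n and 0 for even n,
# while all a_n vanish because the function is odd.
print([round(x, 3) for x in b[1:]])
```

The spectrum of this square wave therefore contains only odd harmonics, whose amplitudes fall off as $1/n$.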

28.4.7 Nonperiodic oscillations have, as a rule, continuous frequency spectra, i.e. they can be represented as the result of the superposition of a great many harmonic oscillations, whose frequencies have all possible values in a certain range (from 0 to $\infty$ in the general case). The harmonic analysis of such oscillations consists in presenting them in the form of a Fourier integral:

$$s = f(t) = \int_0^{\infty} [a(\omega) \cos\omega t + b(\omega) \sin\omega t] \, d\omega,$$

where

$$a(\omega) = \frac{1}{\pi} \int_{-\infty}^{\infty} f(\xi) \cos\omega\xi \, d\xi, \quad b(\omega) = \frac{1}{\pi} \int_{-\infty}^{\infty} f(\xi) \sin\omega\xi \, d\xi.$$

Certain nonperiodic oscillations, called almost periodic (quasi-periodic) oscillations, have a line frequency spectrum. But the included angular frequencies are incommensurable with one another, because their ratios are expressed by irrational numbers.

28.4.8 Modulation is the change of any parameter of a periodic oscillation (for instance, the amplitude or frequency) according to a definite law during a length of time considerably longer than the period of the oscillation.

In amplitude modulation, for example, of the harmonic oscillation $s = A_0 \sin(\omega_0 t + \varphi_0)$, the modulated oscillation is of the form

$$s = A_0 [1 + b(t)] \sin(\omega_0 t + \varphi_0),$$

where $|b(t)| < 1$.

If amplitude modulation is carried out according to the harmonic law $b(t) = b_0 \cos\Omega t$, where $b_0 = \text{const}$ and $\Omega \ll \omega_0$, then

$$s = A_0 (1 + b_0 \cos\Omega t) \sin(\omega_0 t + \varphi_0).$$

This modulated oscillation has a line frequency spectrum because it can be represented as the sum of three harmonic oscillations with the angular frequencies $\omega_0$, $\omega_0 - \Omega$ and $\omega_0 + \Omega$, and amplitudes equal, respectively, to $A_0$, $A_0 b_0/2$ and $A_0 b_0/2$. Thus

$$A_0 (1 + b_0 \cos\Omega t) \sin(\omega_0 t + \varphi_0) = A_0 \sin(\omega_0 t + \varphi_0) + \frac{A_0 b_0}{2} \{\sin[(\omega_0 + \Omega) t + \varphi_0] + \sin[(\omega_0 - \Omega) t + \varphi_0]\}.$$

In frequency modulation of the harmonic oscillation $s = A_0 \sin(\omega_0 t + \varphi_0)$, carried out according to a harmonic law, the modulated oscillation is of the form

$$s = A_0 \sin[\omega_0 (1 + b_0 \cos\Omega t) t + \varphi_0],$$

where $b_0 \ll 1$ and $\Omega \ll \omega_0$.

Accordingly, in phase modulation, the initial phase of the oscillation is changed. Thus

$$s = A_0 \sin(\omega_0 t + \Delta\varphi \cos\Omega t), \quad \text{where } \Omega \ll \omega_0.$$
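
The three-component representation of the harmonically amplitude-modulated oscillation given above can be verified numerically. The following sketch is illustrative: the function names and parameter values are assumptions made here, chosen so that $\Omega \ll \omega_0$.

```python
import math

def am_signal(a0, b0, omega0, big_omega, phi0, t):
    """Amplitude-modulated oscillation A0 (1 + b0 cos(Omega t)) sin(omega0 t + phi0)."""
    return a0 * (1 + b0 * math.cos(big_omega * t)) * math.sin(omega0 * t + phi0)

def am_spectrum_sum(a0, b0, omega0, big_omega, phi0, t):
    """Same oscillation written as a carrier plus two side components."""
    carrier = a0 * math.sin(omega0 * t + phi0)
    upper = 0.5 * a0 * b0 * math.sin((omega0 + big_omega) * t + phi0)
    lower = 0.5 * a0 * b0 * math.sin((omega0 - big_omega) * t + phi0)
    return carrier + upper + lower

# Illustrative parameters with Omega much smaller than omega0.
params = dict(a0=2.0, b0=0.5, omega0=100.0, big_omega=3.0, phi0=0.4)
worst = max(abs(am_signal(t=0.01 * k, **params) - am_spectrum_sum(t=0.01 * k, **params))
            for k in range(1000))
print(worst < 1e-12)
```

The two forms agree to rounding error at every sampled instant, which is what the line spectrum with frequencies $\omega_0$ and $\omega_0 \pm \Omega$ expresses.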

In the general case oscillations can be modulated simultaneously in amplitude and in phase (or frequency). An example of such modulated oscillations is beating (28.4.5).

28.4.9 Addition of mutually perpendicular harmonic oscillations of equal frequency. Assume that point $M$ oscillates simultaneously along the coordinate axes $OX$ and $OY$ according to the law: $x = A_1 \sin(\omega t + \varphi_1)$ and $y = A_2 \sin(\omega t + \varphi_2)$, where $x$ and $y$ are the Cartesian coordinates of point $M$. The path equation for the resultant motion of point $M$ in plane $XOY$ can be found by eliminating parameter $t$ from the expressions for $x$ and $y$. Thus

$$\frac{x^2}{A_1^2} + \frac{y^2}{A_2^2} - \frac{2xy}{A_1 A_2} \cos(\varphi_2 - \varphi_1) = \sin^2(\varphi_2 - \varphi_1).$$

This path is an ellipse (Fig. 28.10) and point $M$ describes this ellipse during a time equal to the period of the oscillations being added together ($T = 2\pi/\omega$). Hence, the resultant motion of point $M$ is called elliptically polarized oscillation.

The orientation of the axes of the ellipse in plane $XOY$, as well as its dimensions, depend upon the amplitudes $A_1$ and $A_2$ of the oscillations being combined and on their initial phase difference $\varphi_2 - \varphi_1$. If $\varphi_2 - \varphi_1 = (2m + 1)\pi/2$, where $m = 0, \pm 1, \pm 2, \ldots$, the axes of the ellipse coincide with the coordinate axes $OX$ and $OY$, and the lengths of the semiaxes are equal to the amplitudes $A_1$ and $A_2$. Thus

$$\frac{x^2}{A_1^2} + \frac{y^2}{A_2^2} = 1.$$

If, in addition, $A_1 = A_2$, the path of point $M$ is a circle. Such resultant motion of point $M$ is called circularly polarized oscillation.

When $\varphi_2 - \varphi_1 = m\pi$, where $m = 0, \pm 1, \pm 2, \ldots$, the ellipse degenerates into a straight line. Thus

$$y = \pm \frac{A_2}{A_1} x.$$

The plus sign corresponds to even values of $m$, i.e. to the addition of cophased oscillations (Fig. 28.11a); the minus sign, to odd values of $m$, i.e. to the addition of oscillations that are in opposition (antiphased) (Fig. 28.11b). In either case, point $M$ executes linearly polarized oscillation. It oscillates with harmonic motion at the frequency of the oscillations being combined and with the amplitude $A = \sqrt{A_1^2 + A_2^2}$ along a straight line that makes the angle $\alpha = \arctan(A_2 \cos m\pi / A_1)$ with the axis $OX$.

Fig. 28.11: (a) $m = 0, \pm 2, \pm 4, \ldots$; (b) $m = \pm 1, \pm 3, \ldots$
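
A quick numerical check of the path equation for two perpendicular oscillations of equal frequency is sketched below. The function name and the chosen amplitudes, phases and frequency are illustrative assumptions; the code only confirms that the coordinates of point $M$ satisfy the ellipse equation at every sampled instant.

```python
import math

def path_residual(a1, a2, phi1, phi2, omega, t):
    """Left side minus right side of the path equation at time t."""
    x = a1 * math.sin(omega * t + phi1)
    y = a2 * math.sin(omega * t + phi2)
    dphi = phi2 - phi1
    # x^2/A1^2 + y^2/A2^2 - (2xy/(A1 A2)) cos(dphi) should equal sin^2(dphi).
    lhs = (x / a1)**2 + (y / a2)**2 - 2 * x * y / (a1 * a2) * math.cos(dphi)
    return lhs - math.sin(dphi)**2

# Illustrative amplitudes and phases; the residual should vanish at every instant.
worst = max(abs(path_residual(2.0, 1.0, 0.3, 1.1, 5.0, 0.01 * k)) for k in range(500))
print(worst < 1e-12)
```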

28.4.10 In the addition of mutually perpendicular oscillations having the angular frequencies $p\omega$ and $q\omega$, where $p$ and $q$ are whole numbers, we obtain

$$x = A_1 \sin(p\omega t + \varphi_1) \quad \text{and} \quad y = A_2 \sin(q\omega t + \varphi_2).$$

The values of coordinates $x$ and $y$ of the oscillating point $M$ are simultaneously repeated after equal periods $T_0$ of time, equal to the least common multiple of $T_1 = 2\pi/p\omega$ and $T_2 = 2\pi/q\omega$, which are the periods of oscillation along the $OX$ and $OY$ axes. Hence, the path of point $M$ is a closed curve whose shape depends upon the ratios of the amplitudes, frequencies and initial phases of the oscillations being combined. Such closed paths of point $M$, which has simultaneous harmonic oscillations in two mutually perpendicular directions, are called Lissajous figures.
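
The common period $T_0$ of a Lissajous figure can be computed directly from $p$, $q$ and $\omega$. The sketch below is an illustration under stated assumptions: the function names and numeric values are chosen here, and the least common multiple of $2\pi/p\omega$ and $2\pi/q\omega$ is evaluated as $2\pi/(\gcd(p, q)\,\omega)$.

```python
import math

def common_period(p, q, omega):
    """Least common multiple of T1 = 2*pi/(p*omega) and T2 = 2*pi/(q*omega)."""
    # In units of 2*pi/omega the periods are 1/p and 1/q; their least common
    # multiple is 1/gcd(p, q).
    return 2 * math.pi / (math.gcd(p, q) * omega)

def lissajous_point(a1, a2, p, q, omega, phi1, phi2, t):
    """Coordinates (x, y) of the oscillating point M at time t."""
    return (a1 * math.sin(p * omega * t + phi1),
            a2 * math.sin(q * omega * t + phi2))

# Illustrative values: frequency ratio q/p = 3/2 and phi1 - phi2 = pi/2.
p, q, omega = 2, 3, 1.5
T0 = common_period(p, q, omega)
start = lissajous_point(1.0, 2.0, p, q, omega, math.pi / 2, 0.0, 0.2)
later = lissajous_point(1.0, 2.0, p, q, omega, math.pi / 2, 0.0, 0.2 + T0)
print(T0, start, later)
```

The point returns to the same coordinates after $T_0$, which is why the path is closed.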


Fig.  28.12 


Lissajous figures are inscribed into rectangles whose centre coincides with the origin of coordinates and whose sides are parallel to coordinate axes $OX$ and $OY$. The distances of these sides from the axes are equal, respectively, to $A_2$ and $A_1$. The ratio of the frequencies $q\omega$ and $p\omega$ of the combined oscillations is equal to the ratio of the number of times the corresponding Lissajous figure touches a side of the rectangle parallel to the $OX$ axis to the number of times it touches a side parallel to the $OY$ axis. Shown in Fig. 28.12 are the Lissajous figures for three different values of the ratio $q/p$ (2:1, 3:2 and 4:3) and for initial phase differences of $\Delta\varphi = \varphi_1 - \varphi_2 = \pi/2$.


CHAPTER 29 DAMPED AND FORCED OSCILLATIONS


29.1  Damped  Oscillations 

29.1.1 The damping of an oscillation or vibration is its gradual weakening in the course of time due to the loss of energy of the oscillatory (vibratory) system. Free oscillations (vibrations) of real systems are always damped. The damping of free mechanical vibrations is due chiefly to friction and the excitation of elastic waves (30.1.3) in the surrounding medium. Damping in electrical oscillatory systems is due to thermal losses in the conductors making up the system or located in its variable electric field, energy losses on the radiation of electromagnetic waves (31.1.1), and heat losses in dielectrics and ferromagnetic materials as a result of electrical and magnetic hysteresis (18.4.5 and 26.5.2).

The damping of oscillations depends upon the properties of the oscillatory system. A system is said to be linear if the parameters characterizing physical properties of the system, essential in the process being considered, do not change in the course of the process. Linear systems are described by linear differential equations. A spring pendulum (28.2.3), for instance, moving in a viscous medium, is a linear system if the resistance coefficient of the medium and the elasticity of the spring are independent of the velocity and displacement of the pendulum. An electrical oscillatory circuit (28.3.1) can be considered linear if its electrical resistance $R$, capacitance $C$ and inductance $L$ are independent of the current in the circuit and of the voltage. In the majority of cases real oscillatory or vibratory systems are sufficiently close, with respect to their properties, to linear ones.

29.1.2 The differential equation for the free damped oscillation of a linear system is of the form:

$$\frac{d^2 s}{dt^2} + 2\beta \frac{ds}{dt} + \omega_0^2 s = 0.$$

Here $s$ is the physical characteristic of the system that varies in the oscillation, $\beta = \text{const} > 0$ is the damping factor, and $\omega_0$ is the angular frequency of free undamped oscillation of the same system, i.e. in the absence of energy losses (at $\beta = 0$).

Example 1. Free damped oscillation of a spring pendulum (28.2.3). A pendulum of mass $m$, oscillating in a straight line along the $OX$ axis due to the elastic force of its spring, is also subject to a resisting force $\mathbf{F}_{res} = -b\mathbf{v}$, where $\mathbf{v}$ is the velocity of the pendulum and $b = \text{const} > 0$ is the resistance coefficient. The differential equation of free damped oscillation of the pendulum is

$$m \frac{d^2 x}{dt^2} = -kx - b \frac{dx}{dt}$$

or

$$\frac{d^2 x}{dt^2} + 2\beta \frac{dx}{dt} + \omega_0^2 x = 0,$$

where $\beta = b/2m$, $\omega_0 = \sqrt{k/m}$, and $k$ is a factor specifying the elastic properties of the spring.

Example 2. Free damped oscillation in an electric oscillatory circuit. The electrical resistance of a real circuit $R \neq 0$, and the differential equation of the oscillation in the circuit (28.3.1) is of the form

$$\frac{d^2 q}{dt^2} + 2\beta \frac{dq}{dt} + \omega_0^2 q = 0,$$

where $\beta = R/2L$ and $\omega_0 = 1/\sqrt{LC}$.

29.1.3 If damping is small ($\beta < \omega_0$) the dependence of $s$ on $t$ that satisfies the equation of damped oscillation (29.1.2) is of the form

$$s = A_0 e^{-\beta t} \sin(\omega t + \psi_0).$$

Here $\omega = \sqrt{\omega_0^2 - \beta^2}$ and the constant quantities $A_0$ and $\psi_0$ depend upon the initial conditions, i.e. upon the values of $s$ and $ds/dt$ at the initial instant of time ($t = 0$). The curve of the dependence of $s$ on $t$ at $\psi_0 = 0$ is shown in Fig. 29.1. Damped oscillations are not periodic (28.1.2). For example, the maximum value of the varying quantity $s$, reached at a certain instant $t_1$ of time, is never repeated subsequently (at $t > t_1$). But the quantity $s$ becomes zero in damped oscillation as it varies in a single direction (decreases, for instance) and also reaches its maximum and minimum values after equal time intervals. Thus

$$T = \frac{2\pi}{\omega} = \frac{2\pi}{\sqrt{\omega_0^2 - \beta^2}}.$$

For this reason, the quantities $T$ and $\omega$ are conditionally called the period (conditional period) and angular frequency (conditional angular frequency) of damped oscillation.

The quantity

$$A = A_0 e^{-\beta t}$$

is called the amplitude of damped oscillation, where $A_0$ is the initial amplitude. The amplitude of damped oscillation decreases in the course of time; the greater the damping factor $\beta$, the more rapidly the amplitude decreases.

The time interval $\tau = 1/\beta$ during which there is an e-fold decrease in the amplitude of damped oscillation is called the relaxation time.
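
The weakly damped solution can be tied to initial conditions and checked against the differential equation of 29.1.2. The sketch below is illustrative only: the helper `damped_solution`, the way $A_0$ and $\psi_0$ are obtained from $s(0)$ and $ds/dt(0)$, and the numeric values are assumptions introduced here.

```python
import math

def damped_solution(beta, omega0, s0, v0):
    """Return s(t) = A0 exp(-beta t) sin(omega t + psi0) with s(0)=s0, s'(0)=v0."""
    if beta >= omega0:
        raise ValueError("formula applies only for beta < omega0")
    omega = math.sqrt(omega0**2 - beta**2)
    # s(0) = A0 sin(psi0);  s'(0) = A0 (omega cos(psi0) - beta sin(psi0)).
    a0_sin, a0_cos = s0, (v0 + beta * s0) / omega
    a0 = math.hypot(a0_sin, a0_cos)
    psi0 = math.atan2(a0_sin, a0_cos)
    return lambda t: a0 * math.exp(-beta * t) * math.sin(omega * t + psi0)

# Illustrative parameters.
beta, omega0 = 0.3, 2.0
s = damped_solution(beta, omega0, s0=1.0, v0=0.0)

# Check the differential equation at one instant with central differences.
t, h = 0.7, 1e-4
ds = (s(t + h) - s(t - h)) / (2 * h)
d2s = (s(t + h) - 2 * s(t) + s(t - h)) / h**2
residual = d2s + 2 * beta * ds + omega0**2 * s(t)
tau = 1 / beta  # relaxation time: the amplitude falls e-fold over tau
print(abs(residual) < 1e-5, tau)
```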

29.1.4 The logarithmic decrement of damping is the dimensionless quantity $\delta$, equal to the natural logarithm of the ratio of the amplitudes of damped oscillation at the instants of time $t$ and $t + T$ (where $T$ is the conditional period of oscillation). Thus

$$\delta = \ln\frac{A(t)}{A(t + T)} = \beta T = \frac{T}{\tau} = \frac{1}{N},$$

where $N$ is the number of oscillations in which the amplitude is reduced e-fold.

The relation between the angular frequency $\omega$ of damped oscillation of a system and the logarithmic decrement of damping $\delta$ is

$$\omega = \omega_0 \sqrt{1 - \left(\frac{\omega}{\omega_0}\right)^2 \left(\frac{\delta}{2\pi}\right)^2}.$$


29.1.5 The Q factor (or quality factor or storage factor) of an oscillatory system is the dimensionless quantity $Q$, equal to the product of $2\pi$ by the ratio of the energy of oscillation $E(t)$ of the system at the arbitrary time instant $t$ to the decrease of this energy during the length of time from $t$ to $t + T$, i.e. during one conditional period of damped oscillation. Thus

$$Q = 2\pi \frac{E(t)}{E(t) - E(t + T)}.$$

Since the energy $E(t)$ is proportional to the square of the amplitude $A(t)$ of the oscillation,

$$Q = 2\pi \frac{A^2(t)}{A^2(t) - A^2(t + T)} = \frac{2\pi}{1 - e^{-2\beta T}} = \frac{2\pi}{1 - e^{-2\delta}}.$$

At low values of the logarithmic decrement of damping $\delta$, the Q factor of an oscillatory system is $Q = \pi/\delta$. Here the conditional period of damped oscillation $T$ is practically equal to the period $T_0$ of free undamped oscillation and, consequently, $Q = \pi/\beta T_0 = \omega_0/2\beta$. For example, the Q factor of an electrical oscillatory circuit (28.3.1) is $Q = \sqrt{L/C}/R$, and that of a spring pendulum (29.1.2) is $Q = \sqrt{km}/b$.
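
The logarithmic decrement and the Q factor of a spring pendulum can be evaluated from $m$, $k$ and $b$. The following is a minimal sketch under stated assumptions: the function name and the numeric values are illustrative, and the pendulum is assumed weakly damped ($b/2m < \sqrt{k/m}$).

```python
import math

def damping_characteristics(m, k, b):
    """Parameters of a weakly damped spring pendulum (requires b/(2m) < sqrt(k/m))."""
    beta = b / (2 * m)
    omega0 = math.sqrt(k / m)
    omega = math.sqrt(omega0**2 - beta**2)
    period = 2 * math.pi / omega
    delta = beta * period                          # logarithmic decrement
    q_exact = 2 * math.pi / (1 - math.exp(-2 * delta))
    q_approx = math.sqrt(k * m) / b                # small-damping estimate omega0/(2 beta)
    return delta, q_exact, q_approx

# Illustrative values: m = 0.5 kg, k = 200 N/m, b = 0.2 kg/s.
delta, q_exact, q_approx = damping_characteristics(0.5, 200.0, 0.2)
print(delta, q_exact, q_approx)
```

The exact and approximate values differ by a relative amount of the order of $\delta$, so the estimate $Q = \pi/\delta$ is useful only when $\delta$ is small.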

29.1.6 The conditional period of damped oscillation increases with the damping factor $\beta$ and becomes infinite at $\beta = \omega_0$. If $\beta > \omega_0$ the differential equation of motion of the system

$$\frac{d^2 s}{dt^2} + 2\beta \frac{ds}{dt} + \omega_0^2 s = 0$$

has the following general solution:

$$s = C_1 e^{-\alpha_1 t} + C_2 e^{-\alpha_2 t},$$

where $\alpha_1 = \beta + \sqrt{\beta^2 - \omega_0^2}$, $\alpha_2 = \beta - \sqrt{\beta^2 - \omega_0^2}$, and $C_1$ and $C_2$ are constant factors depending upon the initial conditions. If the initial values (at the instant of time $t = 0$) equal $s = s_0$ and $ds/dt = v_0$, then

$$C_1 = -\frac{\alpha_2 s_0 + v_0}{\alpha_1 - \alpha_2} \quad \text{and} \quad C_2 = \frac{\alpha_1 s_0 + v_0}{\alpha_1 - \alpha_2}.$$

Such motion of the system is not of an oscillatory nature and is said to be aperiodic. Depending upon the initial conditions, aperiodic motion of the system may be either of two kinds (Fig. 29.2). Type a motion occurs when $s_0$ and $v_0$ are opposite in sign and $|v_0| > \alpha_1 |s_0|$. Otherwise, the aperiodic motion is of the b type.
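
The aperiodic solution and the condition separating the two kinds of motion can be explored numerically. The sketch below is an illustration, not the handbook's procedure: the function names and values are assumptions, and the comment on the sign change follows from solving $s(t) = 0$ for the general solution.

```python
import math

def aperiodic_solution(beta, omega0, s0, v0):
    """Return (s, alpha1, alpha2) for the overdamped case beta > omega0."""
    root = math.sqrt(beta**2 - omega0**2)
    alpha1, alpha2 = beta + root, beta - root
    c1 = -(alpha2 * s0 + v0) / (alpha1 - alpha2)
    c2 = (alpha1 * s0 + v0) / (alpha1 - alpha2)
    s = lambda t: c1 * math.exp(-alpha1 * t) + c2 * math.exp(-alpha2 * t)
    return s, alpha1, alpha2

def is_type_a(alpha1, s0, v0):
    """Type a motion: s0 and v0 of opposite sign and |v0| > alpha1 |s0|."""
    # Under this condition s(t) passes through zero once before decaying.
    return s0 * v0 < 0 and abs(v0) > alpha1 * abs(s0)

# Illustrative values: beta = 3, omega0 = 2, start at s0 = 1 with a strong push back.
s, alpha1, alpha2 = aperiodic_solution(3.0, 2.0, 1.0, -10.0)
print(is_type_a(alpha1, 1.0, -10.0), s(0.0), s(1.0))
```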


29.2  Forced  Mechanical  Vibration 

29.2.1 A variable external force, applied to a system and causing its forced mechanical vibration, is called the disturbing, or driving, force.

The differential equation of forced vibration (or oscillation) of the simplest linear system (29.1.1), the spring pendulum (29.1.2), executed along the $OX$ axis due to the action of the variable external force $F_x(t)$ is:

$$\frac{d^2 x}{dt^2} + 2\beta \frac{dx}{dt} + \omega_0^2 x = \frac{1}{m} F_x(t).$$

If $F_x(t)$ is a periodic function of time, then, after application of the force to the pendulum, the forced vibration begins with a transient condition. The pendulum simultaneously participates in two vibrations:

$$x = x_1(t) + x_2(t).$$

The first term corresponds to free damped oscillation* of the pendulum (29.1.3):

$$x_1(t) = A_0 e^{-\beta t} \sin(\omega t + \psi_0),$$

where $\omega = \sqrt{\omega_0^2 - \beta^2}$.

The second term corresponds to the undamped periodic oscillation of the pendulum at a frequency equal to that of the driving force $F_x(t)$.

The amplitude value of $x_1(t)$, equal to $A_0 \exp(-\beta t)$, is reduced at a more or less rapid rate after the beginning of the forced vibration (or oscillation). During the time $\tau_0 = 4.6/\beta$ the amplitude of $x_1(t)$ decreases by a factor of 100. Consequently, after a certain time $\tau \approx \tau_0$ following the beginning of vibration, free vibration of the pendulum practically ceases: $x(t) \approx x_2(t)$. The pendulum reaches a state of steady forced vibration (or oscillation) at the frequency of the driving force.

29.2.2 When the driving force varies according to a harmonic law, i.e. $F_x = F_0 \cos\Omega t$, the steady-state forced vibration of the pendulum is also harmonic at the same frequency. Thus

$$x = A \cos(\Omega t + \varphi_0).$$

The amplitude $A$ of this vibration and the phase shift $\varphi_0$ between the displacement and the driving force depend upon the relation between the angular frequencies of the forced vibration ($\Omega$) and the free undamped vibration ($\omega_0$) of the pendulum. Thus

$$A = \frac{F_0}{m \sqrt{(\omega_0^2 - \Omega^2)^2 + 4\beta^2\Omega^2}}$$

and

$$\tan\varphi_0 = -\frac{2\beta\Omega}{\omega_0^2 - \Omega^2}.$$

At $\Omega = 0$ we obtain $\varphi_0(0) = 0$ and $A(0) = A_0 = F_0/m\omega_0^2 = F_0/k$, which is the static displacement of the pendulum from its equilibrium position by the constant force $F_x = F_0$. At $\Omega \to \infty$, the amplitude $A(\Omega) \to 0$ and $\tan\varphi_0 \to 0$, whereas $\varphi_0 \to -\pi$. The curves of the functions $A(\Omega)$ and $\varphi_0(\Omega)$ are shown in Figs. 29.3 and 29.4 for various values of the damping factor $\beta$.

Fig. 29.3

* It is assumed that $\beta < \omega_0$. Otherwise, the free motion of the pendulum would be aperiodic (29.1.6), i.e.

$$x_1(t) = C_1 \exp\left[-\left(\beta + \sqrt{\beta^2 - \omega_0^2}\right) t\right] + C_2 \exp\left[-\left(\beta - \sqrt{\beta^2 - \omega_0^2}\right) t\right].$$

29.2.3 The amplitude of the displacement in the case of steady-state forced harmonic vibration (oscillation) of the pendulum reaches its maximum value at the angular frequency

$$\Omega_r = \sqrt{\omega_0^2 - 2\beta^2} = \sqrt{\omega^2 - \beta^2},$$

where $\omega$ is the angular frequency of free damped vibration of the pendulum (29.1.3). The frequency $\Omega_r$ is called the resonance frequency.

The maximum amplitude is

$$A_{max} = A(\Omega_r) = \frac{F_0}{2m\beta\omega} = \frac{\pi F_0}{m\delta\omega^2},$$

where $\delta$ is the logarithmic decrement of damping (29.1.4). When $\beta \ll \omega_0$, $\Omega_r \approx \omega_0$, $\varphi_0(\Omega_r) \approx -\pi/2$ and $A_{max} \approx Q A_0$, where $Q \approx \pi/\delta$ is the Q factor of the pendulum (29.1.5) and $A_0$ is the static displacement (29.2.2).

The drastic increase in the amplitude of forced mechanical vibration (or oscillation) as the angular frequency of the driving force approaches the value $\Omega_r$ is the phenomenon called mechanical resonance. Accordingly, the curves of the dependence of $A$ on $\Omega$, shown in Fig. 29.3, are called resonance curves.

As the damping factor $\beta$ is increased, the peaks on the resonance curves are rapidly reduced (at small $\beta$ values the amplitude $A_{max} \propto 1/\beta$) and the resonance frequency $\Omega_r$ is slowly decreased.
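
The steady-state amplitude and phase shift, together with the resonance values, can be evaluated as in the sketch below. It is a hedged illustration: the function name and the parameter values are assumptions, and the phase is taken on the branch $-\pi \le \varphi_0 \le 0$, consistent with $\varphi_0 \to -\pi$ as $\Omega \to \infty$.

```python
import math

def forced_response(f0, m, beta, omega0, big_omega):
    """Amplitude A and phase shift phi0 of x = A cos(Omega t + phi0)."""
    detuning = omega0**2 - big_omega**2
    amplitude = f0 / (m * math.hypot(detuning, 2 * beta * big_omega))
    # atan2 selects the branch -pi <= phi0 <= 0 (displacement lags the force).
    phi0 = math.atan2(-2 * beta * big_omega, detuning)
    return amplitude, phi0

# Illustrative parameters with weak damping (beta much smaller than omega0).
f0, m, beta, omega0 = 1.0, 0.5, 0.2, 20.0
omega_r = math.sqrt(omega0**2 - 2 * beta**2)      # resonance frequency
a_max, phi_r = forced_response(f0, m, beta, omega0, omega_r)
a_static, _ = forced_response(f0, m, beta, omega0, 0.0)
q_factor = omega0 / (2 * beta)
print(a_max, q_factor * a_static, phi_r)
```

For these values the maximum amplitude is close to $Q A_0$ and the phase shift at resonance is close to $-\pi/2$, as expected for $\beta \ll \omega_0$.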

29.2.4 The velocity of the pendulum in steady-state forced harmonic vibration (oscillation) is

$$v_x = \frac{dx}{dt} = -A\Omega \sin(\Omega t + \varphi_0) = A_v \cos(\Omega t + \alpha).$$

Here $A_v = A\Omega$ and $\alpha = \varphi_0 + \pi/2$ are the amplitude of the velocity and the phase difference between the velocity and the driving force:

$$A_v = \frac{F_0 \Omega}{m \sqrt{(\omega_0^2 - \Omega^2)^2 + 4\beta^2\Omega^2}} = \frac{F_0}{m \sqrt{\dfrac{(\omega_0^2 - \Omega^2)^2}{\Omega^2} + 4\beta^2}}$$

and

$$\tan\alpha = -\cot\varphi_0 = \frac{\omega_0^2 - \Omega^2}{2\beta\Omega}.$$

The amplitude of the velocity is maximal at $\Omega = \omega_0$ and equals

$$(A_v)_{max} = A_v(\omega_0) = \frac{F_0}{2m\beta}.$$

In this case, $\alpha = 0$, i.e. the velocity of the pendulum oscillates in phase with the driving force. As $\Omega \to \infty$, the amplitude $A_v \to 0$ and $\alpha \to -\pi/2$, whereas as $\Omega \to 0$, the amplitude $A_v \to 0$ and $\alpha \to \pi/2$.

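29.2.5 The acceleration of the pendulum in steady-state forced harmonic vibration (oscillation) is

$$a_x = \frac{d^2 x}{dt^2} = -A\Omega^2 \cos(\Omega t + \varphi_0) = A_a \cos(\Omega t + \gamma).$$

Here $A_a = A\Omega^2$ and $\gamma = \varphi_0 + \pi$ are the amplitude of the acceleration and the phase difference between the acceleration and the driving force:

$$A_a = \frac{F_0 \Omega^2}{m \sqrt{(\omega_0^2 - \Omega^2)^2 + 4\beta^2\Omega^2}} = \frac{F_0}{m \sqrt{[(\omega_0/\Omega)^2 - 1]^2 + (2\beta/\Omega)^2}}.$$

The amplitude of acceleration is maximal at

$$\Omega = \frac{\omega_0}{\sqrt{1 - 2(\beta/\omega_0)^2}} = \frac{\omega_0^2}{\sqrt{\omega_0^2 - 2\beta^2}}.$$

At $\Omega = 0$, the amplitude $A_a = 0$, whereas as $\Omega \to \infty$ the amplitude of the acceleration tends to the value $A_a(\infty) = F_0/m$.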
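
The three amplitude maxima (displacement, velocity, acceleration) occur at different driving frequencies, which a crude grid search makes visible. The sketch below is illustrative only: the function names, the grid search and the parameter values are assumptions chosen here, with fairly strong damping so that the three maxima separate clearly.

```python
import math

def amplitudes(f0, m, beta, omega0, big_omega):
    """Displacement, velocity and acceleration amplitudes at driving frequency Omega."""
    a = f0 / (m * math.hypot(omega0**2 - big_omega**2, 2 * beta * big_omega))
    return a, a * big_omega, a * big_omega**2

def argmax_on_grid(func, lo, hi, steps=50000):
    """Crude grid search for the frequency that maximizes func."""
    grid = (lo + (hi - lo) * k / steps for k in range(steps + 1))
    return max(grid, key=func)

# Illustrative parameters: omega0 = 2, beta = 0.5.
f0, m, beta, omega0 = 1.0, 1.0, 0.5, 2.0
peak_x = argmax_on_grid(lambda w: amplitudes(f0, m, beta, omega0, w)[0], 0.1, 5.0)
peak_v = argmax_on_grid(lambda w: amplitudes(f0, m, beta, omega0, w)[1], 0.1, 5.0)
peak_a = argmax_on_grid(lambda w: amplitudes(f0, m, beta, omega0, w)[2], 0.1, 5.0)
print(peak_x, peak_v, peak_a)
```

The located maxima can be compared with $\sqrt{\omega_0^2 - 2\beta^2}$, $\omega_0$ and $\omega_0^2/\sqrt{\omega_0^2 - 2\beta^2}$ respectively; the displacement peaks below $\omega_0$ and the acceleration above it.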

29.2.6 In steady-state forced vibration, the energy loss of the vibratory system, due to dissipative forces (1.3.7), is completely compensated by the work performed on the system by the driving force. The work, for example, done in one complete oscillation by the force of resistance acting on the spring pendulum is

$$W_{res} = -b \int_0^T \left(\frac{dx}{dt}\right)^2 dt = -2m\beta A^2 \Omega^2 \int_0^T \sin^2(\Omega t + \varphi_0) \, dt = -m\beta A^2 \Omega^2 T,$$

where $b = 2m\beta$ is the resistance coefficient (29.1.2) and $T = 2\pi/\Omega$ is the period of the forced vibration.

The work done during the same time by the driving force $F_x = F_0 \cos\Omega t$ is

$$W = F_0 \int_0^T \cos\Omega t \, \frac{dx}{dt} \, dt = -A\Omega F_0 \int_0^T \cos\Omega t \, \sin(\Omega t + \varphi_0) \, dt = -\frac{T}{2} A\Omega F_0 \sin\varphi_0 = -W_{res},$$

since $\sin\varphi_0 = -2m\beta A\Omega/F_0$.
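
The energy balance over one period can be confirmed by direct numerical integration. The following sketch is an illustration under stated assumptions: the function name, the midpoint rule and the parameter values are chosen here, and the steady-state amplitude and phase are taken from the formulas of 29.2.2 with $\varphi_0$ between $-\pi$ and 0.

```python
import math

def work_per_period(f0, m, beta, omega0, big_omega, steps=20000):
    """Integrate the work of the resisting and driving forces over one period."""
    a = f0 / (m * math.hypot(omega0**2 - big_omega**2, 2 * beta * big_omega))
    phi0 = math.atan2(-2 * beta * big_omega, omega0**2 - big_omega**2)
    period = 2 * math.pi / big_omega
    dt = period / steps
    w_res = w_drive = 0.0
    for k in range(steps):
        t = (k + 0.5) * dt                            # midpoint rule
        v = -a * big_omega * math.sin(big_omega * t + phi0)
        w_res += (-2 * m * beta * v) * v * dt         # F_res = -b v with b = 2 m beta
        w_drive += f0 * math.cos(big_omega * t) * v * dt
    return w_res, w_drive, a, period

# Illustrative parameters.
m, beta, big_omega = 0.5, 0.2, 15.0
w_res, w_drive, a, period = work_per_period(1.0, m, beta, 20.0, big_omega)
closed_form = -m * beta * a**2 * big_omega**2 * period
print(w_res, w_drive, closed_form)
```

The two works are equal in magnitude and opposite in sign, and the work of the resisting force matches the closed-form value.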

29.2.7 If the driving force acting on the spring pendulum varies periodically, but not according to a harmonic law, it can be represented as the sum of the harmonics of this force (28.4.6). These have different amplitudes, initial phases and angular frequencies that are multiples of $\Omega = 2\pi/T$, where $T$ is the period of variation of the driving force. Since the pendulum is a linear vibratory (or oscillatory) system (29.1.1), each harmonic of the driving force acts on the system as if there were no other harmonics. Therefore, the steady-state forced oscillation of the pendulum, caused by an arbitrary periodic driving force, can be dealt with as the result of superposing steady-state forced oscillations due to each harmonic component of the driving force acting separately. The “contribution” of the various harmonic components to the resultant oscillation depends upon their frequency and amplitude. Owing to resonance phenomena, an essential role is played by only the harmonic components of the driving force that have an angular frequency close to the resonance frequency of the pendulum (29.2.3). If the damping factor $\beta$ of the pendulum is small, the pendulum can execute steady-state forced oscillation close to the harmonic one, even if the driving force is far from harmonic. An example of such forced oscillation is that of a pendulum subject to short periodical external action in the form of impacts in a single direction and repeated over equal lengths of time, which are equal to the period of free oscillation of the pendulum.


29.3  Forced  Electrical  Oscillation 

29.3.1 To obtain forced oscillation in an electrical oscillatory circuit (28.3.1) it is necessary to connect into it a source of electrical energy, whose emf $\mathcal{E}$ varies with time (Fig. 29.5). In electrical engineering a source of electrical energy, characterized by its emf and internal electrical resistance, is called a seat, or source, of emf (voltage source). According to Ohm's law for the portion 1-R-L-2 (21.2.3), the quasi-stationary current (28.3.1), produced in the circuit in forced oscillation, can be found from the formula

$$IR = V_1 - V_2 + \mathcal{E} - L \frac{dI}{dt}.$$

Fig. 29.5

Here $V_2 - V_1 = q/C$ is the potential difference across the capacitor plates, $q$ is the capacitor charge, and the internal electrical resistance of the source of emf is assumed to be negligibly small compared to $R$ (such a source of emf is said to be ideal). From the law of conservation of electrical charge (14.1.3) it follows that $I = dq/dt$. Hence, the differential equation of forced electrical oscillation in a circuit can be presented in a form similar to the equation for forced mechanical vibration (29.2.1):

$$\frac{d^2 q}{dt^2} + 2\beta \frac{dq}{dt} + \omega_0^2 q = \frac{1}{L} \mathcal{E}(t).$$

Here $\beta = R/2L$ is the damping factor for free oscillation in a circuit, and $\omega_0 = 1/\sqrt{LC}$ is the angular frequency of free undamped oscillation (i.e. at $R = 0$).

29.3.2 If the driving emf $\mathcal{E}(t)$ varies according to the harmonic law $\mathcal{E}(t) = \mathcal{E}_0 \cos\Omega t$, then at steady-state forced oscillation (29.2.1), the capacitor charge varies harmonically at the same angular frequency $\Omega$:

$$q = q_0 \cos(\Omega t + \varphi_0).$$

The amplitude $q_0$ and the initial phase $\varphi_0$ are determined by the formulas

$$q_0 = \frac{\mathcal{E}_0}{\Omega \sqrt{R^2 + (\Omega L - 1/\Omega C)^2}} = \frac{\mathcal{E}_0}{L \sqrt{(\omega_0^2 - \Omega^2)^2 + 4\beta^2\Omega^2}}$$

and

$$\tan\varphi_0 = \frac{R}{\Omega L - 1/\Omega C} = -\frac{2\beta\Omega}{\omega_0^2 - \Omega^2}.$$

At $\Omega = 0$, the phase $\varphi_0(0) = 0$ and $q_0(0) = \mathcal{E}_0 C$, which is the charge of the capacitor at a constant potential difference between the plates equal to $\mathcal{E}_0$. At $\Omega \to \infty$, the amplitude $q_0 \to 0$, and $\varphi_0 \to -\pi$. The curves showing the dependence of $\varphi_0$ on $\Omega$ are shown in Fig. 29.4, and those of the dependence of $q_0$ on $\Omega$, in Fig. 29.3, where $A = q_0$ and $A_0 = q_0(0) = \mathcal{E}_0 C$.

29.3.3  With  steady-state  forced  oscillation  in  the  circuit,  the 
current  is 

I  —  — <70^  S'R  (12i  +  (Po)  =  ^oCOS  (Qf — <p). 


The  amplitude  of  the  current  /„  =  qQSl  and  the  initial  phase 
— <P  =  To  +  n/2  are  determined  by  the  formulas: 


h  = 


g0 


/  RiJr{Q.L — l/QC)2 


and  tan  cp  = 


QL  — l/QC 
R 


Curves showing the dependence of $I_0$ on $\Omega$ for various values of
$R$ are called resonance curves of the oscillatory circuit and are
shown in Fig. 29.6; curves of the dependence of $\varphi$ on $\Omega$ are shown
in Fig. 29.7. The resonance angular frequency $\Omega_r$, corresponding
to the maximum amplitude of the current in the circuit in forced
oscillation, is independent of $R$. Thus

$$\Omega_r = \omega_0 = \frac{1}{\sqrt{LC}}.$$

The current amplitude at resonance is $I_0(\Omega_r) = \mathcal{E}_0/R$, and
the phase difference between the current and emf is $\varphi(\Omega_r) = 0$.
When $\Omega < \omega_0$, $\varphi < 0$, i.e. the current leads the emf in phase;
the lower the frequency $\Omega$, the greater this phase lead ($\varphi = -\pi/2$
at $\Omega = 0$). When $\Omega > \omega_0$, $\varphi > 0$, i.e. the current lags in phase
behind the emf; the higher the frequency $\Omega$, the greater this
phase lag ($\varphi \to \pi/2$ as $\Omega \to \infty$).
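
The amplitude and phase formulas of Sect. 29.3.3 can be checked numerically. The following is a minimal Python sketch under stated assumptions, not part of the handbook: the function name `current_response` and the component values are illustrative only.

```python
import math

def current_response(emf0, R, L, C, omega):
    """Steady-state current I = I0*cos(omega*t - phi) in a series RLC circuit.

    Returns (I0, phi); phi > 0 means the current lags behind the emf.
    """
    X = omega * L - 1.0 / (omega * C)   # reactance, ohm
    I0 = emf0 / math.hypot(R, X)        # amplitude: emf0 / sqrt(R^2 + X^2)
    phi = math.atan2(X, R)              # tan(phi) = X / R, with R > 0
    return I0, phi

# Assumed example values: 10 V, 5 ohm, 0.1 H, 10 uF
emf0, R, L, C = 10.0, 5.0, 0.1, 1e-5
omega_r = 1.0 / math.sqrt(L * C)        # resonance angular frequency, rad/s

for omega in (0.5 * omega_r, omega_r, 2.0 * omega_r):
    I0, phi = current_response(emf0, R, L, C, omega)
    print(f"omega = {omega:7.1f} rad/s   I0 = {I0:.4f} A   phi = {phi:+.4f} rad")
```

With these assumed values the amplitude is largest at `omega_r`, and `phi` changes sign there, from negative (current leads) to positive (current lags).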

29.3.4 The potential difference across the terminals a and b
of an ideal source of harmonic emf (Fig. 29.5) is equal to its
emf. Thus

$$u = V_a - V_b = \mathcal{E}_0 \cos\Omega t.$$

The potential drops across the various portions of the circuit
illustrated in Fig. 29.5, which carries the sinusoidal (harmonic)
current $I = I_0 \cos(\Omega t - \varphi)$ and contains a capacitor of
capacitance $C$, a resistor of resistance $R$ and a coil of inductance
$L$, are as follows:

$$u_C = V_2 - V_1 = \frac{q}{C} = U_C \cos\left(\Omega t - \varphi - \frac{\pi}{2}\right),$$

$$u_R = IR = U_R \cos(\Omega t - \varphi),$$

$$u_L = L\frac{dI}{dt} = U_L \cos\left(\Omega t - \varphi + \frac{\pi}{2}\right).$$

The variation of $u_R$ is in phase with that of the current $I$ in the
circuit, $u_L$ leads the current in phase by $\pi/2$, and $u_C$ lags behind
the current in phase by $\pi/2$. Here

$$u_C + u_R + u_L = u = \mathcal{E}_0 \cos\Omega t.$$

29.3.5 The amplitude values of $u_C$, $u_L$ and $u_R$ are equal,
respectively, to

$$U_C = X_C I_0, \quad U_L = X_L I_0 \quad\text{and}\quad U_R = R I_0,$$

where $X_C = 1/\Omega C$ is the capacitive reactance of the circuit, and
$X_L = \Omega L$ is the inductive reactance of the circuit. The quantity
$X = X_L - X_C = \Omega L - 1/\Omega C$ is called the reactance of the
circuit, $R$ is called the resistance and $Z = \sqrt{R^2 + (\Omega L - 1/\Omega C)^2}$
is called its impedance.

The formulas in Sect. 29.3.3 for the amplitude of the sinusoidal
current in the circuit and its initial phase can be rewritten in
the form $I_0 = \mathcal{E}_0/Z$ and $\tan\varphi = X/R$, with $\cos\varphi = R/Z$
and $\sin\varphi = X/Z$.

At resonance $\Omega = 1/\sqrt{LC}$ and $X_L = X_C$, so that the reactance
of the circuit becomes zero and the impedance reaches its
minimum value, which is equal to the resistance $R$. Thus

$$X(\Omega_r) = 0 \quad\text{and}\quad Z(\Omega_r) = Z_{\min} = R.$$

In this case $U_R = \mathcal{E}_0$ and $U_C = U_L = \sqrt{L/C}\,\mathcal{E}_0/R$.
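
The resonance relations can be illustrated with a short Python sketch. It is only an illustration under assumed component values; the helper name `voltage_amplitudes` is hypothetical and does not come from the handbook.

```python
import math

def voltage_amplitudes(emf0, R, L, C, omega):
    """Impedance Z and amplitudes U_R, U_L, U_C of a series RLC circuit."""
    X_L = omega * L                     # inductive reactance
    X_C = 1.0 / (omega * C)             # capacitive reactance
    Z = math.hypot(R, X_L - X_C)        # impedance sqrt(R^2 + (X_L - X_C)^2)
    I0 = emf0 / Z                       # current amplitude
    return {"Z": Z, "U_R": R * I0, "U_L": X_L * I0, "U_C": X_C * I0}

# Assumed example values, evaluated at the resonance frequency
emf0, R, L, C = 10.0, 5.0, 0.1, 1e-5
at_resonance = voltage_amplitudes(emf0, R, L, C, 1.0 / math.sqrt(L * C))
expected_reactive = math.sqrt(L / C) * emf0 / R   # sqrt(L/C) * emf0 / R
print(at_resonance)
print(expected_reactive)
```

For these assumed values the ratio $\sqrt{L/C}/R$ equals 20, so the amplitudes across the coil and the capacitor are each twenty times the emf amplitude while they cancel one another in the sum.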

29.3.6 The effective value of a periodic current (and, accordingly,
of the emf, voltage, etc.) is the root-mean-square value of the
current over the period $T$ of its variation. Thus

$$I_{\mathrm{eff}} = \sqrt{\frac{1}{T}\int_0^T I^2\,dt}.$$

For a sinusoidal current and a sinusoidal emf

$$I_{\mathrm{eff}} = \frac{I_0}{\sqrt{2}} \quad\text{and}\quad \mathcal{E}_{\mathrm{eff}} = \frac{\mathcal{E}_0}{\sqrt{2}}.$$

An element of work done by the sinusoidal current during the short
time $dt$ in the circuit illustrated in Fig. 29.5 is equal to

$$\delta W = Iu\,dt = I_0\mathcal{E}_0 \cos(\Omega t - \varphi)\cos\Omega t\,dt.$$

The instantaneous power of the current in the circuit is

$$P_{\mathrm{ins}} = \frac{\delta W}{dt} = Iu = I_0\mathcal{E}_0 \cos(\Omega t - \varphi)\cos\Omega t.$$

The average value of the instantaneous power during a period
is called the active, or true, power $P$ of the current in the electric
circuit. Thus

$$P = \frac{1}{T}\int_0^T P_{\mathrm{ins}}\,dt = \frac{1}{2} I_0\mathcal{E}_0 \cos\varphi = I_{\mathrm{eff}}\mathcal{E}_{\mathrm{eff}} \cos\varphi.$$

The factor $\cos\varphi$ is called the power factor. Since $I_{\mathrm{eff}} = \mathcal{E}_{\mathrm{eff}}/Z$
and $\cos\varphi = R/Z$,

$$P = \frac{R}{Z^2}\,\mathcal{E}_{\mathrm{eff}}^2.$$

At resonance $Z = R$ and the active power reaches its maximum
value

$$P_{\max} = \frac{\mathcal{E}_{\mathrm{eff}}^2}{R}.$$
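
The averaging that leads to the active power can be reproduced numerically. The sketch below is an illustration only: the function name `average_power`, the sample count and the numerical values are assumptions, and the midpoint rule stands in for the integral over one period.

```python
import math

def average_power(emf0, I0, phi, n=10000):
    """Average P_ins = I0*emf0*cos(x - phi)*cos(x) over one period.

    x stands for Omega*t; the midpoint rule with n samples is used.
    """
    total = 0.0
    for j in range(n):
        x = 2.0 * math.pi * (j + 0.5) / n
        total += I0 * emf0 * math.cos(x - phi) * math.cos(x)
    return total / n

# Assumed example values
emf0, I0, phi = 10.0, 2.0, 0.3
numeric = average_power(emf0, I0, phi)
closed_form = 0.5 * I0 * emf0 * math.cos(phi)   # (1/2) * I0 * emf0 * cos(phi)
print(numeric, closed_form)
```

The two printed numbers should agree to within rounding error, which reflects the fact that the term oscillating at twice the driving frequency averages to zero over a period.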


CHAPTER  30  ELASTIC  WAVES 

30.1  Longitudinal  and  Transverse  Waves  in  an 
Elastic  Medium 

30.1.1  A  body  is  said  to  be  elastic  and  its  deformation,  due  to 
the  application  of  external  forces,  is  elastic  deformation  if  it 
completely  disappears  after  the  external  forces  are  removed. 
According to Hooke's law, elastic deformation is directly proportional
to the external effects that cause it, i.e. it has a linear
dependence  on  these  effects.  At  sufficiently  small  deformation, 
all  bodies  can  be  assumed,  practically,  to  be  elastic. 

The elastic properties of bodies depend upon the nature of the
thermal motion of their molecules and their forces of interaction.
For example, a gaseous body changes its shape without hindrance
to suit the shape of the vessel it is contained in; a gas has no
elasticity of shape. But a gas has elasticity of volume, i.e. the
capacity to resist changes in its volume. This property of a gas
is due to the thermal motion of its molecules and is manifested
in the change in the pressure $p$ of the gas when its volume $V$ is
changed. According to Hooke's law for volume deformation, the
change $dp$ in the pressure of a gas for a small change $dV$ in its
volume is directly proportional to the relative volume deformation,
called the volume strain. Thus

$$dp = -K\,\frac{dV}{V},$$

where  K  is  called  the  bulk  modulus  of  elasticity  of  the  gas.  For 
an  ideal  gas  (8.4.1)  the  value  of  K  depends  upon  the  kind  of 
thermodynamic  process  of  compression  (or  expansion)  being 
carried  out.  At  a  very  slow  change  in  the  volume  of  the  gas,  the 
process  can  be  considered  isothermal  (8.3.7),  whereas  at  a  rapid 
change it can be considered adiabatic (8.3.7). In the former case
$K = p$ and in the latter $K = \kappa p$, where $\kappa$ is the adiabatic
exponent (9.5.11).
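
The two values of the bulk modulus follow in one step from the corresponding process equations. The short derivation below is a sketch that takes Hooke's law for volume deformation in the form $dp = -K\,dV/V$, i.e. $K = -V\,dp/dV$ (the form also used in Sect. 30.3.1), and writes $\kappa$ for the adiabatic exponent:

$$\begin{aligned}
\text{isothermal:}\quad & pV = \text{const} \;\Rightarrow\; V\,dp + p\,dV = 0 \;\Rightarrow\; K = -V\frac{dp}{dV} = p,\\
\text{adiabatic:}\quad & pV^{\kappa} = \text{const} \;\Rightarrow\; V^{\kappa}\,dp + \kappa p V^{\kappa-1}\,dV = 0 \;\Rightarrow\; K = -V\frac{dp}{dV} = \kappa p.
\end{aligned}$$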

30.1.2 The elasticity of a crystalline solid is due to the forces
of mutual attraction and repulsion of the particles (ions, atoms
or molecules) that make up the body and that have random thermal
vibration about the points of its crystal lattice. The forces
of interaction of the particles impede deformation of the crystal
lattice that is associated with changes in both the volume and the
shape of the body. Therefore, in addition to their bulk elasticity,
solids also have elasticity of form, which is manifested by their
resistance to shear deformation (40.3.9).

The elasticity of liquids is also due to the forces of intermolecular
interaction. But, owing to the fact that the average time $\tau$ that
molecules of a liquid remain in a temporarily settled state
(13.2.4) is very short, liquids, like gases, possess only bulk
elasticity. They manifest elasticity of form only with respect
to variable deformation of ultrahigh frequency with a period
that is less than or of the order of $\tau$.

30.1.3  Elastic  waves  are  mechanical  disturbances  (deformations) 
propagated  in  an  elastic  medium.  Bodies  that  act  on  the  medium 
and  cause  these  disturbances  are  called  sources  of  the  waves. 
A member of the audience in a theatre, for instance, hears the
speech and singing of the actors and the sounds of the musical
instruments owing to the variations in air pressure that are caused
by the sources of sound and reach the audience.
Sonic, or acoustic, waves are elastic waves of low intensity, i.e.
weak mechanical disturbances propagated in an elastic medium.
Sonic waves, acting on the organs of hearing of people, are
capable of producing auditory sensations if the frequency $\nu$ of
their corresponding vibrations lies in the range from 16 to
$2 \times 10^4$ Hz (audible sound). Elastic waves with a frequency
$\nu < 16$ Hz are said to be infrasonic, whereas those with a frequency
$\nu > 2 \times 10^4$ Hz are ultrasonic (elastic waves with
$\nu > 10^9$ Hz are often said to be hypersonic).
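
The frequency ranges just listed can be summarized as a small lookup. This Python sketch is only an illustration; the function name `classify_elastic_wave` is hypothetical, and treating the boundary values 16 Hz and $2 \times 10^4$ Hz as audible is an assumption about how the limits are assigned.

```python
def classify_elastic_wave(freq_hz):
    """Name the frequency range of an elastic wave, following Sect. 30.1.3."""
    if freq_hz < 16.0:
        return "infrasonic"
    if freq_hz <= 2e4:
        return "audible"
    if freq_hz <= 1e9:
        return "ultrasonic"
    return "hypersonic"   # the text treats this as a sub-range of ultrasound

# Assumed sample frequencies, Hz
for f in (5.0, 440.0, 1e6, 5e9):
    print(f, classify_elastic_wave(f))
```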

30.1.4  The  propagation  of  elastic  waves  in  a  medium  does  not 
involve  the  transport  of  matter*.  In  a  boundless  medium,  it 
consists  in  exciting  forced  vibrations  of  parts  of  the  medium 
that  are  farther  and  farther  away  from  the  source  of  the  waves. 
Here it is permissible to disregard the discrete (molecular) structure
of the medium and to treat it as a continuous medium,
distributed continuously in space and possessing definite elastic
properties. A particle of such a medium, having forced vibration,
is understood to be a small element of its volume whose size,
however, is many times greater than the intermolecular distances
so that it contains an enormous number of molecules. In practice,
such a particle can be dealt with as a point because even in
a gas the intermolecular distances are extremely small (of the
order of $10^{-8}$ m under standard conditions).

30.1.5  An  elastic  wave  is  said  to  be  longitudinal  if  the  particles 
of  the  medium  vibrate  in  the  direction  of  wave  propagation. 
Longitudinal  waves  are  associated  with  bulk  deformation  of  the 
elastic  medium  and  can  therefore  propagate  in  any  kind  of 
medium: solid, liquid or gaseous. Sonic waves in air are an
example of such waves.

An elastic wave is said to be transverse if the particles of the
medium vibrate, remaining in planes perpendicular to the direction
of wave propagation. Transverse waves are associated with
shear deformation of the elastic medium and, consequently,
can be generated and propagated only in media having elasticity
of shape, i.e. in solids. The waves propagated along the strings
of musical instruments are an example of transverse waves.
Special properties are found in surface waves, which propagate
along the free surfaces of liquids (or the interface between two
immiscible liquids), being a perturbation of the surface due to
external effects (dropping of bodies into the liquid, the motion
of ships, the wind, etc.). The forces of gravity and surface tension
play a decisive role in the generation and propagation of these
waves. In surface waves the particles of liquid have transverse
and longitudinal vibrations simultaneously, describing elliptical
or more complex paths.

* A certain transport of matter (mass transfer) may occur in the
propagation in a medium of strong disturbances (for example,
shock waves in explosions), when the vibration of the particles
of the medium becomes nonlinear.

30.1.6 A medium is said to be uniform if its physical properties,
essential in the problem being considered, do not change from
point to point. A medium that is uniform with respect to certain
physical properties may be nonuniform with respect to others.
A monocrystalline body, for example, is uniform with respect
to its elastic properties and, at the same time, optically
nonuniform for X-rays.

A  medium  is  isotropic  if  its  physical  properties  that  are  essential 
in  the  problem  being  considered  are  the  same  in  all  directions. 
A  medium  that  is  isotropic  with  respect  to  certain  physical 
properties  may  be  anisotropic  with  respect  to  others.  Cubic 
crystals,  for  instance,  are  optically  isotropic,  but  are  anisotropic 
with  respect  to  other  properties.  Gases  and  liquids  are  isotropic 
with  respect  to  any  physical  properties. 

30.1.7 A medium is said to be linear if there is a directly
proportional relation between the quantities characterizing the
external effects being considered that act on the medium and
the changes they lead to in the state of the medium. An elastic
medium, for example, obeying Hooke's law (30.1.1) is linear
with respect to its mechanical properties. A dielectric is a linear
medium with respect to its electrical properties if its dielectric
constant (14.2.4) is independent of the electric field strength.
By analogy, a magnetic material is a linear medium with respect
to its magnetic properties if its relative magnetic permeability
(26.4.5) is independent of the magnetic induction of
the field.

30.2  Travelling  Wave  Equation 

30.2.1  Travelling  waves  are  ones  which,  in  contrast  to  standing 
waves  (30.5.4),  transport  energy  from  one  part  of  space  to 
another. 

The travelling wave equation is the dependence on the coordinates
and time of the scalar or vector quantities that characterize the
vibration of the medium as the wave being considered is propagated
through it. For waves in a solid medium, for instance,
such a quantity may be the vector of displacement of the particles
of the medium (30.1.4) from their equilibrium position or
the three projections of the vector on the coordinate axes.
Longitudinal waves in a gas or liquid are usually characterized by the
surplus pressure of the vibrating medium, equal to the difference
between its variable and equilibrium pressures.

A  ray  is  a  curve  whose  tangent  at  each  point  coincides  with  the 
direction  of  propagation  of  a  wave,  i.e.  with  the  direction  of 
energy  transport  by  the  wave  (30.3.5).  In  a  uniform  medium 
(30.1.6)  rays  are  straight  lines. 

30.2.2 An elastic wave is said to be sinusoidal, or harmonic, if
the corresponding vibration of particles of the medium is harmonic
(28.1.3). The frequency of this vibration is called the
wave frequency. Variations in pressure in a gaseous or liquid
medium through which a sinusoidal wave propagates also take
place according to a harmonic law at a frequency equal to the
wave frequency. In a transverse sinusoidal wave, particles of the
medium can simultaneously vibrate with harmonic motion at
the wave frequency in two mutually perpendicular directions,
each of which is perpendicular to the direction of wave propagation.
Depending upon the kind of polarization of the resultant
vibration (28.4.9), distinction is made between the following
types of polarization of transverse sinusoidal waves: elliptical,
circular and linear (or plane).

30.2.3 A mechanical disturbance (deformation) propagates in
an elastic medium at the finite velocity $v$. Therefore, disturbances
initiated by the wave source at the instant of time $t_0$
reach an arbitrary point $M$ of the medium at the instant $t > t_0$.
The greater the path $l$ that the wave travels from its source to
point $M$, the greater the difference $t - t_0 = l/v$. Consequently,
the vibration at point $M$ lags in phase behind that at the source
of the waves.

The wave surface, or wavefront, is the locus of all the points at
which the phase of the vibration has the same value at the
instant of time being considered. The difference $t - t_0$ is the
same for all points in a wavefront. Through each point of a
medium subject to wave motion one wavefront can be passed
that corresponds to the value of the phase of the vibration at
this point at the instant of time being considered. Many different
values of the phase of a vibration correspond to a family of
wavefronts. In a uniform isotropic medium (30.1.6) the wavefronts
are orthogonal to the rays (30.2.1).

30.2.4 A wave is said to be plane if its wavefronts constitute
a system of parallel planes. In a plane wave propagating
along the $OX$ axis, all the quantities $s$ characterizing the vibratory
motion of the medium depend only upon the time $t$ and the
coordinate $x$ of the point $M$ being considered in the medium.
If there is no absorption of waves in the medium (30.3.7), the
vibration at point $M$ differs from that at the origin of coordinates
$O$ only in that it is shifted in time by the amount $x/v$, where $v$ is
the wave velocity. Hence, in a plane wave propagating along
the positive direction of the $OX$ axis, $s$ is a function of the
difference $(t - x/v)$ and the equation of such a plane wave is

$$s = f\left(t - \frac{x}{v}\right).$$

Accordingly, the equation for a plane wave propagating in the
opposite direction is

$$s = f\left(t + \frac{x}{v}\right).$$

30.2.5 The equation of a plane sinusoidal, or simply sine, wave
propagating in a nonabsorbing medium along the positive
direction of the $OX$ axis is

$$s = A \sin\left[\omega\left(t - \frac{x}{v}\right) + \varphi_0\right] = A \sin\left(\omega t - \frac{\omega}{v}x + \varphi_0\right)$$

or

$$s = A \sin\left(\frac{2\pi}{T}t - \frac{2\pi}{vT}x + \varphi_0\right),$$

where $A = \text{const}$ is the amplitude of vibration and is called the
wave amplitude, $\omega = 2\pi/T$ is the angular frequency of the wave,
$T$ is the period of vibration, and $\varphi_0$ is the initial phase of the
vibration (at the instant of time $t = 0$) at points of the coordinate
plane $x = 0$. The quantity $\Phi = \omega t - \omega x/v + \varphi_0$, equal to
the phase of the vibration at an arbitrary point with the coordinate
$x$, is called the phase of the plane wave (see also Sect. 30.2.6).

30.2.6 The distance $\lambda = vT$ that a sine wave travels during the
time equal to the period of vibration is called the wavelength.
The wavelength is equal to the distance between the two closest
points in a medium at which the phase difference of the vibration
equals $2\pi$.

Along with the wavelength, use is made of another characteristic
of a sine wave. It is the wavenumber

$$k = \frac{2\pi}{\lambda} = \frac{2\pi}{vT} = \frac{\omega}{v}.$$

Hence, the equation of a plane sine wave (30.2.5) can be written
in the form

$$s = A \sin\left(\omega t - \frac{2\pi}{\lambda}x + \varphi_0\right) = A \sin(\omega t - kx + \varphi_0).$$

Accordingly, the phase of this plane wave is $\Phi = \omega t - kx + \varphi_0$.
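
The relations between period, wavelength, wavenumber and angular frequency can be tried out on numbers. The following Python sketch is illustrative only: the function name `wave_parameters` and the values (a 440 Hz tone travelling at 340 m/s) are assumptions.

```python
import math

def wave_parameters(v, freq):
    """Return (wavelength, wavenumber k, angular frequency omega)."""
    T = 1.0 / freq                     # period of vibration, s
    wavelength = v * T                 # lambda = v*T
    k = 2.0 * math.pi / wavelength     # k = 2*pi/lambda
    omega = 2.0 * math.pi / T          # omega = 2*pi/T
    return wavelength, k, omega

# Assumed example: v = 340 m/s, frequency 440 Hz
wavelength, k, omega = wave_parameters(340.0, 440.0)
print(wavelength, k, omega / 340.0)    # k and omega/v should agree
```

The product `k * wavelength` equals $2\pi$, which is the phase difference between points one wavelength apart.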

30.2.7  The  wave  vector  is  the  vector  k,  equal  in  magnitude  to  the 
wavenumber  k  and  having  the  direction  along  the  ray  (30.2.1) 
at  the  point  M  being  considered  in  the  medium.  The  wave  vector 
of  a  plane  sine  wave  does  not  depend  on  the  choice  of  point  M, 
and  the  equation  for  such  a  wave  can  be  written  in  the  form 

$$s = A \sin(\omega t - \mathbf{k}\cdot\mathbf{r} + \alpha),$$

where $\mathbf{r}$ is the radius vector of point $M$, and $\alpha$ is the initial
phase of vibration at the origin of coordinates, i.e. at the point
$\mathbf{r} = 0$.

On the basis of the Euler formula (28.1.7), the equation of a plane
sine wave can be written in exponential form, which is convenient
for differentiation:

$$\tilde{s} = A e^{i(\omega t - \mathbf{k}\cdot\mathbf{r} + \delta)},$$

where $\delta = \alpha - \pi/2$. Only the real part of the complex quantity
$\tilde{s}$ is of physical meaning, i.e. the quantity $s = \operatorname{Re}(\tilde{s})$. Therefore,
in making use of $\tilde{s}$ to determine some characteristic of a wave,
it is necessary to discard the imaginary part of the final complex
expression.
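
That the real part of the exponential form reproduces the sine form, with the phase shifted by $\pi/2$, can be confirmed numerically. The sketch below is a one-dimensional illustration (the scalar product is replaced by `k * x`); the function names and sample values are assumptions.

```python
import cmath
import math

def s_real(A, omega, k, x, t, alpha):
    """Real form: s = A*sin(omega*t - k*x + alpha)."""
    return A * math.sin(omega * t - k * x + alpha)

def s_complex(A, omega, k, x, t, alpha):
    """Exponential form: A*exp(i*(omega*t - k*x + delta)), delta = alpha - pi/2."""
    delta = alpha - math.pi / 2.0
    return A * cmath.exp(1j * (omega * t - k * x + delta))

# Assumed sample values: A, omega, k, x, t, alpha
args = (2.0, 3.0, 1.5, 0.7, 0.2, 0.4)
print(s_real(*args), s_complex(*args).real)
```

The convenience for differentiation is that each time derivative of the exponential form is a multiplication by $i\omega$, and each derivative with respect to $x$ a multiplication by $-ik$.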

30.2.8 A wave is said to be spherical if its wavefronts are
concentric spheres. The centre of these spheres is called the wave
centre. Such waves are generated in a uniform isotropic medium
by a solitary point source. The equation of a diverging spherical
wave is of the form

$$s = \frac{1}{r} f\left(t - \frac{r}{v}\right),$$

where $r$ is the distance from the wave centre to the point $M$
being considered in the medium, and $v$ is the wave velocity.

In the case of a spherical sine wave

$$s = \frac{a_0}{r} \sin(\omega t - kr + \alpha),$$

where $a_0/r = A(r)$ is the wave amplitude (30.3.6), $a_0$ is a
physical quantity numerically equal to the wave amplitude at
unit distance from the wave centre, $\alpha$ is the initial phase of
vibration at the wave centre, and $\Phi = \omega t - kr + \alpha$ is the
phase of the spherical wave.

In the exponential form, the equation of a spherical sine wave is

$$\tilde{s} = \frac{a_0}{r} e^{i(\omega t - kr + \delta)} = \frac{a_0}{r} e^{i(\omega t - \mathbf{k}\cdot\mathbf{r} + \delta)},$$

where $\delta = \alpha - \pi/2$, $\mathbf{r}$ is the radius vector from the wave centre
to the point $M$ being considered, and the wave vector $\mathbf{k}$ at point $M$
is in the direction radially away from the wave centre.

Real sources of waves are always of finite size. But they can be
dealt with as point sources, and the waves they generate in a
uniform isotropic medium as spherical ones, if the distance $r$
from the source to the points being considered in the medium is
considerably greater than the dimensions of the source. When $r$
is extremely great, any small portions of a wavefront can be
regarded as being plane surfaces.

30.2.9 The propagation of waves in a uniform isotropic medium
can be described by a partial differential equation, called the
wave equation. Thus

$$\frac{\partial^2 s}{\partial x^2} + \frac{\partial^2 s}{\partial y^2} + \frac{\partial^2 s}{\partial z^2} = \frac{1}{v^2}\frac{\partial^2 s}{\partial t^2} \quad\text{or}\quad \Delta s = \frac{1}{v^2}\frac{\partial^2 s}{\partial t^2}.$$

Here $s$ is a physical quantity that characterizes a disturbance
propagating in the medium at the velocity $v$, and
$\Delta = \partial^2/\partial x^2 + \partial^2/\partial y^2 + \partial^2/\partial z^2$ is the Laplace operator, or Laplacian. This
equation is satisfied, among others, by a plane wave (30.2.7)
and a diverging spherical wave (30.2.8).

Function $s$, characterizing a sine wave in a uniform isotropic
medium, satisfies two partial differential equations at the
same time:

$$\Delta s = -k^2 s,$$

where $k$ is the wavenumber (30.2.6), and

$$\frac{\partial^2 s}{\partial t^2} = -\omega^2 s,$$

where $\omega$ is the angular frequency of the wave.
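
As a brief worked check, offered as a sketch rather than as the handbook's own derivation, a plane sine wave travelling along the $OX$ axis can be substituted into these equations. With $s = A\sin(\omega t - kx + \varphi_0)$ and $k = \omega/v$:

$$\begin{aligned}
\frac{\partial^2 s}{\partial x^2} &= -k^2 A\sin(\omega t - kx + \varphi_0) = -k^2 s, \qquad \frac{\partial^2 s}{\partial y^2} = \frac{\partial^2 s}{\partial z^2} = 0,\\
\frac{\partial^2 s}{\partial t^2} &= -\omega^2 A\sin(\omega t - kx + \varphi_0) = -\omega^2 s,\\
\Delta s &= -k^2 s = \frac{k^2}{\omega^2}\,\frac{\partial^2 s}{\partial t^2} = \frac{1}{v^2}\,\frac{\partial^2 s}{\partial t^2}.
\end{aligned}$$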

30.2.10 The velocity $v$ of propagation of a sine wave is called
the phase velocity. It is equal to the velocity of displacement in
space of the points in a surface that corresponds to any fixed
value of the phase of a sine wave. For example, in the case of a
plane sine wave (30.2.6) it follows from the condition
$\omega t - kx + \varphi_0 = \text{const}$ that $dx/dt = \omega/k = v$. In the same way, in the
case of a spherical sine wave (30.2.8), it follows from the condition
$\omega t - kr + \alpha = \text{const}$ that $dr/dt = \omega/k = v$.


30.3  Phase  Velocity  and  Energy  of  Elastic  Waves 

30.3.1 The phase velocity of sonic waves (velocity of sound) in a
liquid or gas is

$$v = \sqrt{\frac{K}{\rho}},$$

where $\rho$ is the density of the undisturbed medium, and $K$ is
the bulk modulus of elasticity of the medium (30.1.1). The
frequency of audible sonic waves $\nu > 16$ Hz, and the process
of deforming the medium can be assumed to be adiabatic (8.3.7),
i.e. $K = -V (dp/dV)_{\mathrm{ad}}$. The relation between the pressure $p$
and volume $V$ for an ideal gas (8.4.1) in an adiabatic process is
$pV^{\kappa} = \text{const}$, where $\kappa$ is the adiabatic exponent (9.5.11), so that
$K = \kappa p$. Therefore, the velocity of sound in an ideal gas is equal
to

$$v = \sqrt{\frac{\kappa p}{\rho}} = \sqrt{\frac{\kappa R T}{\mu}},$$

where $\mu$ is the molar mass of the gas, $T$ is its absolute temperature,
and $R$ is the universal gas constant.
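
A numerical illustration of the velocity of sound in an ideal gas, $v = \sqrt{\kappa p/\rho} = \sqrt{\kappa RT/\mu}$, is sketched below in Python. The function names are hypothetical, and the values used for air (adiabatic exponent 1.4, molar mass 0.029 kg/mol, temperature 293 K, pressure 101325 Pa) are assumptions, not data from the handbook.

```python
import math

R_GAS = 8.314  # universal gas constant, J/(mol*K)

def sound_speed_ideal_gas(kappa, molar_mass, temperature):
    """v = sqrt(kappa*R*T/mu) for an ideal gas."""
    return math.sqrt(kappa * R_GAS * temperature / molar_mass)

def sound_speed_from_pressure(kappa, pressure, density):
    """v = sqrt(kappa*p/rho)."""
    return math.sqrt(kappa * pressure / density)

# Assumed values for air
kappa, mu, T = 1.4, 0.029, 293.0
p = 101325.0                       # pressure, Pa
rho = p * mu / (R_GAS * T)         # density from the ideal-gas law, kg/m^3
v_temperature = sound_speed_ideal_gas(kappa, mu, T)
v_pressure = sound_speed_from_pressure(kappa, p, rho)
print(v_temperature, v_pressure)
```

Both expressions give the same value, a little over 340 m/s for these inputs, and the result does not depend on the pressure at a fixed temperature.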

30.3.2 The phase velocity of transverse elastic waves in a uniform
isotropic solid medium is

$$v = \sqrt{\frac{G}{\rho}},$$

where $G$ is the shear modulus of the medium (40.3.9) and $\rho$ is
its density.

The propagation of longitudinal waves in a thin long rod is
associated with its longitudinal tension and compression.
Hence, the phase velocity of such waves is

$$v = \sqrt{\frac{E}{\rho}},$$

where $E$ is Young's modulus (40.3.6) for the rod material.

The velocity of propagation of transverse waves along a stretched
string (a thin flexible cord) is

$$v = \sqrt{\frac{F}{\rho S}},$$

where $F$ is the stretching force (tension) in the string, and $\rho$ and
$S$ are the density of the string material and the cross-sectional
area of the string.
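
The three velocities of Sect. 30.3.2, $\sqrt{G/\rho}$, $\sqrt{E/\rho}$ and $\sqrt{F/(\rho S)}$, can be compared on rough numbers. The Python sketch below uses assumed, order-of-magnitude values for steel (shear modulus $8 \times 10^{10}$ Pa, Young's modulus $2 \times 10^{11}$ Pa, density 7800 kg/m³) and a hypothetical string; none of these figures comes from the handbook.

```python
import math

def transverse_speed(G, rho):
    """Transverse waves in an isotropic solid: v = sqrt(G/rho)."""
    return math.sqrt(G / rho)

def rod_longitudinal_speed(E, rho):
    """Longitudinal waves in a thin long rod: v = sqrt(E/rho)."""
    return math.sqrt(E / rho)

def string_speed(F, rho, S):
    """Transverse waves on a stretched string: v = sqrt(F/(rho*S))."""
    return math.sqrt(F / (rho * S))

# Assumed values: steel, and a string of 1 mm^2 cross-section under 100 N
G, E, rho = 8.0e10, 2.0e11, 7800.0
v_shear = transverse_speed(G, rho)
v_rod = rod_longitudinal_speed(E, rho)
v_string = string_speed(100.0, rho, 1.0e-6)
print(v_shear, v_rod, v_string)
```

With these inputs the waves in the bulk material travel at a few kilometres per second, while the wave on the string is much slower because its speed is set by the tension rather than by an elastic modulus.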

30.3.3 An elastic medium in which mechanical waves propagate
has kinetic energy of vibratory motion of its particles and
potential energy due to deformation (30.1.1). If $v_1$ is the velocity
of particles of the medium, then the volume kinetic energy density
of the medium is

$$E_k^d = \frac{dE_k}{dV} = \frac{1}{2}\rho v_1^2,$$

where $\rho$ is the density of the medium, and $dE_k$ is the kinetic
energy of all the particles in the small volume $dV$ of the medium.
This volume has been selected so that throughout it the velocity
$v_1$ is the same.

The volume potential energy density of an elastically deformed
medium is

$$E_p^d = \frac{dE_p}{dV} = \frac{1}{2}\rho v^2 \varepsilon^2,$$

where $dE_p$ is the potential energy of a uniformly deformed
small portion of the medium of volume $dV$, $v$ is the phase velocity
in the medium, and $\varepsilon$ is the relative deformation.

The volume energy density of elastic waves is understood to be
the volume density $E^d$ of the mechanical energy of a medium due
to the propagation of these waves. Thus

$$E^d = E_k^d + E_p^d = \frac{1}{2}\rho(v_1^2 + v^2\varepsilon^2).$$


30.3.4 If a longitudinal plane travelling wave (30.2.4) propagates
in a medium, then $v_1 = \partial s/\partial t$, where $s$ is the displacement
of the particles, and $\varepsilon = \partial s/\partial x = -v_1/v$, so that

$$E_p^d = E_k^d \quad\text{and}\quad E^d = \rho v_1^2 = \rho\left(\frac{\partial s}{\partial t}\right)^2.$$

At each point of the medium subject to wave motion, $E_k^d$ and
$E_p^d$ are identical functions of time. Consequently, $E^d$ also varies
with time. This principle is valid for any travelling waves in
an elastic medium, regardless of the shape of their wavefronts
and the type of deformation of the medium. It follows from the
law of conservation of energy as applied to the process of the
propagation of vibrations in an elastic medium. To set regions
of the medium more and more distant from the wave source
into vibratory motion it is necessary to expend energy, which
is supplied to the medium by the source. Consequently, the
propagation of elastic waves is inseparably associated with the
transport of energy from certain parts of the medium to others.
This is precisely why the volume energy density $E^d$ of the waves
depends both on the coordinates and on the time.

For a plane travelling sine wave in a nonabsorbing medium
(30.2.5)

$$E^d = \rho A^2\omega^2 \cos^2(\omega t - kx + \varphi_0) = \frac{1}{2}\rho A^2\omega^2 [1 + \cos 2(\omega t - kx + \varphi_0)],$$

where $A = \text{const}$ is the wave amplitude.

In the case of a diverging spherical sine wave in a nonabsorbing
medium (30.2.8)

$$E^d = \rho A^2\omega^2 \cos^2(\omega t - kr + \alpha),$$

where $A = a_0/r$ is the amplitude of the wave.

The average volume energy density over one period is

$$\langle E^d \rangle = \frac{1}{2}\rho A^2\omega^2.$$
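
The period average of the energy density, $\frac{1}{2}\rho A^2\omega^2$, can be checked by averaging the instantaneous value $\rho A^2\omega^2\cos^2(\omega t - kx + \varphi_0)$ numerically. The sketch below is an illustration under assumptions: the function names, the sample count and the values (roughly air density, a micrometre amplitude, a 440 Hz wave) are not from the handbook, and the point $x = 0$ with $\varphi_0 = 0$ is chosen for simplicity.

```python
import math

def energy_density(rho, A, omega, phase):
    """Instantaneous value rho*A^2*omega^2*cos^2(phase)."""
    return rho * A**2 * omega**2 * math.cos(phase)**2

def mean_energy_density(rho, A, omega, n=1000):
    """Average of the energy density over one period (midpoint rule)."""
    T = 2.0 * math.pi / omega
    total = 0.0
    for j in range(n):
        t = T * (j + 0.5) / n
        total += energy_density(rho, A, omega, omega * t)
    return total / n

# Assumed example values
rho, A, omega = 1.2, 1.0e-6, 2.0 * math.pi * 440.0
numeric_mean = mean_energy_density(rho, A, omega)
closed_form_mean = 0.5 * rho * A**2 * omega**2
print(numeric_mean, closed_form_mean)
```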

30.3.5 The velocity of energy transfer by a wave is equal to the
velocity of motion in space of a surface corresponding to the
maximum value of the volume density $E^d$ of the wave's energy.
For sine waves this velocity is equal to the phase velocity $v$.

The energy flux $d\Phi_E$ through the small surface $dS$ is the ratio
of the energy $dE$ transferred through this surface in a short
time interval to the magnitude $dt$ of this interval. Thus

$$d\Phi_E = \frac{dE}{dt}.$$

If $\mathbf{v}$ is the velocity vector of energy transfer by the wave
(Fig. 30.1), then

$$dE = E^d v\,dt\,dS \cos\alpha = E^d (\mathbf{v}\cdot d\mathbf{S})\,dt$$

and

$$d\Phi_E = E^d (\mathbf{v}\cdot d\mathbf{S}) = (\mathbf{U}\cdot d\mathbf{S}),$$

where $E^d$ is the volume energy density of the wave, $d\mathbf{S} = \mathbf{n}\,dS$
is the vector of surface $dS$, $\mathbf{n}$ is a unit vector of the normal to
the surface, and $\alpha$ is the angle between $\mathbf{v}$ and $d\mathbf{S}$.

The vector $\mathbf{U} = E^d\mathbf{v}$, having the direction of energy transfer
by the wave, is called the Umov vector (energy flux density vector
of the wave). In magnitude it equals the ratio of the energy flux
$d\Phi_E$ through the small surface $dS$ to the area $dS_\perp = dS \cos\alpha$
of the projection of this surface on a plane perpendicular to the
direction of energy transfer: $U = d\Phi_E/dS_\perp$.

30.3.6 The wave intensity $I$ is the magnitude of the average value
of the Umov vector. Wave intensity is numerically equal to
the energy transferred by the wave in unit time through unit
area of a surface perpendicular to the direction of wave propagation.
The intensity of a travelling sine wave is proportional
to the square of its amplitude. For plane and spherical sine
waves

$$I = |\langle\mathbf{U}\rangle| = v\langle E^d\rangle = \frac{1}{2}\rho v\omega^2 A^2.$$

When a spherical wave propagates in a nonabsorbing medium,
the same amount of energy is transferred in unit time through
any spherical surface of radius $r$ with its centre at the wave
centre. This energy is equal to that expended during the same
time by the wave source: $4\pi r^2 I = \text{const}$. Hence, the intensity
and amplitude of a spherical wave decrease with an increase
in the distance from the wave centre according to the relations

$$I(r) = \frac{i_0}{r^2} \quad\text{and}\quad A(r) = \frac{a_0}{r},$$

where $i_0$ and $a_0$ are physical quantities numerically equal to the
intensity and amplitude of the wave at the distance $r = 1$ m
from the wave centre.

It  can  be  shown  in  the  same  way  that  in  the  case  of  a  plane  sine 
wave  in  a  nonabsorbing  medium,  the  amplitude  A  of  the  wave 
is  independent  of  the  coordinates. 
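
The statement that $4\pi r^2 I$ stays constant for a spherical wave in a nonabsorbing medium, with $I(r) = i_0/r^2$ and $A(r) = a_0/r$, can be illustrated numerically. The following Python sketch uses hypothetical function names and assumed values of the intensity and amplitude at 1 m.

```python
import math

def spherical_intensity(i0, r):
    """I(r) = i0 / r^2, with i0 the intensity at r = 1 m."""
    return i0 / r**2

def spherical_amplitude(a0, r):
    """A(r) = a0 / r, with a0 the amplitude at r = 1 m."""
    return a0 / r

def power_through_sphere(i0, r):
    """Energy per unit time through a sphere of radius r: 4*pi*r^2*I(r)."""
    return 4.0 * math.pi * r**2 * spherical_intensity(i0, r)

# Assumed example: i0 = 1e-3 W/m^2 and a0 = 1e-6 m at unit distance
i0, a0 = 1.0e-3, 1.0e-6
for r in (1.0, 2.0, 10.0):
    print(r, spherical_intensity(i0, r), spherical_amplitude(a0, r),
          power_through_sphere(i0, r))
```

The last printed column is the same at every radius, while the intensity falls as the square of the amplitude, in line with $I \propto A^2$.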

30.3.7  The  conversion  of  wave  energy  into  other  forms  of  energy, 
taking  place  in  the  propagation  of  waves  through  a  medium,  is 
called  wave  absorption.  In  a  uniform  medium  the  absorption  of 
elastic  waves  is  due  chiefly  to  the  processes  of  internal  friction, 
or viscosity (10.8.4), and heat conduction (10.8.5). The amplitude
$A$ and intensity $I$ of a plane wave propagating in an absorbing
medium along the positive direction of axis $OX$ vary according
to the exponential law

$$A(x) = A_0 e^{-\alpha x} \quad\text{and}\quad I(x) = I_0 e^{-2\alpha x}.$$

Here $A_0$ and $I_0$ are the amplitude and intensity of the wave at
the points $x = 0$, and $\alpha$ is the linear absorption coefficient for
elastic waves, which depends upon the properties of the medium and
the wave frequency.
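
The exponential law of absorption, $A(x) = A_0 e^{-\alpha x}$ and $I(x) = I_0 e^{-2\alpha x}$, lends itself to a short numerical sketch. The Python code below is illustrative only: the function names, including the derived helper `half_intensity_distance`, and the value of the absorption coefficient are assumptions.

```python
import math

def absorbed_amplitude(A0, alpha, x):
    """A(x) = A0*exp(-alpha*x)."""
    return A0 * math.exp(-alpha * x)

def absorbed_intensity(I0, alpha, x):
    """I(x) = I0*exp(-2*alpha*x)."""
    return I0 * math.exp(-2.0 * alpha * x)

def half_intensity_distance(alpha):
    """Distance over which the intensity halves: ln(2) / (2*alpha)."""
    return math.log(2.0) / (2.0 * alpha)

# Assumed absorption coefficient, 1/m
alpha = 0.5
x_half = half_intensity_distance(alpha)
print(x_half, absorbed_intensity(1.0, alpha, x_half),
      absorbed_amplitude(1.0, alpha, x_half))
```

The factor 2 in the exponent of the intensity follows from the proportionality of the intensity to the square of the amplitude, so at the half-intensity distance the amplitude has dropped by the factor $1/\sqrt{2}$.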

30.3.8  The  dispersion  of  waves  is  the  dependence  of  the  phase 
velocity  of  sine  waves  in  a  medium  on  their  frequency.  A  medium 
in  which  this  phenomenon  is  observed  is  said  to  be  a  dispersive 
medium. The dispersion of acoustic waves in a boundless medium
depends upon the properties of the medium and is always
accompanied  by  the  absorption  of  sound. 


30.4  Principle  of  Superposition  of  Waves.  Group 
Velocity 

30.4.1  The  principle  of  superposition  of  waves  states  that  in  a 
linear  medium  (30.1.7)  waves  are  propagated  independently  of 
one  another,  so  that  the  resultant  disturbance  at  any  point  in 
the  medium  upon  the  simultaneous  propagation  of  several  waves 
is  equal  to  the  sum  of  the  disturbances  corresponding  to  each 
of  the  waves  separately. 

For example, if $n$ different mechanical waves are propagated
simultaneously through a linear medium, the resultant displacement
$s$, velocity $v$ and acceleration $a$ of the particles of the
medium at the arbitrary instant $t$ of time are

$$s = \sum_{i=1}^{n} s_i, \quad v = \sum_{i=1}^{n} v_i \quad\text{and}\quad a = \sum_{i=1}^{n} a_i,$$

where $s_i$, $v_i$ and $a_i$ are the values of the displacement, velocity
and acceleration that the particles being considered would have
at the instant of time $t$ if only the $i$th wave were propagated in
the medium.

30.4.2  On  the  basis  of  the  principle  of  wave  superposition  and 
a  Fourier  expansion  (28.4.6  and  28.4.7)  any  nonsine  wave  can  be 
replaced  by  an  equivalent  system  of  sine  waves,  i.e.  it  can  be 
represented in the form of a group of waves, called a wave train,
or packet. The totality of frequency values for these sine waves
is  called  the  frequency  spectrum  (or  simply  spectrum)  of  the 
nonsine  wave.  In  accordance  with  the  nature  of  the  oscillation 
excited  by  the  wave,  the  wave’s  frequency  spectrum  can  be 
either  discrete  (28.4.6)  or  continuous  (28.4.7). 

The law of propagation of an arbitrary disturbance (signal), which
is  a  nonsine  wave,  in  a  linear  medium  is  simple  only  provided 
that  the  medium  is  nondispersive  (30.3.8).  In  this  case  the  signal 
travels  in  the  medium  without  changing  its  “shape”  because  all 
the  sine  waves  making  up  the  wave  train  have  the  same  phase 
velocity,  which  is  equal  to  the  velocity  of  the  signal. 

In a dispersive medium, the sine components of the wave train,
corresponding to the nonsine wave, propagate at different
velocities. Therefore, this wave train “spreads” as it propagates,
thereby changing the “shape” of the signal. If, for example, a
signal propagating along the OX axis in a dispersive medium
had the “shape” shown by dash lines in Fig. 30.2 at the instant
of time $t_1$, then at the instant $t_2 > t_1$ it could have the different
“shape” shown by a solid line.

[Fig. 30.2]

30.4.3  The  simplest  wave  train  is  the  plane  quasi-sine  wave 
obtained  by  the  superposition  of  two  plane  waves  of  the  same 
amplitude  and  with  frequencies  and  wavenumbers  close  in 
value  propagating  along  the  OX  axis.  Thus 


$$s = A_0 \sin(\omega t - kx) + A_0 \sin[(\omega + d\omega)t - (k + dk)x] = 2A_0 \cos\left(\frac{t\,d\omega - x\,dk}{2}\right) \sin(\omega t - kx)$$

(the small terms $d\omega/2$ and $dk/2$ are neglected in the argument of the sine).
The dependence of $s$ on $x$ at some fixed instant of time is shown
in Fig. 30.3. This wave differs from a sine wave in that its amplitude

$$A = 2A_0 \left|\cos\left(\frac{t\,d\omega - x\,dk}{2}\right)\right|$$

is a slowly varying function of the coordinate $x$ and time $t$.
The velocity of propagation of this nonsine wave is taken to be the
velocity $u$ of a point $M$ at which the amplitude $A$ has some fixed
value (for example, $A = 0$ or $A = 2A_0$). Consequently, the law
of motion of point $M$ is $t\,d\omega - x\,dk = \text{const}$. From this

$$u = \frac{dx}{dt} = \frac{d\omega}{dk}.$$

Velocity  u  is  called  the  group  velocity.  It  is  equal  to  the  velocity 
of  energy  transfer  by  a  quasi-sine  wave.  The  group  velocity 
$u = d\omega/dk$ is suitable for describing energy transfer (signal
transmission)  by  nonsine  waves  having  a  different  frequency 
spectrum  provided  that  the  spectrum  is  not  very  wide  and  that 
the  dispersion  of  waves  in  the  medium  at  these  frequencies  is 
not  too  great. 

The relation between the group velocity ($u = d\omega/dk$) and the
phase velocity ($v = \omega/k$) of waves is of the form:

$$u = v - \lambda \frac{dv}{d\lambda},$$

where $\lambda$ is the wavelength (30.2.6). In a nondispersive medium
$dv/d\lambda = 0$ and the group and phase velocities coincide.
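The two expressions for the group velocity, $u = d\omega/dk$ and the standard relation $u = v - \lambda\,dv/d\lambda$, can be compared numerically. The sketch below assumes the dispersion law $\omega = \sqrt{gk}$ of gravity waves on deep water, which is not taken from this handbook and serves only as an example of a dispersive medium; the derivatives are estimated by central finite differences.

```python
import math

G = 9.81  # m/s^2, assumed value of the free-fall acceleration

def omega(k):
    """Assumed dispersion law (deep-water gravity waves): omega = sqrt(g * k)."""
    return math.sqrt(G * k)

def phase_velocity(k):
    """Phase velocity v = omega / k."""
    return omega(k) / k

def group_velocity(k, dk=1e-6):
    """Group velocity u = d(omega)/dk, estimated by a central difference."""
    return (omega(k + dk) - omega(k - dk)) / (2.0 * dk)

def group_velocity_from_wavelength(k, rel_step=1e-6):
    """Group velocity from u = v - lambda * dv/dlambda (central difference)."""
    lam = 2.0 * math.pi / k
    h = lam * rel_step

    def v_of(wavelength):
        return phase_velocity(2.0 * math.pi / wavelength)

    dv_dlam = (v_of(lam + h) - v_of(lam - h)) / (2.0 * h)
    return phase_velocity(k) - lam * dv_dlam

k = 2.0 * math.pi / 10.0  # wavenumber for a wavelength of 10 m
print(phase_velocity(k), group_velocity(k), group_velocity_from_wavelength(k))
```

For this particular dispersion law both estimates give a group velocity equal to half the phase velocity; in a nondispersive medium they would coincide with the phase velocity.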




30.5  Interference  of  Waves.  Standing  Waves 

30.5.1 Two waves are said to be coherent if their phase difference
is independent of time. Coherent waves correspond to coherent
oscillations (or vibrations) (28.4.3). The sources of coherent
waves are called coherent sources. Sine waves of equal frequency
are always coherent. Waves of different frequencies are coherent
only for the length of time that the oscillations (or vibrations)
they excite are coherent (28.4.4). In superposing incoherent sine
waves, generated by point sources $S_1$ and $S_2$ (Fig. 30.4), the
square of the amplitude $A$ of the resultant nonharmonic vibration
at arbitrary point $M$ varies periodically with time $t$ according
to the relation (see Sects. 28.4.2 and 30.2.8):

$$A^2 = A_1^2 + A_2^2 + 2A_1A_2 \cos[(\omega_2 - \omega_1)t - (k_2r_2 - k_1r_1) + (\alpha_2 - \alpha_1)].$$

Here $A_1$ and $A_2$, $\omega_1$ and $\omega_2$, $k_1$ and $k_2$, and $\alpha_1$ and $\alpha_2$ are the
amplitudes at point $M$, angular frequencies, wavenumbers and
the initial phases of the two spherical waves being superposed,
and $r_1$ and $r_2$ are the distances from sources $S_1$ and $S_2$ to point $M$.
The period of variation of $A^2$ is equal to $T = 2\pi/|\omega_2 - \omega_1|$.
The average value of the square of the amplitude during the
period is

$$\langle A^2 \rangle = A_1^2 + A_2^2.$$

[Fig. 30.4]


In  the  superposition  of  incoherent  waves,  the  squares  of  their 
amplitudes  are  added  together. 
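The averaging that leads to the addition of squared amplitudes can be checked numerically. This is a minimal sketch: the amplitudes, the angular frequencies and the constant `phase_offset` (which collects the time-independent part of the phase difference) are arbitrary assumed values, and the average is taken by the midpoint rule.

```python
import math

def squared_amplitude(t, a1, a2, omega1, omega2, phase_offset=0.0):
    """Square of the resultant amplitude at a fixed point at the instant t."""
    return (a1 ** 2 + a2 ** 2
            + 2.0 * a1 * a2 * math.cos((omega2 - omega1) * t + phase_offset))

def mean_squared_amplitude(a1, a2, omega1, omega2, phase_offset=0.0, steps=1000):
    """Average of A^2 over one period T = 2*pi / |omega2 - omega1|."""
    period = 2.0 * math.pi / abs(omega2 - omega1)
    dt = period / steps
    total = sum(
        squared_amplitude((i + 0.5) * dt, a1, a2, omega1, omega2, phase_offset)
        for i in range(steps)
    )
    return total / steps

# Assumed example: amplitudes 3 and 4, angular frequencies 2000 and 2010 rad/s.
print(mean_squared_amplitude(3.0, 4.0, 2000.0, 2010.0, phase_offset=0.7))
```

The interference term averages to zero over the period, so the result is close to $3^2 + 4^2 = 25$ whatever the value of the constant phase offset.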

30.5.2 The interference of waves is the phenomenon, in superposing
waves, of their mutual strengthening, stable with time,
at certain points in space and their weakening at others, depending
upon the relations between the phases of these waves.
Only coherent waves display interference, provided that they
correspond to vibrations in the same or almost the same direction.

In superposing coherent spherical waves, generated by the point
sources $S_1$ and $S_2$ (Fig. 30.4),

$$s_1 = A_1 \sin(\omega t - kr_1 + \alpha_1) = A_1 \sin\Phi_1$$

and

$$s_2 = A_2 \sin(\omega t - kr_2 + \alpha_2) = A_2 \sin\Phi_2,$$

the amplitude $A$ and phase $\Phi$ of the resultant harmonic vibrations
at point $M$ (where $s = s_1 + s_2 = A \sin\Phi$) are determined
from the relations (28.4.3):

$$A^2 = A_1^2 + A_2^2 + 2A_1A_2 \cos[k(r_2 - r_1) - (\alpha_2 - \alpha_1)],$$

$$\tan\Phi = \frac{A_1 \sin\Phi_1 + A_2 \sin\Phi_2}{A_1 \cos\Phi_1 + A_2 \cos\Phi_2}.$$

Since the difference of the initial phases $\alpha_2 - \alpha_1 = \text{const}$ for
the vibration of the coherent sources $S_2$ and $S_1$, the resulting
interference of the two waves at various points $M$ depends on
the quantity $\Delta = r_2 - r_1$, called the path difference of the waves.
At interference maxima, the amplitude of the resultant vibration
is $A = A_1 + A_2$, whereas at interference minima $A = |A_1 - A_2|$.
Maxima are observed at the points $M$ that comply with the condition

$$k\Delta - (\alpha_2 - \alpha_1) = \pm 2m\pi,$$

where $m = 0, 1, 2, \ldots$ is the order of the interference maximum.
The condition for interference minima is of the form

$$k\Delta - (\alpha_2 - \alpha_1) = \pm(2m - 1)\pi,$$

where $m = 1, 2, 3, \ldots$ is the order of the interference minimum.
Since the wavenumber $k = 2\pi/\lambda$, where $\lambda$ is the wavelength in
the given medium, the conditions for interference maxima and
minima can be presented in the form:

$$\Delta = \pm m\lambda + \frac{\alpha_2 - \alpha_1}{2\pi}\lambda \quad\text{(maxima)},$$

$$\Delta = \pm(2m - 1)\frac{\lambda}{2} + \frac{\alpha_2 - \alpha_1}{2\pi}\lambda \quad\text{(minima)}.$$

Finally, when $\alpha_2 = \alpha_1$ the conditions take the form:
$\Delta = \pm m\lambda$ (maxima) and $\Delta = \pm(2m - 1)\lambda/2$ (minima).

On the straight line $ab$, parallel to and at the distance $L$ from
the line passing through the sources $S_1$ and $S_2$ (Fig. 30.4), the
central maximum of zero order is at point $O$, equidistant from
sources $S_1$ and $S_2$. If the distance between the sources $l \ll L$,
then for point $M$ on line $ab$, at the distance $z \ll L$ from point $O$,
the path difference of the waves is

$$\Delta = \frac{lz}{L}.$$

Maxima of the $m$th and $(m + 1)$th orders correspond to the
values

$$z_m = \frac{m\lambda L}{l} \quad\text{and}\quad z_{m+1} = \frac{(m + 1)\lambda L}{l},$$

so that the distance between adjacent maxima equals $\lambda L/l$.
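The approximation $\Delta = lz/L$ and the resulting spacing of the maxima can be compared with the exact path difference. The sketch below assumes cophased sources placed at $z = \pm l/2$ (with $S_1$ on the side of positive $z$) on a line at the distance $L$ from the line $ab$; the numerical values are arbitrary illustrations.

```python
import math

def path_difference_exact(z, l, L):
    """Exact r2 - r1 for S1 at z = +l/2 and S2 at z = -l/2 (assumed geometry)."""
    r1 = math.hypot(L, z - l / 2.0)
    r2 = math.hypot(L, z + l / 2.0)
    return r2 - r1

def path_difference_approx(z, l, L):
    """Approximate path difference l * z / L, valid for l << L and z << L."""
    return l * z / L

def maximum_position(m, wavelength, l, L):
    """Coordinate z_m = m * wavelength * L / l of the m-th order maximum."""
    return m * wavelength * L / l

def fringe_spacing(wavelength, l, L):
    """Distance wavelength * L / l between adjacent maxima."""
    return wavelength * L / l

wavelength, l, L = 0.001, 0.05, 5.0  # metres, assumed example values
z3 = maximum_position(3, wavelength, l, L)
print(z3, fringe_spacing(wavelength, l, L))
print(path_difference_exact(z3, l, L), path_difference_approx(z3, l, L))
```

At the third-order maximum of this example the exact and approximate path differences agree to within a fraction of a percent, which is the accuracy to be expected when $z/L$ is of the order of a few hundredths.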

30.5.3  When  waves  interfere,  their  energies  cannot  be  simply 
added  together.  At  interference  maxima,  the  intensity  of  the 
resultant  wave  is  greater  than  the  sum  of  the  intensities  of  the 
superposed  waves,  and  at  minima,  it  is  less  than  their  sum. 
Wave  interference  leads  to  a  redistribution  of  vibrational  energy 
between  neighbouring  regions  of  the  medium.  But  on  an  average, 
for  a  large  region  of  space,  the  energy  of  the  resultant  wave  is 
equal  to  the  sum  of  the  energies  of  the  interfering  waves.  This 
follows  from  the  laws  of  conservation  and  conversion  of  energy. 

30.5.4  A  special  case  of  wave  interference  is  standing  waves. 
A  standing  wave  is  one  that  is  formed  by  the  superposition  of 
two  travelling  sine  waves  that  are  propagating  toward  each 
other  and  have  the  same  frequency  and  amplitude.  In  the  case 
of  transverse  waves,  they  also  have  the  same  polarization 
(30.2.2).  A  standing  wave  is  formed,  for  instance,  in  a  stretched 
flexible  string  when  one  end  is  fixed  and  vibratory  motion  is 
imparted  to  the  other  end. 

In  the  superposition  of  two  coherent  travelling  plane  waves  of 
the  form 

$$s_1 = A \sin(\omega t - kx) \quad\text{and}\quad s_2 = A \sin(\omega t + kx + \alpha),$$

where $\alpha$ is the phase difference at points $x = 0$, a plane standing
wave is formed. It is described by the equation

$$s = s_1 + s_2 = 2A \cos\left(kx + \frac{\alpha}{2}\right) \sin\left(\omega t + \frac{\alpha}{2}\right).$$

In contrast to the amplitude $A$ of travelling waves, the amplitude
$A_{st}$ of a standing wave is a periodic function of the coordinate $x$:

$$A_{st} = 2A \left|\cos\left(kx + \frac{\alpha}{2}\right)\right|.$$

30.5.5 Points of a standing wave at which $A_{st} = 0$ are called
its nodes; points at which the amplitude $A_{st}$ reaches its maximum
value ($A_{st} = 2A$) are called the antinodes. The positions
of the nodes and antinodes are determined from the conditions:

$$kx + \frac{\alpha}{2} = (2m + 1)\frac{\pi}{2} \quad\text{(nodes)},$$

$$kx + \frac{\alpha}{2} = m\pi \quad\text{(antinodes)},$$

where $m = 0, 1, 2, \ldots$.

The distances between two adjacent nodes and between two
adjacent antinodes are the same and equal to half the wavelength
$\lambda$ of the travelling waves. This distance is called the
length of the standing wave: $\lambda_{st} = \lambda/2$. The distance between
a node and the adjacent antinode of a standing wave is equal to
$\lambda_{st}/2$.
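The node and antinode conditions can be checked against the amplitude of the standing wave, $2A\,|\cos(kx + \alpha/2)|$. The following is a minimal sketch with hypothetical function names and arbitrary values of the wavelength and of the phase difference $\alpha$.

```python
import math

def standing_amplitude(x, a, k, alpha):
    """Amplitude of the standing wave at the coordinate x: 2A |cos(kx + alpha/2)|."""
    return 2.0 * a * abs(math.cos(k * x + alpha / 2.0))

def node_position(m, k, alpha):
    """Coordinate found from k x + alpha/2 = (2m + 1) pi / 2."""
    return ((2 * m + 1) * math.pi / 2.0 - alpha / 2.0) / k

def antinode_position(m, k, alpha):
    """Coordinate found from k x + alpha/2 = m pi."""
    return (m * math.pi - alpha / 2.0) / k

wavelength = 0.5                  # m, assumed example value
k = 2.0 * math.pi / wavelength    # wavenumber
alpha = 0.3                       # rad, assumed phase difference at x = 0

print(node_position(1, k, alpha) - node_position(0, k, alpha))      # about wavelength / 2
print(antinode_position(1, k, alpha) - node_position(0, k, alpha))  # about wavelength / 4
print(standing_amplitude(antinode_position(1, k, alpha), 1.0, k, alpha))
```

The spacing between adjacent nodes comes out as half the wavelength of the travelling waves, and a node is separated from the nearest antinode by a quarter of that wavelength, whatever value of $\alpha$ is chosen.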

30.5.6 In a travelling wave the phase of the vibration depends
upon the coordinate $x$ of the point being considered. In a standing
wave all the points between two nodes vibrate with different
amplitudes, but in the same phase (cophased) because the
argument of the sine in the equation of a standing wave (30.5.4)
is independent of the coordinate $x$. In passing through a node,
the phase of the vibration changes abruptly by $\pi$, so that the
sign of $\cos(kx + \alpha/2)$ is reversed.

Illustrated in Fig. 30.5 is the nature of the motion of various
points on a stretched flexible string of length $l$ when a transverse
standing wave is established in it. The left end $O$ of the
string is set into harmonic motion and the right end $N$ is fixed.
In this case, as the wave is reflected from the place where the
string is fixed, its phase is changed by $\pi$, so that a node of the
standing wave is formed at this point. The phase difference
of the reflected and incident waves at point $O$ (where $x = 0$) is
$\alpha = -(2kl + \pi)$. The nodes of the standing wave are designated
by small circles in Fig. 30.5, and the instant of time $t_0$ is selected
so that $\sin(\omega t_0 + \alpha/2) = 0$.

[Fig. 30.5]

30.5.7 In a standing wave (30.5.4), the velocity of vibratory
motion of the particles of the medium is

$$v_1 = \frac{\partial s}{\partial t} = 2A\omega \cos\left(kx + \frac{\alpha}{2}\right) \cos\left(\omega t + \frac{\alpha}{2}\right),$$

and the relative deformation of the medium is

$$\varepsilon = \frac{\partial s}{\partial x} = -2Ak \sin\left(kx + \frac{\alpha}{2}\right) \sin\left(\omega t + \frac{\alpha}{2}\right) = 2Ak \sin\left(kx + \frac{\alpha}{2}\right) \cos\left(\omega t + \frac{\alpha}{2} + \frac{\pi}{2}\right).$$

Thus, in contrast to a travelling wave (30.3.4), in a standing
wave $\varepsilon$ has a phase lead of $\pi/2$ over $v_1$. Consequently, at the
instants of time when $v_1$ reaches its amplitude value, $\varepsilon$ becomes
zero and vice versa. Moreover, the amplitudes of $v_1$ and $\varepsilon$ depend
upon the coordinate $x$, but in different ways. Thus, the antinodes
of the particle velocity and nodes of deformation of the
medium are at the antinodes of the standing wave (30.5.5),
whereas the antinodes of the deformation and the nodes of the
velocity are at the nodes of the standing wave.

In an elastic standing wave, the energy is periodically converted
from potential energy, localized mainly near the deformation
antinodes, into kinetic energy, localized mainly near the velocity
antinodes, and back again. Therefore, energy periodically
migrates from the nodes of a standing wave to its antinodes and
back again. But in the nodes and antinodes themselves the
energy  flux  density  is  identically  equal  to  zero.  The  average 
value  of  the  energy  flux  density  over  one  period  is  equal  to  zero 
at  any  point  on  a  standing  wave,  because  the  two  travelling 
waves  that  form  the  standing  wave  transmit  equal  amounts  of 
energy  over  a  period  in  opposite  directions.  This  is  why  they 
have  been  called  standing  waves. 

30.5.8  In  the  free  vibrations  of  strings,  rods  and  columns  of  gas, 
standing  waves  are  set  up  in  them.  The  frequencies  of  these 
waves  comply  with  definite  conditions,  i.e.  they  can  have  only 
certain  discrete  values,  which  are  called  the  natural  frequencies 
of  vibration  of  the  corresponding  vibratory  system.  Displace¬ 
ment  nodes  (deformation  antinodes)  are  located  at  the  rigidly 
fixed  ends  of  strings  or  rods,  and  displacement  antinodes  (defor¬ 
mation  nodes)  are  located  at  the  free  ends  of  the  rods.  In  the 
vibrations  of  a  cylindrical  column  of  gas  in  a  pipe,  a  pressure 
antinode  is  located  at  the  closed  end  of  the  pipe  and  a  pressure 
node  at  the  open  end. 

If $l$ is the length of the string, rod or gas column, $v$ is the phase
velocity of the wave and $\lambda$ is the wavelength, then for strings
and rods fixed at both ends and for gas columns in pipes that
are open or closed at both ends, a whole number of lengths
$\lambda_{st} = \lambda/2$ of the standing wave fits into the length $l$, i.e.

$$l = m\lambda_{st} = m\lambda/2, \quad\text{where } m = 1, 2, 3, \ldots$$

The natural frequency of vibrations of such systems is

$$\nu = \frac{mv}{2l}.$$

For rods fixed at one end and free at the other, or for pipes
closed at one end and open at the other,

$$l = (2m - 1)\frac{\lambda_{st}}{2} = (2m - 1)\frac{\lambda}{4}$$

and the natural frequency of vibration is

$$\nu = \frac{(2m - 1)v}{4l}.$$
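The two series of natural frequencies, $\nu = mv/(2l)$ and $\nu = (2m - 1)v/(4l)$, are easy to tabulate. In this minimal sketch the function names are hypothetical, and the speed of sound (340 m/s) and the pipe length (0.5 m) are assumed illustrative values.

```python
def natural_frequencies_symmetric(v, l, count):
    """Frequencies m*v/(2*l), m = 1..count: both ends fixed, or a pipe with both ends alike."""
    return [m * v / (2.0 * l) for m in range(1, count + 1)]

def natural_frequencies_asymmetric(v, l, count):
    """Frequencies (2m-1)*v/(4*l), m = 1..count: one end fixed (closed), the other free (open)."""
    return [(2 * m - 1) * v / (4.0 * l) for m in range(1, count + 1)]

v_sound = 340.0  # m/s, assumed speed of sound in air
length = 0.5     # m, assumed pipe length

print(natural_frequencies_symmetric(v_sound, length, 3))   # 340, 680, 1020 Hz
print(natural_frequencies_asymmetric(v_sound, length, 3))  # 170, 510, 850 Hz
```

A pipe closed at one end has a lowest natural frequency half that of a pipe of the same length with both ends alike, and its higher natural frequencies are only the odd multiples of the lowest one.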

30.6  Doppler  Effect  in  Acoustics 

30.6.1 The Doppler effect refers to the change in the frequency
of waves, observed or recorded by a receiver, or detector, which
is due to motion of the source of the waves and the detector
(or observer). For example, as a fast train approaches a stationary
observer, the pitch of the locomotive whistle is higher, and as
the train passes the observer and travels away it is lower, than
the pitch of the same whistle when the train is standing still.

Assume that the detector $D$ of acoustic waves in a gaseous
(or liquid) medium is fixed with respect to the medium, and that
the source $S$ is moving away at the velocity $v_1$ along a straight
line joining them (Fig. 30.6a). During the period $T_0$ of its
vibration, the source travels in the medium a distance of
$v_1T_0 = v_1/\nu_0$, where $\nu_0$ is the source’s frequency of vibration.
Therefore, upon motion of the source the wavelength $\lambda$ in the medium
differs from its value $\lambda_0$ when the source is stationary:

$$\lambda = \lambda_0 + v_1T_0 = (v + v_1)T_0 = (v + v_1)/\nu_0,$$

where $v$ is the phase velocity of the wave in the medium. The
frequency of the wave recorded by the detector is

$$\nu = \frac{v}{\lambda} = \frac{\nu_0}{1 + v_1/v}.$$

If the velocity vector $\mathbf{v}_1$ of the source makes the arbitrary angle
$\theta_1$ with the radius vector $\mathbf{R}$, connecting the stationary detector
to the source (Fig. 30.6b), then

$$\nu = \frac{\nu_0}{1 + (v_1/v)\cos\theta_1}.$$

[Fig. 30.6]

30.6.2 When the source is stationary and the detector approaches
it at the velocity $v_2$ along a straight line joining them (Fig. 30.6c),
the wavelength in the medium is $\lambda = \lambda_0 = v/\nu_0$. But the velocity
of propagation of the wave with respect to the detector is
$v + v_2$. Hence, the wave frequency recorded by the detector is

$$\nu = \frac{v + v_2}{\lambda} = \nu_0\left(1 + \frac{v_2}{v}\right).$$

In the case when the velocity vector $\mathbf{v}_2$ makes the arbitrary
angle $\theta_2$ with radius vector $\mathbf{R}$, connecting the travelling detector
to the stationary source (Fig. 30.6d),

$$\nu = \nu_0\left(1 + \frac{v_2}{v}\cos\theta_2\right).$$

30.6.3 In the most general case, when both the detector and
source of acoustic waves are travelling with respect to the
medium at arbitrary velocities (Fig. 30.6e),

$$\nu = \nu_0\,\frac{1 + (v_2/v)\cos\theta_2}{1 + (v_1/v)\cos\theta_1}.$$

This formula can also be presented in the form

$$\nu = \nu_0\left\{1 - \frac{V\cos\theta}{v}\left[1 - \frac{v_1}{v}\cos\theta_1 + \left(\frac{v_1}{v}\cos\theta_1\right)^2 - \ldots\right]\right\},$$

where $\mathbf{V} = \mathbf{v}_1 - \mathbf{v}_2$ is the velocity of the wave source with
respect to the detector, and $\theta$ is the angle between vectors $\mathbf{V}$
and $\mathbf{R}$. The quantity $V\cos\theta$, equal to the projection of $\mathbf{V}$
on the direction of $\mathbf{R}$, is called the ray (radial) velocity of the source.
When $v_1 \ll v$,

$$\nu \approx \nu_0\left(1 - \frac{V\cos\theta}{v}\right).$$
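The general acoustic Doppler formula, $\nu = \nu_0\,[1 + (v_2/v)\cos\theta_2]/[1 + (v_1/v)\cos\theta_1]$, covers the special cases of 30.6.1 and 30.6.2. The sketch below uses a hypothetical function name; the angles are measured from the radius vector drawn from the detector to the source, as in the text, and the numerical values (a 1000 Hz source, speeds of 34 m/s, sound velocity 340 m/s) are assumptions for illustration.

```python
import math

def doppler_frequency(nu0, v, v1=0.0, theta1=0.0, v2=0.0, theta2=0.0):
    """Frequency recorded by the detector.

    nu0    -- frequency of the source
    v      -- phase velocity of the wave in the medium
    v1     -- speed of the source, theta1 its angle with R (0 = moving away)
    v2     -- speed of the detector, theta2 its angle with R (0 = approaching)
    """
    return (nu0 * (1.0 + (v2 / v) * math.cos(theta2))
            / (1.0 + (v1 / v) * math.cos(theta1)))

nu0, v = 1000.0, 340.0
print(doppler_frequency(nu0, v, v1=34.0))                  # source moving away
print(doppler_frequency(nu0, v, v1=34.0, theta1=math.pi))  # source approaching
print(doppler_frequency(nu0, v, v2=34.0))                  # detector approaching
```

A source approaching at a tenth of the sound velocity raises the recorded frequency by more than a detector approaching at the same speed does (about 1111 Hz against 1100 Hz here), which shows that in acoustics the effect depends on the motion relative to the medium and not only on the relative velocity.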


CHAPTER  31  ELECTROMAGNETIC  WAVES 


31.1  Properties  of  Electromagnetic  Waves 

31.1.1 Electromagnetic waves are disturbances of an electromagnetic
field (i.e. a variable electromagnetic field) that propagate
through space. The existence of electromagnetic waves follows
directly from Maxwell’s equations (27.4.3). For an electromagnetic
field, distant from the free electric charges (18.3.1) and
macroscopic currents (26.4.1) that set it up, these equations are
of the form*

$$\operatorname{curl}\mathbf{E} = -\frac{\partial\mathbf{B}}{\partial t}, \quad \operatorname{curl}\mathbf{H} = \frac{\partial\mathbf{D}}{\partial t},$$

$$\operatorname{div}\mathbf{D} = 0, \quad \operatorname{div}\mathbf{B} = 0.$$

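31.1.2 If the medium is a uniform and isotropic dielectric
(30.1.6), having no ferroelectric (18.4.1) or ferromagnetic
(26.5.1) properties, then $\mathbf{D} = \varepsilon_r\varepsilon_0\mathbf{E}$ and $\mathbf{B} = \mu_r\mu_0\mathbf{H}$, where
$\varepsilon_r$ and $\mu_r$ are constant scalar quantities independent both of
the coordinates and of time. Here Maxwell’s equations (31.1.1)
can be rewritten in the form

$$\operatorname{curl}\mathbf{E} = -\mu_r\mu_0\frac{\partial\mathbf{H}}{\partial t}, \quad \operatorname{curl}\mathbf{H} = \varepsilon_r\varepsilon_0\frac{\partial\mathbf{E}}{\partial t}, \quad \operatorname{div}\mathbf{E} = 0 \quad\text{and}\quad \operatorname{div}\mathbf{H} = 0,$$

or in projections on the axes of Cartesian coordinates:

$$\frac{\partial E_z}{\partial y} - \frac{\partial E_y}{\partial z} = -\mu_r\mu_0\frac{\partial H_x}{\partial t}, \quad \frac{\partial H_z}{\partial y} - \frac{\partial H_y}{\partial z} = \varepsilon_r\varepsilon_0\frac{\partial E_x}{\partial t},$$

$$\frac{\partial E_x}{\partial z} - \frac{\partial E_z}{\partial x} = -\mu_r\mu_0\frac{\partial H_y}{\partial t}, \quad \frac{\partial H_x}{\partial z} - \frac{\partial H_z}{\partial x} = \varepsilon_r\varepsilon_0\frac{\partial E_y}{\partial t},$$

$$\frac{\partial E_y}{\partial x} - \frac{\partial E_x}{\partial y} = -\mu_r\mu_0\frac{\partial H_z}{\partial t}, \quad \frac{\partial H_y}{\partial x} - \frac{\partial H_x}{\partial y} = \varepsilon_r\varepsilon_0\frac{\partial E_z}{\partial t},$$

$$\frac{\partial E_x}{\partial x} + \frac{\partial E_y}{\partial y} + \frac{\partial E_z}{\partial z} = 0, \quad \frac{\partial H_x}{\partial x} + \frac{\partial H_y}{\partial y} + \frac{\partial H_z}{\partial z} = 0.$$

* All equations in Chap. 31 are written in SI units.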

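31.1.3 It follows from Maxwell’s equations (31.1.2) that the
strength vectors $\mathbf{E}$ and $\mathbf{H}$ of a variable electromagnetic field
and their projections onto the axes of Cartesian coordinates
comply, in a uniform, isotropic, nonconducting medium, with
the wave equation (30.2.9):

$$\Delta\mathbf{E} - \varepsilon_r\varepsilon_0\mu_r\mu_0\frac{\partial^2\mathbf{E}}{\partial t^2} = 0, \quad \Delta\mathbf{H} - \varepsilon_r\varepsilon_0\mu_r\mu_0\frac{\partial^2\mathbf{H}}{\partial t^2} = 0,$$

$$\Delta E_x - \varepsilon_r\varepsilon_0\mu_r\mu_0\frac{\partial^2 E_x}{\partial t^2} = 0, \quad \Delta H_x - \varepsilon_r\varepsilon_0\mu_r\mu_0\frac{\partial^2 H_x}{\partial t^2} = 0,$$

$$\Delta E_y - \varepsilon_r\varepsilon_0\mu_r\mu_0\frac{\partial^2 E_y}{\partial t^2} = 0, \quad \Delta H_y - \varepsilon_r\varepsilon_0\mu_r\mu_0\frac{\partial^2 H_y}{\partial t^2} = 0,$$

$$\Delta E_z - \varepsilon_r\varepsilon_0\mu_r\mu_0\frac{\partial^2 E_z}{\partial t^2} = 0, \quad \Delta H_z - \varepsilon_r\varepsilon_0\mu_r\mu_0\frac{\partial^2 H_z}{\partial t^2} = 0.$$

Thus a variable electromagnetic field really propagates in space
in the form of waves with the phase velocity

$$v = \frac{c}{\sqrt{\varepsilon_r\mu_r}}, \quad\text{where } c = \frac{1}{\sqrt{\varepsilon_0\mu_0}} \approx 3 \times 10^8 \text{ m/s}.$$

In vacuum $\varepsilon_r = \mu_r = 1$. Hence, $c$ is the velocity of electromagnetic
waves in vacuum (in free space).

31.1.4 Electromagnetic waves are transverse waves: vectors $\mathbf{E}$
and $\mathbf{H}$ of the field lie in a plane perpendicular to the direction
of wave propagation, i.e. to the wave velocity vector $\mathbf{v}$ at the
point being considered in the field. This can be most readily
demonstrated by the example of a plane wave, propagating
along the positive direction of axis OX (30.2.4):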


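$$\mathbf{E} = \mathbf{f}\left(t - \frac{x}{v}\right) \quad\text{and}\quad \mathbf{H} = \boldsymbol{\varphi}\left(t - \frac{x}{v}\right).$$

Vectors $\mathbf{E}$ and $\mathbf{H}$ and their projections on the coordinate axes are
independent of $y$ and $z$:

$$\frac{\partial E_x}{\partial y} = \frac{\partial E_x}{\partial z} = \frac{\partial E_y}{\partial y} = \frac{\partial E_y}{\partial z} = \frac{\partial E_z}{\partial y} = \frac{\partial E_z}{\partial z} = 0$$

and

$$\frac{\partial H_x}{\partial y} = \frac{\partial H_x}{\partial z} = \frac{\partial H_y}{\partial y} = \frac{\partial H_y}{\partial z} = \frac{\partial H_z}{\partial y} = \frac{\partial H_z}{\partial z} = 0.$$

It follows from Maxwell’s equations (31.1.2) that for the field
of a plane wave,

$$\frac{\partial E_x}{\partial x} = \frac{\partial E_x}{\partial t} = 0 \quad\text{and}\quad \frac{\partial H_x}{\partial x} = \frac{\partial H_x}{\partial t} = 0,$$

i.e. $E_x$ and $H_x$ are independent of both coordinates and time.
Hence, for a variable field of a plane wave $E_x = H_x = 0$, and
the vectors $\mathbf{E}$ and $\mathbf{H}$ are perpendicular to the direction of wave
propagation:

$$\mathbf{E} = E_y\mathbf{j} + E_z\mathbf{k} \quad\text{and}\quad \mathbf{H} = H_y\mathbf{j} + H_z\mathbf{k},$$

where $\mathbf{j}$ and $\mathbf{k}$ are unit vectors of the coordinate axes.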

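31.1.5 Vectors $\mathbf{E}$ and $\mathbf{H}$ of the field of an electromagnetic wave
are mutually perpendicular, so that $\mathbf{v}$, $\mathbf{E}$ and $\mathbf{H}$ form a
right-hand set of three vectors (Fig. 31.1). As a matter of fact,
for the field of a plane wave (31.1.4),

$$E_y = f_1\left(t - \frac{x}{v}\right), \quad E_z = f_2\left(t - \frac{x}{v}\right), \quad H_y = \varphi_1\left(t - \frac{x}{v}\right) \quad\text{and}\quad H_z = \varphi_2\left(t - \frac{x}{v}\right).$$

It follows from Maxwell’s equations (31.1.2) that

$$\sqrt{\varepsilon_r\varepsilon_0}\,\frac{dE_y}{d\xi} = \sqrt{\mu_r\mu_0}\,\frac{dH_z}{d\xi} \quad\text{and}\quad \sqrt{\varepsilon_r\varepsilon_0}\,\frac{dE_z}{d\xi} = -\sqrt{\mu_r\mu_0}\,\frac{dH_y}{d\xi},$$

where $\xi = t - x/v$. Therefore, for a variable field of a plane
wave

$$H_y = -\sqrt{\frac{\varepsilon_r\varepsilon_0}{\mu_r\mu_0}}\,E_z, \quad H_z = \sqrt{\frac{\varepsilon_r\varepsilon_0}{\mu_r\mu_0}}\,E_y \quad\text{and}\quad (\mathbf{E}\,\mathbf{H}) = 0.$$

The mutually perpendicular vectors $\mathbf{E}$ and $\mathbf{H}$ oscillate in phase:
they become zero simultaneously and reach their maximum
values simultaneously. The relation between their magnitudes is

$$\sqrt{\varepsilon_r\varepsilon_0}\,E = \sqrt{\mu_r\mu_0}\,H,$$

which is valid for any travelling electromagnetic wave (30.2.1)
regardless of the shape of its wavefronts (30.2.3).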

31.1.6 A monochromatic wave is an electromagnetic wave of a
single definite frequency $\nu$, i.e. an electromagnetic sine wave.
At each point of the electromagnetic field of a monochromatic
wave, the projections of vectors $\mathbf{E}$ and $\mathbf{H}$ on the coordinate axes
of an inertial frame of reference have harmonic oscillations
(28.1.3) of the same frequency, equal to the wave frequency $\nu$.
For example, in the case of a plane monochromatic wave, propagating
along the positive direction of axis OX,

$$E_y = A_1 \sin(\omega t - kx), \quad H_y = -\sqrt{\frac{\varepsilon_r\varepsilon_0}{\mu_r\mu_0}}\,E_z,$$

$$E_z = A_2 \sin(\omega t - kx + \varphi), \quad H_z = \sqrt{\frac{\varepsilon_r\varepsilon_0}{\mu_r\mu_0}}\,E_y,$$

where $\omega = 2\pi\nu$ is the angular frequency of the wave, $k$ is the
wavenumber (30.2.6), $A_1$ and $A_2$ are the amplitudes of $E_y$ and
$E_z$, and $\varphi$ is the phase difference between the oscillations of
$E_z$ and $E_y$.

31.1.7 For an arbitrary value of $\varphi$ (31.1.6), a plane monochromatic
wave is elliptically polarized, i.e. at each point of the wave’s
field, vectors $\mathbf{E}$ and $\mathbf{H}$, while remaining mutually perpendicular,
vary with time so that their heads describe ellipses lying in a
plane perpendicular to the direction of wave propagation:

$$\frac{E_y^2}{A_1^2} + \frac{E_z^2}{A_2^2} - \frac{2E_yE_z}{A_1A_2}\cos\varphi = \sin^2\varphi$$

and

$$\frac{H_y^2}{A_2^2} + \frac{H_z^2}{A_1^2} + \frac{2H_yH_z}{A_1A_2}\cos\varphi = \frac{\varepsilon_r\varepsilon_0}{\mu_r\mu_0}\sin^2\varphi.$$

In particular, when $A_1 = A_2$ and $\varphi = \pm(2m + 1)\pi/2$, where
$m = 0, 1, 2, \ldots$, the ellipses become circles:

$$E_y^2 + E_z^2 = A_1^2 \quad\text{and}\quad H_y^2 + H_z^2 = \frac{\varepsilon_r\varepsilon_0}{\mu_r\mu_0}A_1^2.$$

Such a wave is said to be circularly polarized.

When $\varphi = \pm m\pi$, where $m = 0, 1, 2, \ldots$, the ellipses degenerate
into straight lines:

$$\frac{E_y}{A_1} = \pm\frac{E_z}{A_2} \quad\text{and}\quad \frac{H_y}{A_2} = \mp\frac{H_z}{A_1},$$

where the upper signs correspond to even values of $m$ and the lower
signs to odd values.

Such  a  wave  is  said  to  be  linearly,  or  plane,  polarized.  Shown 
in  Fig.  31.2  are  the  values  of  E  and  H  of  the  field  of  a  plane, 
linearly  polarized,  monochromatic  wave  at  various  points 
along  the  ray  (OX  axis)  at  the  same  instant  of  time.  Axes  OY 
and  OZ  are  in  the  directions  of  oscillation  of  vectors  E  and  H, 
respectively,  so  that  Ez  =  Hy  =  0.  According  to  accepted 
terminology,  the  plane  passing  through  the  electric  vector  E 
and  the  ray  is  called  the  plane  of  polarization  of  the  linearly 
polarized  wave.  Previously,  this  plane  was  called  the  plane 
of  wave  oscillation,  and  the  plane  of  polarization  was  called  the 
one  passing  through  the  magnetic  vector  H  and  the  ray  (this 
obsolete terminology may still be found in the literature).

[Fig. 31.2, with the plane of polarization indicated]

31.1.8 An arbitrary plane monochromatic wave can be represented
as a combination of two plane monochromatic waves of
the same frequency as the first wave and simultaneously propagating
in the same direction as the first wave, but linearly
polarized in mutually perpendicular planes. For example, a
plane monochromatic wave, propagating along the OX axis
(31.1.6) can be dealt with as the result of superposition of a
y-wave ($E_1 = E_y$) and a z-wave ($E_2 = E_z$).
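The decomposition into a y-wave and a z-wave makes it easy to check the polarization-ellipse equation of 31.1.7 and to classify the polarization from $A_1$, $A_2$ and the phase difference $\varphi$. The sketch below uses hypothetical function names, takes both amplitudes to be nonzero, and treats the tolerance `tol` as an arbitrary assumption.

```python
import math

def field_components(phase, a1, a2, phi):
    """E_y and E_z at a point for a given value of the phase (omega*t - k*x)."""
    return a1 * math.sin(phase), a2 * math.sin(phase + phi)

def ellipse_residual(e_y, e_z, a1, a2, phi):
    """Left side minus right side of the polarization-ellipse equation for E."""
    return ((e_y / a1) ** 2 + (e_z / a2) ** 2
            - 2.0 * e_y * e_z * math.cos(phi) / (a1 * a2)
            - math.sin(phi) ** 2)

def polarization_type(a1, a2, phi, tol=1e-9):
    """Classify the wave as 'linear', 'circular' or 'elliptical' (a1, a2 nonzero)."""
    if abs(math.sin(phi)) < tol:
        return "linear"
    if abs(math.cos(phi)) < tol and abs(a1 - a2) < tol:
        return "circular"
    return "elliptical"

for phase in (0.0, 0.5, 1.3, 2.9):
    e_y, e_z = field_components(phase, 2.0, 3.0, 0.7)
    print(ellipse_residual(e_y, e_z, 2.0, 3.0, 0.7))  # zero to rounding accuracy

print(polarization_type(1.0, 1.0, math.pi / 2.0))
print(polarization_type(1.0, 2.0, math.pi))
print(polarization_type(2.0, 3.0, 0.7))
```

The residual vanishes for every value of the phase, which confirms that the head of vector $\mathbf{E}$ stays on the ellipse as time passes; the circular and linear cases are the limiting forms named in the text.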


31.2  Energy  of  Electromagnetic  Waves 

31.2.1 The volume energy density of an electromagnetic field
in an isotropic medium is

$$E_d = \frac{\varepsilon_r\varepsilon_0E^2}{2} + \frac{\mu_r\mu_0H^2}{2},$$

where $\varepsilon_r$ and $\mu_r$ are the relative permittivity (dielectric constant)
and relative permeability of the medium. From the relations
between the magnitudes of vectors $\mathbf{E}$ and $\mathbf{H}$ of the field
set up by an electromagnetic wave (31.1.5) it follows that the
volume energy density of electromagnetic waves is equal to

$$E_d = \varepsilon_r\varepsilon_0E^2 = \mu_r\mu_0H^2 = \sqrt{\varepsilon_r\varepsilon_0\mu_r\mu_0}\,EH = \frac{\sqrt{\varepsilon_r\mu_r}}{c}\,EH,$$

where $c$ is the velocity of electromagnetic waves in free space
(31.1.3).

31.2.2 In the case of a plane, linearly polarized, monochromatic
wave, propagating along the positive direction of axis OX, the
field strength $E = A \sin(\omega t - kx)$. Consequently, the volume
energy density of this wave is

$$E_d = \varepsilon_r\varepsilon_0A^2 \sin^2(\omega t - kx).$$

The value of $E_d$ at each point of this field varies periodically
with the frequency $\omega/\pi$ in a range from 0 to $E_{d\,\max} = \varepsilon_r\varepsilon_0A^2$.

The average value of $E_d$ over one period is proportional to the
square of the amplitude of the field strength:

$$\langle E_d \rangle = \frac{\omega}{\pi}\int_0^{\pi/\omega} E_d\,dt = \frac{1}{2}\varepsilon_r\varepsilon_0A^2.$$

If the plane monochromatic wave has an arbitrary (elliptical)
polarization (31.1.7), then, according to Sect. 31.1.8,

$$E_d = \varepsilon_r\varepsilon_0\left[A_1^2 \sin^2(\omega t - kx) + A_2^2 \sin^2(\omega t - kx + \varphi)\right]$$

and

$$\langle E_d \rangle = \frac{1}{2}\varepsilon_r\varepsilon_0(A_1^2 + A_2^2).$$


31.2.3 The energy flux density vector (30.3.5) of an electromagnetic
wave is called the Umov-Poynting vector $\mathbf{P}$ (more
frequently, it is called the Poynting vector). The velocity of
energy transfer by a monochromatic wave is equal to the phase
velocity $v = c/\sqrt{\varepsilon_r\mu_r}$ of this wave (30.3.5). Therefore, the
Umov-Poynting vector for such a wave is

$$\mathbf{P} = E_d\mathbf{v} = [\mathbf{E}\,\mathbf{H}].$$

In the case of a plane, linearly polarized, monochromatic wave
(31.2.2) the Umov-Poynting vector is in the direction of wave
propagation and is numerically equal to

$$P = \sqrt{\frac{\varepsilon_r\varepsilon_0}{\mu_r\mu_0}}\,A^2 \sin^2(\omega t - kx).$$

When a plane monochromatic wave is elliptically polarized
(31.1.7),

$$P = \sqrt{\frac{\varepsilon_r\varepsilon_0}{\mu_r\mu_0}}\left[A_1^2 \sin^2(\omega t - kx) + A_2^2 \sin^2(\omega t - kx + \varphi)\right].$$

31.2.4 The intensity $I$ of an electromagnetic wave is a physical
quantity numerically equal to the energy transferred by the
wave in unit time through unit area of a surface located perpendicular
to the direction of wave propagation. The intensity
of an electromagnetic wave is equal to the absolute value of the
average Umov-Poynting vector during a period of its complete
oscillation:

$$I = |\langle\mathbf{P}\rangle|.$$

The intensity of a travelling monochromatic wave is

$$I = \langle E_d \rangle v,$$

where $v$ is the phase velocity of the wave, and $\langle E_d \rangle$ is the average
value of the volume energy density of the field of the wave.
The intensity of a plane, linearly polarized, travelling monochromatic
wave (31.2.2) is proportional to the square of the
amplitude $A$ of oscillation of the wave’s field strength vector $\mathbf{E}$.
Thus

$$I = \frac{1}{2}\sqrt{\frac{\varepsilon_r\varepsilon_0}{\mu_r\mu_0}}\,A^2.$$

When a plane monochromatic wave is elliptically polarized
(31.1.7), its intensity is equal to the sum of the intensities of
the y- and z-waves that form the wave being considered (31.1.8):

$$I = I_y + I_z = \frac{1}{2}\sqrt{\frac{\varepsilon_r\varepsilon_0}{\mu_r\mu_0}}\,(A_1^2 + A_2^2).$$

Note. The intensity of light, i.e. the electromagnetic waves dealt
with in optics, is commonly understood to be the square of the
amplitude of oscillation of the light wave’s field strength
vector $\mathbf{E}$.
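The two expressions for the intensity of a plane, linearly polarized wave, the product of the average energy density and the phase velocity $c/\sqrt{\varepsilon_r\mu_r}$, and the formula in terms of the amplitude, can be compared numerically. In this minimal sketch the function names are hypothetical, the values of the electric and magnetic constants are rounded reference values, and the amplitude of 100 V/m is an arbitrary example.

```python
import math

EPS0 = 8.8541878128e-12  # F/m, electric constant (rounded reference value)
MU0 = 1.25663706212e-6   # H/m, magnetic constant (rounded reference value)

def phase_velocity(eps_r=1.0, mu_r=1.0):
    """Phase velocity 1/sqrt(eps_r*eps0*mu_r*mu0), i.e. c/sqrt(eps_r*mu_r)."""
    return 1.0 / math.sqrt(eps_r * EPS0 * mu_r * MU0)

def mean_energy_density(amplitude, eps_r=1.0):
    """Average volume energy density (1/2)*eps_r*eps0*A^2 of the wave."""
    return 0.5 * eps_r * EPS0 * amplitude ** 2

def intensity(amplitude, eps_r=1.0, mu_r=1.0):
    """Intensity (1/2)*sqrt(eps_r*eps0/(mu_r*mu0))*A^2 of the wave."""
    return 0.5 * math.sqrt(eps_r * EPS0 / (mu_r * MU0)) * amplitude ** 2

a = 100.0  # V/m, assumed amplitude of the electric field strength
print(phase_velocity())                           # close to 3e8 m/s in vacuum
print(intensity(a))                               # W/m^2
print(mean_energy_density(a) * phase_velocity())  # the same value
```

For an elliptically polarized wave the same function can be applied to the y- and z-waves separately and the two intensities added.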

31.2.5 The intensity $I$ of a spherical, linearly polarized, monochromatic
wave is related to the amplitude $A$ of oscillation
of vector $\mathbf{E}$ in the same way as for a plane wave (31.2.4). But
the amplitude and intensity of a spherical wave decrease as the
distance $r$ from the wave centre increases (30.3.6):

$$A = a_0/r \quad\text{and}\quad I = i_0/r^2.$$

31.2.6 Electromagnetic waves exert pressure on the obstacles
in their path that absorb and reflect the waves. The pressure of
electromagnetic waves is explained by the fact that, due to the
effect of the waves' electric field, charged particles of matter are
put into ordered motion and are subject to the action of Lorentz
forces exerted by the magnetic field (24.1.1).

According to Maxwell's theory the pressure of electromagnetic
waves is equal to

$$p = \frac{I}{v}\,(1 + R)\cos^2 i = \langle E_d \rangle\,(1 + R)\cos^2 i,$$

where I is the intensity of the wave incident on the obstacle,
v is its velocity, $\langle E_d \rangle$ is the average value of the volume
energy density of the wave, i is the angle of incidence (31.5.2), and
R is the reflection coefficient (31.5.6).
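
The pressure formula of Sect. 31.2.6 is easy to evaluate numerically. The following is a minimal sketch, not part of the handbook: the function name and the sample intensity of 1400 W/m^2 (roughly sunlight above the atmosphere) are illustrative assumptions.

```python
import math

C = 299_792_458.0  # speed of light in vacuum, m/s


def radiation_pressure(intensity, velocity, reflection_coeff, incidence_angle):
    """Pressure p = (I / v) * (1 + R) * cos^2(i), in pascals (Sect. 31.2.6).

    intensity        -- I, W/m^2
    velocity         -- v, wave velocity in the medium, m/s
    reflection_coeff -- R, between 0 (full absorption) and 1 (full reflection)
    incidence_angle  -- i, radians
    """
    if not 0.0 <= reflection_coeff <= 1.0:
        raise ValueError("reflection coefficient must lie in [0, 1]")
    return intensity / velocity * (1.0 + reflection_coeff) * math.cos(incidence_angle) ** 2


# Illustrative values (assumed): normal incidence in vacuum.
p_absorbing = radiation_pressure(1400.0, C, 0.0, 0.0)  # perfectly absorbing obstacle
p_mirror = radiation_pressure(1400.0, C, 1.0, 0.0)     # perfectly reflecting obstacle
print(f"absorbing: {p_absorbing:.2e} Pa, mirror: {p_mirror:.2e} Pa")
```

A perfect mirror experiences twice the pressure of a perfect absorber, as the factor (1 + R) implies.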


31.3  Electromagnetic  Radiation 

31.3.1  The  process  of  exciting  electromagnetic  waves  and  emit¬ 
ting  them  into  surrounding  space  by  some  system  is  called  the 
radiation  of  these  waves.  The  system  itself  is  called  the  radiating 
system.  The  field  set  up  by  these  electromagnetic  waves  is  called 
the  radiation  field.  According  to  the  concepts  of  classical  electro¬ 
dynamics,  electromagnetic  waves  are  excited  by  electric  charges 
travelling  with  acceleration  (in  particular,  an  electric  circuit 
in  which  the  current  varies).  Vavilov-Cherenkov  radiation  can 
also  occur  in  matter  (34.6.1). 

The simplest radiating system is the electric dipole (15.2.3) whose
dipole moment $\mathbf{p}_e$ varies with time. Such an “oscillating” dipole
is called an oscillator, or elementary vibrator. Oscillators are
extensively used in physics for simulating and calculating the
radiation fields of real systems. If the radiating system is
electrically neutral and its size is small compared to the length $\lambda$
of the waves it radiates, then in the wave zone of the system,
i.e. at points whose distance from the system is $r \gg \lambda$, the
radiation field is almost the same as that of an oscillator having
the same electric dipole moment as the whole radiating system.

31.3.2 A linear harmonic oscillator is an electric dipole whose
moment $\mathbf{p}_e$ varies according to the harmonic law:

$$\mathbf{p}_e = \mathbf{p}_0 \sin \omega t, \quad \text{where } \mathbf{p}_0 = \text{const}.$$

At point M of the wave zone of a linear harmonic oscillator,
located in vacuum, the vectors E and H of the radiation field
at the instant of time t are equal to:

$$\mathbf{E}(t) = -\frac{\mu_0 \omega^2}{4\pi r^3}\,[[\mathbf{p}_0 \mathbf{r}]\,\mathbf{r}]\,\sin(\omega t - kr),$$

$$\mathbf{H}(t) = -\frac{\omega^2}{4\pi c r^2}\,[\mathbf{p}_0 \mathbf{r}]\,\sin(\omega t - kr),$$

where r is the radius vector from point O, where the dipole is
located, to point M (Fig. 31.3). Vectors E and H are mutually
perpendicular and lie in a plane perpendicular to the radius
vector r. Hence, E, H and r form a right-hand set of three vectors.
Vector E is tangent to the meridian, and vector H is tangent to
the latitudinal circle passing through point M of a sphere of
radius r having its centre at point O.

Fig. 31.3

Fig. 31.4

31.3.3  The  Umov-Poynting  vector  at  point  M  of  the  wave  zone 
of  a  linear  harmonic  oscillator  is 


$$\mathbf{P} = \frac{\mu_0 \omega^4 p_0^2 \sin^2\theta}{16\pi^2 c r^3}\,\mathbf{r}\,\sin^2(\omega t - kr),$$

where $\theta$ is the angle between vectors $\mathbf{p}_0$ and $\mathbf{r}$.
The intensity of the electromagnetic wave at point M is

$$I = \frac{\mu_0 \omega^4 p_0^2 \sin^2\theta}{32\pi^2 c r^2}.$$


The dependence of I on $\theta$ for a fixed value of r, shown in polar
coordinates (Fig. 31.4), is called the polar directional diagram
of oscillator (dipole) radiation. The dipole radiates most
intensively in directions that make an angle $\theta = \pi/2$ with its axis,
i.e. in a plane passing through the middle of the dipole and
perpendicular to its axis. Along its axis ($\theta = 0$ and $\theta = \pi$)
the dipole does not radiate at all.

The average power of radiation of a linear harmonic oscillator is

$$\langle P_r \rangle = 2\pi r^2 \int_0^{\pi} I \sin\theta \, d\theta = \frac{\mu_0 \omega^4 p_0^2}{12\pi c}.$$
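
The closed-form average power of Sect. 31.3.3 follows from integrating the dipole intensity over a sphere. The sketch below checks this numerically with a midpoint rule; the function names, the step count and the sample values of p0 and omega are illustrative assumptions, not values from the handbook.

```python
import math

MU_0 = 4.0e-7 * math.pi   # magnetic constant, H/m (conventional value)
C = 299_792_458.0         # speed of light in vacuum, m/s


def dipole_intensity(p0, omega, r, theta):
    """I = mu0 * omega^4 * p0^2 * sin^2(theta) / (32 * pi^2 * c * r^2)."""
    return MU_0 * omega**4 * p0**2 * math.sin(theta) ** 2 / (32.0 * math.pi**2 * C * r**2)


def average_power_numeric(p0, omega, r=1.0, steps=20_000):
    """Midpoint-rule estimate of 2*pi*r^2 * integral_0^pi I(theta) sin(theta) d(theta)."""
    d_theta = math.pi / steps
    total = 0.0
    for n in range(steps):
        theta = (n + 0.5) * d_theta
        total += dipole_intensity(p0, omega, r, theta) * math.sin(theta) * d_theta
    return 2.0 * math.pi * r**2 * total


def average_power_closed_form(p0, omega):
    """<P_r> = mu0 * omega^4 * p0^2 / (12 * pi * c)."""
    return MU_0 * omega**4 * p0**2 / (12.0 * math.pi * C)


# Assumed sample values: an atomic-scale dipole moment and an optical frequency.
numeric = average_power_numeric(1.0e-29, 4.0e15)
closed = average_power_closed_form(1.0e-29, 4.0e15)
print(numeric, closed)
```

The result does not depend on the radius r of the sphere, since the intensity falls off as 1/r^2 while the area grows as r^2.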

31.3.4 The instantaneous power of radiation of charge q, travelling
at the acceleration a, is

$$P_r = \frac{\mu_0 q^2 a^2}{6\pi c}.$$

If the charge executes harmonic oscillation of the amplitude $l_0$
and angular frequency $\omega$, the average power of its radiation is

$$\langle P_r \rangle = \frac{\mu_0 q^2 \omega^4 l_0^2}{12\pi c}.$$


According to classical theory the radiation of light by atoms is
due to the vibration of electrons in them. Owing to the consumption
of energy to produce radiation this vibration is gradually damped,
i.e. its amplitude varies according to the law (29.1.3):
$l_0 = l_{00} \exp(-\beta t)$, where $l_{00}$ is the initial amplitude
and $\beta$ is the damping factor. The energy of vibration of an
electron (28.2.2) is $E = \tfrac{1}{2} m_e \omega^2 l_0^2$, where $m_e$ is the
mass of the electron. Since $-dE = \langle P_r \rangle\, dt = 2\beta E\, dt$ and
$q = -e$, the relaxation time (29.1.3) of this vibration, called the
average de-excitation time of the atom, is

$$\tau = \frac{1}{\beta} = \frac{12\pi c\, m_e}{\mu_0 e^2 \omega^2}.$$

For visible light $\omega \approx 4 \times 10^{15}$ s$^{-1}$ and $\tau \approx 2 \times 10^{-8}$ s.
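
The order of magnitude of the de-excitation time can be reproduced from the formula tau = 12*pi*c*m_e / (mu0 * e^2 * omega^2) of Sect. 31.3.4. This is a minimal sketch; the rounded constant values and the function name are assumptions made for illustration.

```python
import math

MU_0 = 4.0e-7 * math.pi   # magnetic constant, H/m (conventional value)
C = 299_792_458.0         # speed of light in vacuum, m/s
M_E = 9.109e-31           # electron mass, kg (rounded)
E_CHARGE = 1.602e-19      # elementary charge, C (rounded)


def deexcitation_time(omega):
    """Classical relaxation time of a radiating electron oscillator, in seconds."""
    return 12.0 * math.pi * C * M_E / (MU_0 * E_CHARGE**2 * omega**2)


# Angular frequency of the order quoted for visible light.
tau = deexcitation_time(4.0e15)
print(f"tau = {tau:.2e} s")
```

Because tau scales as 1/omega^2, doubling the frequency shortens the classical de-excitation time by a factor of four.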

31.4  Electromagnetic  Spectrum

31.4.1 Depending upon the frequency $\nu$ (or wavelength $\lambda = c/\nu$
in vacuum, where c is the velocity of electromagnetic waves in
free space), as well as on the way in which they are radiated
and recorded (or detected), distinction is made between several
kinds of electromagnetic waves: radio waves, optical radiation,
X-rays and gamma rays.

Radio waves are electromagnetic waves of a length $\lambda$ in free space
that exceeds $5 \times 10^{-5}$ m (correspondingly, $\nu < 6 \times 10^{12}$ Hz).
In connection with special features of their propagation and
generation, the full range of radio waves is commonly divided
into nine bands (Table 31.1).


TABLE 31.1

Band name                        Wavelength, m          Frequency, Hz
Very-long (myriametric) waves    Over 10^4              Less than 3 × 10^4
Long (kilometric) waves          10^4 to 10^3           3 × 10^4 to 3 × 10^5
Medium (hectometric) waves       10^3 to 10^2           3 × 10^5 to 3 × 10^6
Short (decametric) waves         10^2 to 10             3 × 10^6 to 3 × 10^7
Metric waves                     10 to 1                3 × 10^7 to 3 × 10^8
Decimetric waves                 1 to 0.1               3 × 10^8 to 3 × 10^9
Centimetric waves                0.1 to 0.01            3 × 10^9 to 3 × 10^10
Millimetric waves                10^-2 to 10^-3         3 × 10^10 to 3 × 10^11
Submillimetric waves             10^-3 to 5 × 10^-5     3 × 10^11 to 6 × 10^12
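
The band boundaries of Table 31.1 can be turned into a small lookup. The sketch below assigns a frequency to a band through the wavelength lambda = c/nu; the function name and the treatment of exact boundary values (assigned to the shorter-wave band) are assumptions, since the table does not say which band owns a boundary.

```python
C = 299_792_458.0  # speed of light in vacuum, m/s

# (band name, lower wavelength bound in metres), longest waves first (Table 31.1).
RADIO_BANDS = [
    ("very-long (myriametric)", 1e4),
    ("long (kilometric)", 1e3),
    ("medium (hectometric)", 1e2),
    ("short (decametric)", 10.0),
    ("metric", 1.0),
    ("decimetric", 0.1),
    ("centimetric", 1e-2),
    ("millimetric", 1e-3),
    ("submillimetric", 5e-5),
]


def radio_band(frequency_hz):
    """Return the Table 31.1 band name, or None if the wave is not a radio wave."""
    wavelength = C / frequency_hz
    for name, lower_bound in RADIO_BANDS:
        if wavelength > lower_bound:
            return name
    return None  # wavelength of 5e-5 m or less: outside the radio range


print(radio_band(100e6))   # an FM broadcast frequency
print(radio_band(2.45e9))  # a common microwave frequency
```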

31.4.2 Radio frequencies are divided into twelve bands according
to the international regulations for radio communications.
These bands and their corresponding wavelengths in free space
are listed in Table 31.2.

31.4.3 Optical radiation, or light, refers to electromagnetic waves
(electromagnetic radiation) of a wavelength in vacuum ranging
from 10 nm to 1 mm (by convention). Optical radiation includes
infrared, visible and ultraviolet radiation.*

Infrared (IR) radiation is the electromagnetic radiation emitted
by heated bodies. Its wavelength in vacuum is within the range
from 1 mm to 770 nm (1 nm = $10^{-9}$ m).

Visible radiation, or visible light, is electromagnetic radiation
of a wavelength in vacuum from 770 to 380 nm. It is capable of
directly producing a visible sensation in the human eye.

Ultraviolet (UV) radiation is electromagnetic radiation of a
wavelength in vacuum from 380 to 10 nm.

31.4.4 X-radiation, or X-rays (also called Roentgen rays), is the
electromagnetic radiation produced in the interaction of charged
particles and photons (37.1.4) with atoms of matter. Its wavelength
in vacuum ranges within the wide conditional limits
from 10 or 100 nm to 0.01 or 1 pm (1 pm = $10^{-12}$ m).

Gamma radiation, or gamma rays, is electromagnetic radiation
of a wavelength in vacuum less than 0.1 nm. It is emitted by
excited atomic nuclei upon radioactive transformations and
nuclear reactions. It is also formed in particle decay, in
particle-antiparticle annihilation (43.5.5) and other processes.


31.5  Reflection  and  Refraction  of  Electromagnetic 
Waves  at  the  Interface  Between  Two  Dielectric  Media 

31.5.1 The refractive index (absolute refractive index) of a medium
is the quantity n, which is equal to the ratio of the velocity c
of the electromagnetic waves in vacuum to their phase velocity
v in the medium: $n = c/v$. From Sect. 31.1.3 it follows that
$n = \sqrt{\varepsilon_r \mu_r}$, where $\varepsilon_r$ and $\mu_r$ are the relative dielectric
permittivity and relative magnetic permeability of the medium.
For media having no ferromagnetic properties, $\mu_r \approx 1$ and

$$n \approx \sqrt{\varepsilon_r}.$$

The relative refractive index of two media (of the second medium
with respect to the first) is the quantity $n_{21}$, equal to the ratio
of the refractive indices of these media: $n_{21} = n_2/n_1$. For
nonferromagnetic media $n_{21} = \sqrt{\varepsilon_{r2}/\varepsilon_{r1}}$.


* According to the recommendation of the Committee for
Scientific and Engineering Terminology of the USSR Academy of
Sciences [Physical Optics (Terminology), Collection of Recommended
Terms, Issue 74, Nauka Publishers, 1968, Moscow]
optical radiation also includes X-rays, the range of wavelengths
of optical radiation being established from 0.1 Å ($10^{-11}$ m)
to 1 cm.

31.5.2 When an electromagnetic wave falls on the interface
between two media, it is partly reflected from the interface and
partly refracted as it enters the second medium. In Fig. 31.5 the
line AB is the plane interface between the media. Rays 1, 1' and
2 represent the directions of propagation of the incident,
reflected and refracted plane waves. They are called the incident,
reflected and refracted rays, respectively. The angles they make
with the perpendicular ab to the interface at the point of
incidence O are the angle of incidence i, the angle of
reflection i' and the refractive angle r.

The  plane  of  incidence  is  the  plane  passing  through  the  incident 
ray  and  perpendicular  to  the  interface  at  the  point  of  incidence. 

31.5.3 The laws of reflection and refraction of electromagnetic
waves at the interface between two dielectric media can be
obtained on the basis of the boundary conditions for an
electromagnetic field (27.4.4). In the first medium the field of the
reflected wave ($\mathbf{E}^{rfl}$ and $\mathbf{H}^{rfl}$) is superposed on the field of the
incident wave ($\mathbf{E}^0$ and $\mathbf{H}^0$). In the second medium there is only
the field ($\mathbf{E}^{rfr}$ and $\mathbf{H}^{rfr}$) of the refracted wave (propagating in
this medium). Hence, the boundary conditions are of the form
(assuming that $\mu_{r2} = \mu_{r1} = 1$):

$$E_\tau^0 + E_\tau^{rfl} = E_\tau^{rfr}, \qquad \varepsilon_{r1}\,(E_n^0 + E_n^{rfl}) = \varepsilon_{r2}\, E_n^{rfr},$$

$$H_\tau^0 + H_\tau^{rfl} = H_\tau^{rfr}, \qquad H_n^0 + H_n^{rfl} = H_n^{rfr}.$$

Here $E_\tau$, $H_\tau$, $E_n$ and $H_n$ are the projections of vectors E and H
on the tangential plane and the normal to the interface between
the media. It follows from these relations that in the incidence
of a plane monochromatic wave on a smooth plane interface
between media, the following laws are complied with (regardless
of the kind of polarization of this wave):

(a)  The  reflected  and  refracted  waves  are  also  monochromatic 
waves  of  the  same  frequency  as  the  incident  one. 

(b)  The  law  of  reflection  states  that  the  reflected  ray  lies  in  the
plane  of  incidence,  and  the  angle  of  reflection  is  equal  to  the 
angle  of  incidence  (i'  =  i). 

(c) The law of refraction states that the refracted ray lies in the
plane of incidence and that the refractive angle is related to the
angle of incidence by Snell's law:

$$\frac{\sin i}{\sin r} = \frac{n_2}{n_1} = n_{21}.$$
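
Snell's law of Sect. 31.5.3 can be applied directly to find the refractive angle. The sketch below is illustrative only: the function name is hypothetical, and the refractive indices 1.0 and 1.5 (roughly air and glass) are assumed sample values.

```python
import math


def refraction_angle(incidence_angle, n1, n2):
    """Refractive angle r (radians) from sin(i) / sin(r) = n2 / n1.

    Returns None when no refracted ray exists (sin r would exceed 1).
    """
    sin_r = math.sin(incidence_angle) * n1 / n2
    if sin_r > 1.0:
        return None
    return math.asin(sin_r)


# Assumed sample media: n1 = 1.0 (air), n2 = 1.5 (glass), incidence at 30 degrees.
r_glass = refraction_angle(math.radians(30.0), 1.0, 1.5)
print(f"r = {math.degrees(r_glass):.2f} degrees")

# Going from the denser medium into the rarer one at 60 degrees: no refracted ray.
print(refraction_angle(math.radians(60.0), 1.5, 1.0))
```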

31.5.4  Making  use  of  the  boundary  conditions  (31.5.3)  the  rela¬ 
tions  can  be  found  between  the  phases,  amplitudes  and  inten¬ 
sities  of  incident,  reflected  and  refracted  monochromatic  waves. 


It is sufficient for this purpose, according to Sect. 31.1.8, to
know the above-indicated relations for linearly polarized waves
of two types: p-waves, whose vector $\mathbf{E} = \mathbf{E}_p$ lies in the plane of
incidence and whose vector $\mathbf{H} = \mathbf{H}_p$ is perpendicular to this
plane (Fig. 31.6), and s-waves, whose vector $\mathbf{E} = \mathbf{E}_s$ is
perpendicular to the plane of incidence and whose vector $\mathbf{H} = \mathbf{H}_s$
lies in this plane (Fig. 31.7).




The relations between the amplitudes of oscillation of vector E
in the incident ($A^0$), reflected ($A^{rfl}$) and refracted ($A^{rfr}$) waves
in the cases of p- and s-waves are expressed by the Fresnel
formulas:

$$A_p^{rfl} = -A_p^0\,\frac{\tan(i - r)}{\tan(i + r)}, \qquad A_p^{rfr} = A_p^0\,\frac{2\cos i \sin r}{\sin(i + r)\cos(i - r)},$$

$$A_s^{rfl} = -A_s^0\,\frac{\sin(i - r)}{\sin(i + r)}, \qquad A_s^{rfr} = A_s^0\,\frac{2\cos i \sin r}{\sin(i + r)}.$$

In particular, with normal incidence of waves on the interface
between media ($i = r = 0$),

$$A_p^{rfl} = -A_p^0\,\frac{n_{21} - 1}{n_{21} + 1}, \qquad A_p^{rfr} = A_p^0\,\frac{2}{n_{21} + 1},$$

$$A_s^{rfl} = -A_s^0\,\frac{n_{21} - 1}{n_{21} + 1}, \qquad A_s^{rfr} = A_s^0\,\frac{2}{n_{21} + 1}.$$

In the Fresnel formulas $A_p^0$ and $A_s^0$ are positive quantities,
and $A_p^{rfr}$ and $A_s^{rfr}$ are also positive for any possible angles
of incidence and refractive angles. This indicates that the
refracted and incident waves are in phase. Quantities $A_p^{rfl}$ and
$A_s^{rfl}$ can be either negative or positive. In the first case,
illustrated in Figs. 31.6 and 31.7, the phase of oscillation of
vector E is changed by $\pi$ upon reflection (the phase of oscillation
of vector H does not change). In the second case, the phase of
oscillation of vector E remains unchanged (and, accordingly, the
phase of oscillation of vector H changes by $\pi$).
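
The Fresnel formulas of Sect. 31.5.4 can be evaluated numerically to see the sign of each amplitude. The sketch below assumes the convention in which a negative reflected amplitude means a phase change of pi for vector E; the function name is hypothetical, and n21 = 1.5 is an assumed sample value. It is meant for oblique incidence (0 < i < pi/2) below any critical angle.

```python
import math


def fresnel_amplitude_ratios(i, n21):
    """Return (A_p_rfl, A_p_rfr, A_s_rfl, A_s_rfr), each divided by the
    corresponding incident amplitude, for incidence angle i in radians."""
    r = math.asin(math.sin(i) / n21)  # Snell's law; fails beyond the critical angle
    rfl_p = -math.tan(i - r) / math.tan(i + r)
    rfr_p = 2.0 * math.cos(i) * math.sin(r) / (math.sin(i + r) * math.cos(i - r))
    rfl_s = -math.sin(i - r) / math.sin(i + r)
    rfr_s = 2.0 * math.cos(i) * math.sin(r) / math.sin(i + r)
    return rfl_p, rfr_p, rfl_s, rfr_s


# Nearly normal incidence should approach -(n21 - 1)/(n21 + 1) and 2/(n21 + 1).
near_normal = fresnel_amplitude_ratios(1.0e-6, 1.5)
print(near_normal)

# Oblique incidence at 30 degrees, below Brewster's angle for n21 = 1.5.
oblique = fresnel_amplitude_ratios(math.radians(30.0), 1.5)
print(oblique)
```

For n21 > 1 and an angle below Brewster's angle both reflected amplitudes come out negative, i.e. vector E changes phase by pi on reflection, while both refracted amplitudes stay positive.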

31.5.5  The  values  of  the  phase  difference  in  the  oscillation  of 
vector  E  upon  the  reflection  of  p-  and  s-type  electromagnetic 
waves  are  listed  in  Table  31.3,  in  accordance  with  the  condi¬ 
tions  (i.e.  depending  on  the  angle  of  incidence  i  and  the  relative 
refractive  index  of  the  media). 

The angle of incidence $i_{Br}$ at which the reflected and refracted
rays are perpendicular to each other is called Brewster's angle.
When $i = i_{Br}$, then $i + r = \pi/2$ and it follows from the law
of wave refraction (31.5.3) that $\tan i_{Br} = n_{21}$. It is evident
from the Fresnel formulas (31.5.4) that at $i = i_{Br}$ the amplitude
$A_p^{rfl} = 0$, i.e. the p-wave is not reflected from the interface
between the media, but completely passes from medium 1
into medium 2.

TABLE 31.3

Conditions     i + r < π/2 (i < i_Br)                i + r > π/2 (i > i_Br)
Type of wave   n21 > 1 (i > r)   n21 < 1 (i < r)     n21 > 1 (i > r)   n21 < 1 (i < r)
p-wave         π                 0                   0                 π
s-wave         π                 0                   π                 0

31.5.6 The coefficient of reflection (or reflection coefficient) R of an
electromagnetic wave from the interface of two media is the
ratio of the intensities (31.2.4) of the reflected and incident
waves:

$$R = \frac{I^{rfl}}{I^0} = \left(\frac{A^{rfl}}{A^0}\right)^2.$$

The reflection coefficients of the p- and s-waves are equal to

$$R_p = \frac{\tan^2(i - r)}{\tan^2(i + r)}, \qquad R_s = \frac{\sin^2(i - r)}{\sin^2(i + r)}.$$

In particular, with normal incidence of waves on the interface
between two media ($i = r = 0$)

$$R_p = R_s = \left(\frac{n_{21} - 1}{n_{21} + 1}\right)^2.$$

When the incident wave is arbitrarily polarized, the reflection
coefficient is

$$R = \frac{I_p^{rfl} + I_s^{rfl}}{I_p^0 + I_s^0} = \frac{R_p I_p^0 + R_s I_s^0}{I_p^0 + I_s^0},$$

where $I_p^0$ and $I_s^0$ are the intensities of the p- and s-components of
the incident wave whose intensity is $I^0 = I_p^0 + I_s^0$.
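
The reflection coefficients of Sect. 31.5.6 and Brewster's angle of Sect. 31.5.5 can be checked together. The following is a minimal sketch with hypothetical function names; n21 = 1.5 and the equal split of the incident intensity between the p- and s-components are assumed sample inputs.

```python
import math


def reflection_coefficients(i, n21):
    """Return (R_p, R_s) for oblique incidence at angle i (radians)."""
    r = math.asin(math.sin(i) / n21)  # Snell's law
    R_p = (math.tan(i - r) / math.tan(i + r)) ** 2
    R_s = (math.sin(i - r) / math.sin(i + r)) ** 2
    return R_p, R_s


def mixed_reflection_coefficient(i, n21, I_p, I_s):
    """R = (R_p * I_p + R_s * I_s) / (I_p + I_s) for an arbitrarily polarized wave."""
    R_p, R_s = reflection_coefficients(i, n21)
    return (R_p * I_p + R_s * I_s) / (I_p + I_s)


def brewster_angle(n21):
    """Angle of incidence with tan(i_Br) = n21, in radians."""
    return math.atan(n21)


i_br = brewster_angle(1.5)
R_p_br, R_s_br = reflection_coefficients(i_br, 1.5)
R_mixed = mixed_reflection_coefficient(i_br, 1.5, 1.0, 1.0)
print(f"i_Br = {math.degrees(i_br):.2f} deg, R_p = {R_p_br:.1e}, R_s = {R_s_br:.4f}")
```

At Brewster's angle R_p vanishes to within rounding error, so the reflected wave consists of the s-component only.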

31.5.7 The transmission factor, or transmittance, T is the ratio
of the intensity of the transmitted (refracted) wave to that of
the incident wave. Thus

$$T = \frac{I^{rfr}}{I^0} = n_{21}\left(\frac{A^{rfr}}{A^0}\right)^2.$$

The transmission factors for the p- and s-waves are

$$T_p = \frac{4\cos^2 i \,\sin i \,\sin r}{\sin^2(i + r)\cos^2(i - r)}, \qquad T_s = \frac{4\cos^2 i \,\sin i \,\sin r}{\sin^2(i + r)}.$$

In particular, upon normal incidence of waves on the interface
between media ($i = r = 0$),

$$T_p = T_s = \frac{4 n_{21}}{(n_{21} + 1)^2}.$$


31.5.8 When $n_{21} = n_2/n_1 < 1$, the refractive angle is greater
than the angle of incidence: $\sin r = \sin i / n_{21}$ and $r > i$. The
angle of incidence at which the refractive angle becomes equal to
$\pi/2$ is called the critical angle $i_{cr}$: $i_{cr} = \arcsin n_{21}$. When
$i \ge i_{cr}$, the intensities of the reflected and incident waves are
the same, i.e. the wave is completely reflected from the interface
between the media ($R = 1$). This phenomenon is called total
internal reflection.
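
The critical-angle condition of Sect. 31.5.8 is a one-line computation. The sketch below uses hypothetical function names and an assumed sample interface with n21 = 1/1.5 (for instance, light passing from glass into air).

```python
import math


def critical_angle(n21):
    """Critical angle i_cr = arcsin(n21), in radians; defined only for n21 < 1."""
    if n21 >= 1.0:
        raise ValueError("total internal reflection requires n21 < 1")
    return math.asin(n21)


def is_totally_reflected(i, n21):
    """True when the angle of incidence i (radians) reaches or exceeds i_cr."""
    return n21 < 1.0 and i >= math.asin(n21)


n21 = 1.0 / 1.5  # assumed sample value
i_cr = critical_angle(n21)
print(f"i_cr = {math.degrees(i_cr):.2f} degrees")
print(is_totally_reflected(math.radians(45.0), n21))  # above the critical angle
print(is_totally_reflected(math.radians(30.0), n21))  # below the critical angle
```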


31.6  Doppler  Effect 

31.6.1 Upon motion of the source and detector of electromagnetic
waves relative to each other the Doppler effect is observed (30.6.1).
The laws governing this phenomenon for electromagnetic waves
can be established on the basis of the special theory of relativity.
Assume the detector D to be stationary with respect to an inertial
frame of reference K, and that the source S travels with respect
to K along the positive direction of axis OX at the velocity V
(Fig. 31.8). Source S is stationary in frame of reference K'
and is located at its origin of coordinates. The axes of coordinate
systems K and K' are parallel pairwise (axis O'X' coincides
with axis OX). Illustrated in Fig. 31.8 is the position of source S
at the instant of time $t = t' = 0$. According to Einstein's
relativity principle (5.1.2), the equation of a spherical
monochromatic wave (30.2.8), sent by the source at this instant of time
in the direction of the detector D, is of identical form in
frames of reference K and K':

$$a_0\, e^{i(\omega t + kx\cos\theta + ky\sin\theta + \delta)} \qquad \text{(a)}$$

and

$$a_0\, e^{i(\omega' t' + k'x'\cos\theta' + k'y'\sin\theta' + \delta')} \qquad \text{(b)}$$

Fig. 31.8

Here $\omega' = \omega_0$ and $\omega$ are the angular frequencies of vibration
of the source and detector, $k = \omega/c$ and $k' = \omega'/c$ are the
wavenumbers (it is assumed that the wave propagates in vacuum),
and $\theta$ and $\theta'$ are the angles between the directions of
observation and of velocity V (OX axis), measured in the frames
of reference of the detector (K) and of the source (K').
Expression (b) should be obtained from expression (a) by
replacing the variables x, y and t by x', y' and t' in accordance
with Lorentz's transformations (5.3.2):

$$\omega' t' + k'x'\cos\theta' + k'y'\sin\theta' + \delta' = \omega\,\frac{t' + Vx'/c^2}{\sqrt{1 - (V/c)^2}} + k\,\frac{x' + Vt'}{\sqrt{1 - (V/c)^2}}\cos\theta + ky'\sin\theta + \delta.$$

Consequently,

$$\omega' = \frac{\omega}{\sqrt{1 - (V/c)^2}}\left(1 + \frac{V}{c}\cos\theta\right), \qquad k'\cos\theta' = \frac{k}{\sqrt{1 - (V/c)^2}}\left(\cos\theta + \frac{V}{c}\right),$$

$$k'\sin\theta' = k\sin\theta \quad \text{and} \quad \delta' = \delta.$$

Hence, the relations describing the Doppler effect for
electromagnetic waves in vacuum are of the form:

$$\omega = \omega_0\,\frac{\sqrt{1 - (V/c)^2}}{1 + \dfrac{V}{c}\cos\theta}$$

and

$$\nu = \nu_0\,\frac{\sqrt{1 - (V/c)^2}}{1 + \dfrac{V}{c}\cos\theta}.$$

31.6.2 At low velocities of the wave source with respect to the
detector ($V \ll c$), the relativistic formula for the Doppler effect
(31.6.1) coincides with the classical formula (30.6.3):

$$\nu \approx \nu_0\left(1 - \frac{V}{c}\cos\theta\right).$$

When the source travels with respect to the detector along the
straight line that joins them ($\theta = 0$ or $\pi$), the longitudinal
Doppler effect is observed. If the source and detector approach
each other ($\theta = \pi$)

$$\nu = \nu_0\sqrt{\frac{1 + V/c}{1 - V/c}} > \nu_0,$$

whereas if they travel away from each other ($\theta = 0$)

$$\nu = \nu_0\sqrt{\frac{1 - V/c}{1 + V/c}} < \nu_0.$$

31.6.3 Following from relativistic theory of the Doppler effect
(31.6.1) is the existence of the transverse Doppler effect. It is
observed when $\theta = \pi/2$ and $\theta = 3\pi/2$, i.e. when the source
travels perpendicular to the line of observation. Thus

$$\nu = \nu_0\sqrt{1 - (V/c)^2}.$$

The transverse Doppler effect is substantially weaker than the
longitudinal effect (due to the smallness of V/c). This effect is
due to the different passage of time in the frames of reference in
which the detector and source of waves are fixed. Therefore,
the experimental detection of the transverse Doppler effect
was one of the most vital experimental proofs of the special
theory of relativity.
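
The relativistic Doppler relation of Sect. 31.6.1, nu = nu0 * sqrt(1 - (V/c)^2) / (1 + (V/c) cos(theta)), covers the longitudinal and transverse cases at once. The sketch below is illustrative; the function name and the sample speed V = 0.5c are assumptions.

```python
import math

C = 299_792_458.0  # speed of light in vacuum, m/s


def doppler_frequency(nu0, V, theta):
    """Frequency recorded by the detector for a source of frequency nu0 moving
    at speed V; theta is the angle between the direction of observation and V."""
    beta = V / C
    return nu0 * math.sqrt(1.0 - beta**2) / (1.0 + beta * math.cos(theta))


nu0 = 1.0  # work in units of the source frequency
approaching = doppler_frequency(nu0, 0.5 * C, math.pi)      # theta = pi
receding = doppler_frequency(nu0, 0.5 * C, 0.0)             # theta = 0
transverse = doppler_frequency(nu0, 0.5 * C, math.pi / 2)   # theta = pi/2
print(approaching, receding, transverse)
```

The transverse value equals sqrt(1 - (V/c)^2), which differs from 1 only in second order in V/c; this is why the transverse effect is much weaker than the longitudinal one at ordinary speeds.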

31.6.4 The Doppler effect, associated with the thermal motion of
the gas atoms that radiate light, leads to Doppler broadening
of spectral lines. The range of frequencies of light recorded by
the detector extends from $\nu_0 - \Delta\nu_D/2$ to $\nu_0 + \Delta\nu_D/2$, where
$\nu_0$ is the frequency of monochromatic radiation of a stationary
atom. The amount of Doppler broadening $\Delta\nu_D \propto \sqrt{T/m}$,
where T is the absolute temperature and m is the mass of the
atom.


CHAPTER  32  INTERFERENCE  OF  LIGHT


32.1  Monochromaticity  and  Time  Coherence  of  Light 

32.1.1  Optics  is  the  branch  of  physics  concerned  with  the  study 
of  the  nature  of  light  (31.4.3),  the  laws  of  its  production,  propa¬ 
gation  and  interaction  with  matter.  Wave  optics  deals  with  opti¬ 
cal  phenomena  in  which  the  wave  nature  of  light  is  manifested 
(for  instance,  interference,  diffraction,  polarization  and  disper¬ 
sion  of  light).  Since  light  comprises  electromagnetic  waves,  wave 
optics  is  based  on  Maxwell’s  equations  (27.4.3)  and  the  rela¬ 
tions  that  follow  from  them  for  electromagnetic  waves  (31.1). 
Classical  wave  optics  deals  with  media  that  are  linear  with 
respect  to  their  optical  properties  (30.1.7),  i.e.  media  in  which 
the dielectric constant $\varepsilon_r$ and the magnetic permeability $\mu_r$ are
independent  of  the  intensity  of  the  light  (31.2.4).  Consequently, 
the  principle  of  superposition  (30.4.1)  is  valid  in  wave  optics. 
Phenomena  that  are  observed  in  the  propagation  of  light  in 
nonlinear  media  are  investigated  in  nonlinear  optics.  Nonlinear 
optical  effects  become  appreciable  at  the  very  high  intensities  of 
light  radiated  by  powerful  lasers  (39.6.8). 

32.1.2  It  has  been  experimentally  established  that  the  effect  of 
light  on  a  photocell,  photographic  film,  fluorescent  screen  or 
other  devices  that  record  light  is  determined  by  the  electric 
vector  E  of  the  electromagnetic  field  set  up  by  the  light  wave. 
The  same  conclusion  is  reached  by  classical  electronic  theory, 
according  to  which  the  processes  caused  in  matter  by  light  are 
associated  with  the  effect  of  the  field  of  the  light  wave  on  charged 
particles  of  matter,  i.e.  on  the  electrons  and  ions.  The  frequency 
of visible and more short-waved light is so high ($\nu \ge 10^{15}$ Hz)
that only electrons can execute forced vibration of any
appreciable amplitude. The force exerted by the electromagnetic
field (24.1.5) on an electron is

$$\mathbf{F} = -e\,\{\mathbf{E} + [\mathbf{v}_1 \mathbf{B}]\} = -e\,\{\mathbf{E} + \mu_r \mu_0 [\mathbf{v}_1 \mathbf{H}]\}.$$

Here $-e$ and $\mathbf{v}_1$ are the charge and velocity of the electron,
and $\mathbf{B} = \mu_r \mu_0 \mathbf{H}$ is the vector of magnetic induction. It follows
from Sect. 31.1.5 that the absolute value of the magnetic
component of force F is substantially less than its electrical
component. Thus

$$\mu_r \mu_0\, |[\mathbf{v}_1 \mathbf{H}]| \le \mu_r \mu_0\, v_1 H = \frac{v_1}{v}\,E \ll E \quad \text{and} \quad \mathbf{F} \approx -e\mathbf{E},$$

because the velocity of electromagnetic waves is $v \approx 10^8$ m/s,
and the velocity of the electron in the atom upon forced
vibrations set up by light is $v_1 \approx 10^6$ m/s.

32.1.3 In superposing light from two nonlaser sources* (for
instance, two identical gas-discharge lamps), or even from
different portions of the same source, no interference (30.5.2)
is observed. Consequently, independent sources of light are
incoherent (30.5.1) and their radiation is nonmonochromatic
(31.1.6). The reason for this resides in the very mechanism by
which light is radiated by the atoms (or molecules or ions)
of the source. The excited atom radiates during an extremely
short de-excitation time, of the order of $\tau \approx 10^{-8}$ s (31.3.4),
after which it returns to its normal (unexcited) state, having
expended its surplus energy on radiation. Following a certain
period of time the atom may again be excited, obtaining energy
externally, and begin to emit radiation again. Such intermittent
radiation of light by atoms in the form of separate short
impulses (wave trains, or packets) is typical of any light source,
regardless of the specific processes that occur in the source and
excite its atoms. In spontaneous radiation (39.6.6), the atoms
emit radiation independently of one another with random initial
phases that vary without any order from one event of emission
to another. Therefore, spontaneously radiating atoms are
incoherent sources of light.

Induced, or stimulated, emission of radiation is quite
another matter. It is produced in a nonequilibrium (active)
medium by the action of a variable electromagnetic field (39.6.1).
Stimulated emission of radiation by all the particles of a system
is coherent with the monochromatic radiation that it is produced
by, having the same frequency, polarization and direction of
propagation. These features of stimulated emission of radiation
are made use of in lasers and masers (39.8.8).


* This refers to ordinary light sources (incandescent lamps,
discharge lamps, electric arcs, etc.), based on the principle of
spontaneous radiation (39.6.6).

32.1.4 A real wave, radiated during a limited period of time
and encompassing a limited region of space, is not monochromatic.
The spectrum of its angular frequencies (28.4.7) is of finite
width $\Delta\omega$, i.e. it includes angular frequencies from $\omega - \Delta\omega/2$
to $\omega + \Delta\omega/2$. During the time interval $\Delta t \ll \tau_{coh} = \pi/\Delta\omega$
such a wave can be considered as an approximately monochromatic
wave with the angular frequency $\omega$. The quantity $\tau_{coh}$ is
called the coherence time of a nonmonochromatic wave. During
the time interval $\tau_{coh}$ the phase difference of the vibrations
corresponding to waves of the frequencies $\omega + \Delta\omega/2$ and
$\omega - \Delta\omega/2$ changes by $\pi$. A wave with the angular frequency $\omega$
and phase velocity v propagates during this time over the distance

$$l_{coh} = v\,\tau_{coh} = \pi v/\Delta\omega.$$

The quantity $l_{coh}$ is called the coherence length, or the harmonic
wave train length, of the nonmonochromatic wave being considered.
The closer a given wave is to a monochromatic one, the less the
width $\Delta\omega$ of its frequency spectrum and the greater its
coherence time and coherence length. For example, for visible
sunlight, having a continuous frequency spectrum from
$4 \times 10^{14}$ to $8 \times 10^{14}$ Hz, $\tau_{coh} \approx 10^{-15}$ s and
$l_{coh} \approx 10^{-6}$ m. The coherence time of stimulated radiation is
substantially longer than the de-excitation time of the atom (31.3.4).
For example, $\tau_{coh}$ reaches $10^{-5}$ s for a continuous-wave laser
and the coherence length $l_{coh} \approx 10^3$ m.
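
The definitions tau_coh = pi / delta_omega and l_coh = v * tau_coh of Sect. 32.1.4 can be evaluated for a given spectral width. The sketch below is illustrative: the function names are hypothetical, and the band of 4e14 to 8e14 Hz is taken as an assumed extent of the visible spectrum, with propagation in vacuum.

```python
import math

C = 299_792_458.0  # speed of light in vacuum, m/s


def coherence_time(delta_omega):
    """tau_coh = pi / delta_omega, with delta_omega in rad/s."""
    return math.pi / delta_omega


def coherence_length(delta_omega, v=C):
    """l_coh = v * tau_coh for a wave with phase velocity v."""
    return v * coherence_time(delta_omega)


# Assumed visible band: 4e14 Hz to 8e14 Hz.
delta_omega = 2.0 * math.pi * (8.0e14 - 4.0e14)
tau = coherence_time(delta_omega)
length = coherence_length(delta_omega)
print(f"tau_coh = {tau:.2e} s, l_coh = {length:.2e} m")
```

With delta_omega = 2*pi*delta_nu the coherence time reduces to 1/(2*delta_nu), so the result depends only on the width of the band, not on where the band lies.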

32.2  Interference  of  Light.  Spatial  Coherence  of  Light 

32.2.1  To  obtain  coherent  light  waves  by  means  of  conventional 
(nonlaser)  light  sources,  light  from  a  single  source  is  split  into 
two  or  more  systems  of  waves.  Each  system  represents  the 
radiation  of  the  same  atoms  of  the  source.  Hence,  by  virtue  of 
their  common  origin,  these  systems  of  waves  are  coherent  with 
one  another  and  interfere  when  superposed.  Light  can  be  split 
into  coherent  systems  of  waves  by  its  reflection  or  refraction. 
Illustrated as an example in Fig. 32.1 is the principle of the
Fresnel mirrors. Light from point source S is incident on two
plane mirrors $A_1O$ and $A_2O$ arranged perpendicular to the
plane of the drawing and joined along line O. The angle $\alpha$
between the planes of the mirrors is very small. Light from source S
propagates after being reflected as two beams having their
centres at points $S_1$ and $S_2$, which are the virtual images of
source S in the mirrors. These beams are coherent and upon being
superposed produce an interference pattern on screen E (the region
BC is sometimes called the field of interference). The result of the
interference at some point M on the screen depends upon the
wavelength $\lambda$ of the light and the path difference of the waves
(30.5.2) from the coherent virtual sources $S_1$ and $S_2$ to point M.
Thus

$$\Delta = r_2 - r_1 = MS_2 - MS_1.$$

The initial phases of vibration of sources $S_1$ and $S_2$ are the
same. Hence, the conditions for interference maxima and minima
(30.5.2) are of the form

$$r_2 - r_1 = \pm m\lambda \quad \text{for a maximum of the } m\text{th order } (m = 0, 1, 2, \ldots),$$

$$r_2 - r_1 = \pm(2m - 1)\frac{\lambda}{2} \quad \text{for a minimum of the } m\text{th order } (m = 1, 2, \ldots).$$

The angle $2\omega$ at apex S between the two rays of light that
converge at point M of the interference pattern after being reflected
from mirrors $A_1O$ and $A_2O$ is called the aperture angle of the
interference. This angle usually varies very little upon changing
the position of point M within the limits of the field of
interference.

32.2.2  The  principles  of  the  techniques  used  for  observing  inter¬ 
ference  by  means  of  the  Fresnel  biprism  (Fig.  32.2)  and  the 
Billet  split  lens  (Fig.  32.3)  are  similar  to  those  of  the  Fresnel 
mirrors.  The  biprism  consists  of  two  identical  triangular  prisms 
joined  together  at  their  bases  and  prepared  as  a  whole.  The 
refracting  angles  a  at  the  upper  and  lower  edges  of  the  biprism 
are very small (of the order of a fraction of one degree). Light
from source $S$ is refracted in the biprism and propagates beyond it
in the form of two systems of waves, corresponding to the coherent
virtual sources of light $S_1$ and $S_2$. The interference of these waves
is observed in the region of their overlapping on screen $E$.

Fig. 32.1

The Billet split lens consists of two halves $L_1$ and $L_2$ of a
converging lens, cut along a diameter. The two halves are slightly
displaced laterally (spread apart). In consequence, they produce
two noncoinciding real images $S_1$ and $S_2$ of the point source of
light $S$. The interference of light from these coherent secondary
sources is observed on screen $E$. The space between parts $L_1$ and
$L_2$ of the split lens is covered by opaque screen $A$.

The value of the aperture angle $2\omega$ of interference is shown in
Figs. 32.2 and 32.3 for the central point $M_0$ of the interference
pattern obtained by means of a biprism and a split lens.

32.2.3 The width of an interference fringe (as the bands are
called) is the distance between two adjacent interference maxima
(or minima). When the Fresnel mirrors or similar devices (biprisms
or split lenses) are employed to obtain interference, the
width of an interference fringe (30.5.2) is $\lambda L/l$. Here $l$ is the
distance between sources $S_1$ and $S_2$, and $L$ is the distance from
them to screen $E$. The wavelength of visible light is extremely
short ($\lambda \approx 5 \times 10^{-7}$ m). Therefore, to obtain an
interference fringe of a width that can be resolved by the eye, it is
necessary to comply with the condition $l \ll L$. Consequently, the
angle $\alpha$ in the Fresnel mirrors and the refracting angles $\alpha$
of the biprism must be very small.
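
As a rough numerical illustration of the fringe-width relation $\lambda L/l$, the following sketch evaluates it for one set of values. The function name and the sample numbers (a 1 mm source separation and a 1 m screen distance) are assumptions chosen only for illustration.

```python
def fringe_width(wavelength, source_separation, screen_distance):
    """Width of an interference fringe, lambda * L / l (all lengths in metres)."""
    return wavelength * screen_distance / source_separation


# Assumed illustrative values: lambda = 5e-7 m, l = 1 mm, L = 1 m.
width = fringe_width(5e-7, 1e-3, 1.0)
print(f"fringe width = {width:.1e} m")
```

With these assumed values the fringe is about half a millimetre wide, which shows why $l$ must be much smaller than $L$ for the fringes to be resolved by the eye.
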
The possibility of observing interference fringes also depends on
the degree of contrast, i.e. on the degree of different illumination
of the screen at the maxima and minima. The illuminance is
proportional to the intensity $I$ of the incident light. A
quantitative characteristic of the degree of contrast of the
interference pattern is the dimensionless quantity called the
visibility of fringes

$$V = \frac{I_{max} - I_{min}}{I_{max} + I_{min}},$$

where $I_{max}$ and $I_{min}$ are the values of the light intensities in the
interference maxima and minima on the screen. The human eye
reliably distinguishes fringes if their visibility $V > 0.1$,
i.e. when $I_{min} < 0.82\, I_{max}$.

In superposing two likewise polarized coherent monochromatic
waves, with the amplitudes $A_1$ and $A_2$ and intensities $I_1$ and $I_2$,
the visibility of the fringes is

$$V = \frac{2A_1A_2}{A_1^2 + A_2^2} = \frac{2\sqrt{I_1 I_2}}{I_1 + I_2}.$$

The visibility of the fringes is a maximum ($V = 1$) when $A_1 = A_2$.
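
The two visibility formulas above can be checked against each other numerically. The sketch below is illustrative only: the function names are assumptions, and the extreme intensities are taken as $(A_1 + A_2)^2$ and $(A_1 - A_2)^2$, the usual result for two fully coherent waves.

```python
import math


def visibility_from_extrema(i_max, i_min):
    """V = (I_max - I_min) / (I_max + I_min)."""
    return (i_max - i_min) / (i_max + i_min)


def visibility_two_waves(i1, i2):
    """V = 2 sqrt(I1 I2) / (I1 + I2) for two coherent waves."""
    return 2.0 * math.sqrt(i1 * i2) / (i1 + i2)


# Assumed amplitudes A1 = 2, A2 = 1 (arbitrary units).
a1, a2 = 2.0, 1.0
v_direct = visibility_two_waves(a1 ** 2, a2 ** 2)
v_extrema = visibility_from_extrema((a1 + a2) ** 2, (a1 - a2) ** 2)
print(v_direct, v_extrema)
```

Both routes give the same visibility, and equal amplitudes give $V = 1$.
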

32.2.4 In an interference arrangement of the type of the Fresnel
mirrors, illuminated by a point source $S$ (Fig. 32.1), the
superposed waves are never ideally monochromatic (32.1.3). Hence,
these waves are only partly coherent. They are capable of
interference only under the condition that the vibrations excited by
them at the point $M$ being considered on the screen correspond to
one and the same harmonic train, or packet, of radiation of
source $S$ (32.1.4), i.e. if

$$|r_2 - r_1| < v\tau_{coh} \quad \text{or} \quad |r_2 - r_1| < l_{coh}.$$

Here $r_2 - r_1$ is the path difference of the superposed waves, $v$ is
their velocity, and $\tau_{coh}$ and $l_{coh}$ are the coherence time and
coherence length of light from source $S$ (32.1.4). Partly coherent
vibrations are combined at point $M$. These vibrations are excited
by the same source $S$ at different instants of time, $t$ and $t + \tau$,
where $\tau = |r_2 - r_1|/v$. Therefore, the visibility of the
interference pattern in such arrangements essentially depends on the
time coherence of the vibrations, which is limited by the
monochromaticity of the light from source $S$, i.e. its coherence time
$\tau_{coh}$ (32.1.4). At $\tau \ll \tau_{coh}$, the superposed vibrations are
practically fully coherent and the visibility of the interference fringes
(at equal intensity of the superposed waves) $V \approx 1$. If, however,
$\tau \geq \tau_{coh}$, the superposed vibrations are incoherent and there
is no interference ($V = 0$).

Thus, to observe the interference of light at large path
differences $r_2 - r_1$ (and corresponding large values of $\tau$) it is
necessary for the light to have a sufficiently long coherence time, i.e. a
sufficiently high degree of monochromaticity.

32.2.5 The positions on the screen of all the interference maxima,
with the exception of the zero-order maximum, depend upon
the wavelength of the light. For two wavelengths $\lambda_1$ and $\lambda_2$, the
higher the order $m$ to be observed, the more their maxima of
the $m$th order are displaced from each other. Therefore, as $m$ is
increased, the visibility of interference fringes, obtained by the
illumination of Fresnel mirrors with nonmonochromatic light,
is impaired. Fringes corresponding to different values of $\lambda$
are superposed on one another, and the interference pattern is
blurred.

Assume that the wavelength of light is within the limits from $\lambda$
to $\lambda + \Delta\lambda$, and the angular frequency, from $\omega$ to
$\omega - \Delta\omega$, where $\Delta\omega = 2\pi v \Delta\lambda/\lambda^2$. Then,
according to the Rayleigh criterion, the interference pattern is still
distinguishable up to a maximum of the $m_0$ order for light with the
wavelength $\lambda + \Delta\lambda$ (where $\Delta\lambda > 0$), which is
superposed on the screen on the nearest interference minimum for light
of wavelength $\lambda$:

$$m_0(\lambda + \Delta\lambda) = (2m_0 + 1)\frac{\lambda}{2}, \quad \text{from which} \quad m_0 = \frac{\lambda}{2\Delta\lambda}.$$

Thus, interference can be observed at path differences of the
waves that comply with the condition

$$|r_2 - r_1| < \frac{\pi v}{\Delta\omega} \sim l_{coh}.$$

This result agrees with the assessment made in Sect. 32.2.4 on
the basis of the concept of time coherence of vibrations.
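
The estimate $m_0 = \lambda/(2\Delta\lambda)$ and the corresponding limiting path difference $m_0\lambda = \lambda^2/(2\Delta\lambda) = \pi v/\Delta\omega$ can be evaluated with a short sketch. The function names and the sample bandwidth of 1 nm are assumptions made purely for illustration.

```python
import math


def max_observable_order(wavelength, delta_wavelength):
    """Highest distinguishable order, m0 = lambda / (2 * delta_lambda)."""
    return wavelength / (2.0 * delta_wavelength)


def max_path_difference(wavelength, delta_wavelength):
    """Limiting path difference, m0 * lambda = lambda**2 / (2 * delta_lambda)."""
    return wavelength ** 2 / (2.0 * delta_wavelength)


# Assumed illustrative values: lambda = 500 nm, delta_lambda = 1 nm, v = 3e8 m/s.
lam, dlam, v = 500e-9, 1e-9, 3.0e8
delta_omega = 2.0 * math.pi * v * dlam / lam ** 2  # bandwidth in angular frequency

print(max_observable_order(lam, dlam))   # order m0
print(max_path_difference(lam, dlam))    # path difference in metres
print(math.pi * v / delta_omega)         # the same limit written as pi*v/delta_omega
```

The last two printed values coincide, which is the agreement between the wavelength and frequency forms of the condition.
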

32.2.6 Partly coherent light, of a total intensity $I$, can be dealt
with as the combination of two components: the coherent
component of intensity $\gamma I$, where $\gamma$ is the degree of coherence of
the light, and the incoherent component of the intensity $(1 - \gamma) I$.
When partly coherent waves are superposed, only their coherent
components interfere. The incoherent components produce the
uniformly illuminated background of the interference pattern.
Therefore, as the degree of coherence of the light is reduced, the
visibility $V$ of the interference fringes decreases. Thus

$$V = \frac{2\sqrt{I_1 I_2}}{I_1 + I_2}\,\gamma.$$

When the intensities of partly coherent waves are the same,
$V = \gamma$.

32.2.7 As a rule, in an interference device with mirrors (or
a biprism), a brightly illuminated narrow slit, parallel to the
joint $O$ between the mirrors, is used instead of a point source of
light. In this case, the interference patterns, obtained on the
screen from the various portions along the length of the slit,
are displaced with respect to one another in the direction of
the slit $S$. Accordingly, a system of interference fringes, parallel
to joint $O$ of the mirrors, is observed on the screen.

The visibility of the interference fringes is reduced as the width
of the slit $S$ is increased. This is due to the fact that the
interference fringes that could be obtained on the screen from the
various narrow slits into which slit $S$ could conceivably be divided
would be displaced with respect to one another. An interference
pattern in monochromatic light of wavelength $\lambda$ is sharp and
distinct if the following approximate condition is complied
with:

$$b \sin\omega < \lambda/4,$$

where $b$ is the slit width and $2\omega$ is the aperture angle of
interference (32.2.1).

32.2.8 The principle of Young's two-slit interference method is
illustrated in Fig. 32.4. The source of light is a brightly
illuminated narrow slit $S$ in screen $A_1$. Light from slit $S$ falls on a
second opaque screen $A_2$ having two identical narrow slits $S_1$ and
$S_2$, parallel to slit $S$. Two systems of cylindrical waves are
propagated in the space beyond screen $A_2$; their interference is
observed on screen $E$. The visibility of the interference fringes for
small path differences is determined mainly by the degree of
agreement in the way the vibrations are executed at points of
slits $S_1$ and $S_2$. These slits can be considered as the "sources" of
the waves interfering on the screen.

32.2.9 The coherence of vibrations executed at the same instant
of time at various points of a plane $Q$, perpendicular to the
direction of wave propagation, is called spatial coherence (as
distinguished from time coherence of vibrations, executed at one
and the same point, but at different instants of time).
Spatial coherence depends upon the conditions for the
radiation and formation of light waves. For instance, a
light wave emitted by a point source has complete spatial
coherence. In the case of an ideal plane wave, the amplitude and
phase of the vibration are the same at all points of plane $Q$, i.e.
there is also complete spatial coherence. Spatial coherence is also
maintained over the whole cross section of a light beam emitted by a
laser. In a real wave, emitted by a host of independent atoms of an
extended nonlaser light source, the phase difference of the
vibrations at two points $K_1$ and $K_2$ of plane $Q$ is a random
function of time. The random variations of the phase difference
increase with the distance between the points. The length of spatial
coherence is the distance $l_{sc}$ between points $K_1$ and $K_2$ of plane
$Q$ at which the random changes of phase difference reach a
value equal to $\pi$. If in Young's two-slit interference arrangement
the distance $l$ between slits $S_1$ and $S_2$ is greater than or equal to
$l_{sc}$, the visibility of the interference fringes equals zero. To
ensure spatial coherence in the illumination of slits $S_1$ and $S_2$,
the width $b$ of inlet slit $S$ should be sufficiently narrow. Thus

$$b < \lambda d/l \quad \text{and} \quad \theta < \lambda/l,$$

where $d$ is the distance between screens $A_1$ and $A_2$, and $\theta = b/d$
is the angular dimension of the light source, i.e. slit $S$.

The length of spatial coherence $l_{sc} = \lambda/\theta$ increases with the
distance from the light source. For example, for a star of
diameter $D$, located at the distance $r$, $\theta = D/r$ and $l_{sc} = \lambda r/D$.
The area of a circle of radius $l_{sc}$ is called the dimension of spatial
coherence, and the volume of a right cylinder having such a circle
as its base and a height equal to the length of the harmonic wave
train (32.1.4) $l_{coh} = c\tau_{coh}$ is called the volume of coherence.
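
To give a feeling for the magnitudes involved in $l_{sc} = \lambda r/D$ and $b < \lambda d/l$, here is a small sketch. The function names are hypothetical, and the numbers (the Sun's diameter and distance, and a two-slit geometry with $d = 1$ m and $l = 1$ mm) are assumptions chosen for illustration rather than values from the text.

```python
def spatial_coherence_length(wavelength, source_diameter, distance):
    """l_sc = lambda / theta, with theta = D / r the angular size of the source."""
    theta = source_diameter / distance
    return wavelength / theta


def max_inlet_slit_width(wavelength, screen_gap, slit_separation):
    """Upper bound on the inlet slit width, b < lambda * d / l."""
    return wavelength * screen_gap / slit_separation


# Assumed values: Sun diameter ~1.39e9 m, Earth-Sun distance ~1.496e11 m.
l_sc_sun = spatial_coherence_length(5e-7, 1.39e9, 1.496e11)
print(f"l_sc for sunlight ~ {l_sc_sun:.1e} m")

# Assumed two-slit geometry: d = 1 m between screens, l = 1 mm between slits.
print(f"b must be below {max_inlet_slit_width(5e-7, 1.0, 1e-3):.1e} m")
```

Under these assumptions direct sunlight is spatially coherent only over a few hundredths of a millimetre, which is why a narrow inlet slit is needed in Young's arrangement.
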

32.3  Interference  of  Light  in  Thin  Films 

32.3.1 Examples of light interference observed under everyday
conditions are the iridescent colouring of thin films (soap bubbles,
films of petroleum or other oils on the surface of water,
transparent films of oxides on the surfaces of heat-treated metal
components, called temper colours, etc.). The formation of partly
coherent waves, displaying interference upon superposition,
occurs in these cases due to the reflection of light, incident on the
film, from its upper and lower surfaces. This interference depends
upon the phase difference acquired by the superposed waves in
the film, which in turn depends upon their optical path difference.

32.3.2 The optical path, or distance, $s$ of light is the product of
the geometric path length $l$ travelled by light in the medium
by the refractive index of the medium (31.5.1): $s = nl$. The
quantity $s$ is equal to the path travelled by light in vacuum
during the time it covers the path $l$ in the given medium.

The optical-path difference of two waves is the difference in the
optical paths of these waves: $\Delta s = s_2 - s_1$ (the optical-path
difference is frequently denoted by $\Delta$ or $\delta$). An optical-path
difference $\Delta s$ corresponds to a change in the phase by
$\Delta\Phi = 2\pi\Delta s/\lambda_0$, where $\lambda_0 = n\lambda$ is the wavelength of the
light in vacuum, and $\lambda$ is the light wavelength in a medium with the
refractive index $n$. Paths of propagation of waves having the same
optical length are said to be tautochronous. Light requires equal
amounts of time to travel them. In an optical system (microscope,
telescope, etc.), for example, all possible paths of the rays of light
from some point on the object to the corresponding point on its
image are tautochronous.
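
The relations $s = nl$ and $\Delta\Phi = 2\pi\Delta s/\lambda_0$ translate directly into code. The sketch below uses hypothetical function names and assumed sample values (a 2 mm path in glass with $n = 1.5$).

```python
import math


def optical_path(refractive_index, geometric_length):
    """Optical path s = n * l."""
    return refractive_index * geometric_length


def phase_change(path_difference, vacuum_wavelength):
    """Phase change corresponding to an optical-path difference, 2*pi*ds/lambda0."""
    return 2.0 * math.pi * path_difference / vacuum_wavelength


# Assumed example: 2 mm of glass (n = 1.5) compared with 2 mm of vacuum.
ds = optical_path(1.5, 2e-3) - optical_path(1.0, 2e-3)
print(ds)                      # optical-path difference in metres
print(phase_change(ds, 5e-7))  # corresponding phase change in radians
```

A path difference of half a vacuum wavelength gives a phase change of $\pi$, as expected.
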

32.3.3 Assume that a plane monochromatic light wave 1 is
incident at the angle $i$ on a plane-parallel homogeneous plate of
thickness $d$ that is transparent to light (Fig. 32.5). Owing to the
reflection of light from the upper and lower surfaces of the plate,
two plane waves propagate in the directions of the reflected rays
$1'$ and $1''$. Their optical-path difference is

$$\Delta s = n(AB + BC) - \left(n_1 AD + \frac{\lambda_0}{2}\right) = 2dn\cos r - \frac{\lambda_0}{2},$$

where $n_1$ and $n$ are the refractive indices of the surrounding
medium and of the plate, $D$ is the foot of a perpendicular erected
on ray $1'$ from point $C$, the term $\lambda_0/2$ takes into account the
phase difference of $\pi$ as light is reflected from point $A$ (31.5.5),*
$r$ is the refractive angle, and $\lambda_0$ is the wavelength of the
light in vacuum.

The conditions for interference maxima of reflection are

$$2dn\cos r = (2m + 1)\frac{\lambda_0}{2} \quad \text{or} \quad 2d\cos r = (2m + 1)\frac{\lambda}{2},$$

where $m = 0, 1, 2, \ldots$ is the order of the interference maximum.

Fig. 32.5

The conditions for interference minima of reflection are

$$2dn\cos r = m\lambda_0 \quad \text{or} \quad 2d\cos r = m\lambda,$$

where $m = 1, 2, \ldots$ is the order of the interference minimum.
The optical-path difference for the waves passing through the
plate (rays $2'$ and $2''$) is $\Delta s = 2dn\cos r$, i.e. it differs from $\Delta s$
for the reflected light by $\lambda_0/2$. Therefore, the maxima of reflection
correspond to the minima of transmission of light and vice versa.
If the plate is illuminated with white light, it has complementary
colouring in the reflected and transmitted light.

The maximum thickness $d$ of the plate at which it is still
possible to observe interference fringes is limited by the coherence time
$\tau_{coh}$ of the light (32.1.4). Thus

$$2dn\cos r < c\tau_{coh} \quad \text{or} \quad 2d\cos r < l_{coh},$$

where $l_{coh} = v\tau_{coh} = c\tau_{coh}/n$ is the coherence length.
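
The reflection-maximum condition $2dn\cos r = (2m+1)\lambda_0/2$ tells which vacuum wavelengths a film of given thickness reinforces. The following sketch lists them within an assumed visible band of 380 to 760 nm. The function names, the band limits, and the sample film (300 nm thick, $n = 1.33$, in air, so that $n > n_1$) are assumptions for illustration.

```python
import math


def cos_refraction_angle(incidence_rad, n, n1=1.0):
    """cos r from the law of refraction, n1 * sin i = n * sin r."""
    sin_r = n1 * math.sin(incidence_rad) / n
    return math.sqrt(1.0 - sin_r ** 2)


def reflection_maxima(d, n, incidence_rad=0.0, lam_min=380e-9, lam_max=760e-9):
    """Vacuum wavelengths satisfying 2*d*n*cos r = (2m + 1) * lambda0 / 2."""
    two_dn_cos_r = 2.0 * d * n * cos_refraction_angle(incidence_rad, n)
    wavelengths = []
    m = 0
    while True:
        lam0 = 2.0 * two_dn_cos_r / (2 * m + 1)
        if lam0 < lam_min:       # higher orders only give shorter wavelengths
            break
        if lam0 <= lam_max:
            wavelengths.append(lam0)
        m += 1
    return wavelengths


# Assumed example: a 300 nm film with n = 1.33 at normal incidence.
print([round(lam * 1e9, 1) for lam in reflection_maxima(300e-9, 1.33)])
```

For this assumed film only one order falls in the visible band, so the film looks coloured in reflected white light.
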

32.3.4 In the calculation of the optical-path difference of
interfering waves in a plate (32.3.3) only two waves were taken into
account. They correspond to the first reflection from the upper
and from the lower surfaces of the plate; the possibility of
multiple reflection of light was neglected. Such simplification is
justified only if the intensity $I_2$ of the wave corresponding to the
second reflection from the lower surface of the plate is
substantially less than the intensity $I_1$ of the wave corresponding to the
first reflection. If $R$ is the coefficient of reflection (31.5.6) from
the upper and lower surfaces of the plate, then $I_2 = R^2 I_1$. As
a rule $R^2 \ll 1$. At the interface, for instance between air and
glass ($n_{21} = 1.5$), at angles of incidence $i < 50°$, the
coefficient of reflection $R < 0.05$. In certain special
cases, when $I_2$ is comparable to $I_1$, it is necessary
to deal with multiwave interference (32.4).

* It is assumed that $i < i_{Br}$ and that $n > n_1$. When
$n < n_1$, a phase difference of $\pi$ occurs upon reflection of light
from point $B$ and $\Delta s = 2dn\cos r + \lambda_0/2$, i.e. it differs from
the value of $\Delta s$ for the case when $n > n_1$ by $\lambda_0$, but this
has no effect on the result of the interference.

32.3.5 In considering the interference of light in thin
films, distinction is made between interference fringes
of equal inclination (isoclinic fringes) and those of
equal thickness (isopachic fringes). Equal-inclination fringes are
observed when a diverging (or converging) beam of light is incident
at various angles $i$ on a thin plane-parallel film. These, for
instance, are the conditions that prevail when the film is illuminated
by an extended source or scattered sunlight. Since $d$ and $n$ are the
same all over the film, the optical-path difference of the interfering
waves varies along the surface of the film only owing to the variation
in the angle of incidence $i$. The conditions of interference are the
same for all rays incident on the surface of the film and reflected from
it at the same angle. Consequently, the interference pattern in
this case is said to be made up of equal-inclination fringes.
Equal-inclination fringes are observed on screen $E$ arranged in
the focal plane of a convergent lens $L$ (Fig. 32.6). With no lens
available the interference pattern could only be observed at
infinity, at the point of intersection of the parallel rays $1'$ and
$1''$, $2'$ and $2''$, etc. Therefore, equal-inclination fringes are said to
be localized at infinity. For their visual observation it would be
necessary for the eyes to accommodate to infinity.


Fig.  32.6 
32.3.6 Equal-thickness fringes are observed in reflecting a
parallel or almost parallel beam of light rays ($i$ = const) from a thin
transparent film whose thickness $d$ is not the same at various
places. The optical-path difference of the interfering waves varies
in going from certain points on the surface of the film to other
points with a corresponding change in the film thickness $d$. Hence
the conditions of interference are the same for points having
the same values of $d$. This is why the interference pattern being
considered is said to be made up of equal-thickness fringes.
Equal-thickness fringes are localized close to the surface of the
film. Hence, they can be observed by accommodation of the eyes
practically to the film's surface.

When light interferes in a thin transparent wedge with a small wedge
angle $\alpha$, equal-thickness fringes have the form of straight bands
parallel to the edge of the wedge. If the wedge is illuminated by
monochromatic light with the wavelength $\lambda_0$ in vacuum,
normally incident on the wedge surface ($i = 0$), the width of the
interference fringes (32.2.3) is equal to $\lambda_0/(2n\alpha)$, where $n$ is the
absolute refractive index of the wedge material.
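
The wedge fringe width $\lambda_0/(2n\alpha)$ follows from requiring that the optical-path difference $2dn$ change by $\lambda_0$ between adjacent fringes while the thickness grows as $d = \alpha x$ along the wedge. A small sketch makes the scale concrete; the function name and the sample wedge (glass with $n = 1.5$ and $\alpha = 10^{-4}$ rad) are assumptions.

```python
def wedge_fringe_width(vacuum_wavelength, refractive_index, wedge_angle_rad):
    """Fringe spacing on a thin wedge at normal incidence, lambda0 / (2 * n * alpha)."""
    return vacuum_wavelength / (2.0 * refractive_index * wedge_angle_rad)


# Assumed example: lambda0 = 500 nm, n = 1.5, alpha = 1e-4 rad.
width = wedge_fringe_width(500e-9, 1.5, 1e-4)
print(f"fringe width = {width * 1e3:.2f} mm")

# Thickness change between adjacent fringes: 2 * n * (alpha * width) equals lambda0.
print(2.0 * 1.5 * 1e-4 * width)
```
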

32.3.7 Equal-thickness fringes having the form of concentric rings,
and called Newton's rings, are observed in the interference of light
in the thin air clearance between a flat glass plate $A$ and a
plano-convex lens $L$ pressed tightly against the plate
(Fig. 32.7). The flat surface of the lens is parallel to the plate
surface and light is normally incident to the lens surface. The
centre of Newton's rings coincides with point $O$ of contact
between the lens and plate. At a small distance $r$ from point $O$
the optical-path difference of the waves reflected from the upper
and lower surfaces of the air clearance is

$$\Delta s = \frac{n r^2}{R} + \frac{\lambda_0}{2},$$

where $n \approx 1$ (for air), and $R$ is the radius of the convex surface
of the lens.

Note. In calculations concerning Newton's rings, the wave
reflected from the upper (flat) surface of the lens need not be taken
into account because the optical-path difference between this
wave and the waves reflected from the boundaries of the air
clearance is greater than the coherence length for nonlaser
light (32.3.3).

In reflected monochromatic light with the wavelength in air
$\lambda = \lambda_0/n \approx \lambda_0$, the radii of the dark and bright Newton's
rings are

$$r_{dark} = \sqrt{mR\lambda} \quad \text{and} \quad r_{bright} = \sqrt{(2m + 1)\frac{R\lambda}{2}},$$

where $m = 0, 1, 2, \ldots$. The dark spot in the centre of the
pattern corresponds to the change in the phase of the wave
by $\pi$ when it is reflected from the lower surface of the air
clearance. If white light is incident on the lens, a central dark spot
is observed in the reflected light, surrounded by a system of
coloured rings. These rings correspond to the interference maxima of
the reflection of light with various values of wavelength $\lambda$.
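
The ring radii $r_{dark} = \sqrt{mR\lambda}$ and $r_{bright} = \sqrt{(2m+1)R\lambda/2}$ are easy to tabulate. In the sketch below the function names and the lens radius of 1 m are assumptions used only for illustration.

```python
import math


def dark_ring_radius(m, lens_radius, wavelength):
    """Radius of the m-th dark ring in reflected light, sqrt(m * R * lambda)."""
    return math.sqrt(m * lens_radius * wavelength)


def bright_ring_radius(m, lens_radius, wavelength):
    """Radius of the m-th bright ring in reflected light, sqrt((2m + 1) * R * lambda / 2)."""
    return math.sqrt((2 * m + 1) * lens_radius * wavelength / 2.0)


# Assumed example: R = 1 m, lambda = 500 nm.
for m in range(4):
    dark = dark_ring_radius(m, 1.0, 5e-7)
    bright = bright_ring_radius(m, 1.0, 5e-7)
    print(m, f"{dark * 1e3:.3f} mm", f"{bright * 1e3:.3f} mm")
```

The radii grow as the square root of the order, so the rings crowd together away from the centre, and $m = 0$ reproduces the central dark spot.
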


32.4  Multiwave  Interference

32.4.1 Special interference instruments, such as the diffraction
grating (33.3.4), the Fabry-Perot etalon and others, are used to obtain
interference of many light waves with close or equal amplitudes.

The amplitude $A$ of the resultant vibration and its intensity
$I = A^2$ at arbitrary point $M$ can be found by applying
the method of vector diagrams for adding oscillations or
vibrations having the same direction (28.4.2).

32.4.2 Shown in Fig. 32.8 is a vector diagram of the addition
of vibrations in the interference of $N$ waves. At the point $M$ being
considered, these waves set up coherent vibrations in the
same direction with equal amplitudes $A_i = A_1$ and a phase
difference, independent of $i$, between the $(i + 1)$th and $i$th
vibrations. Thus $\Phi_{i+1}(t) - \Phi_i(t) = \Delta\varphi_0$. The amplitude of the
resultant vibration is

$$A = 2\,OO_1 \left|\sin\frac{\alpha}{2}\right|, \quad \text{where} \quad \alpha = 2\pi - N\Delta\varphi_0 \quad \text{and} \quad OO_1 = \frac{A_1}{2\left|\sin\dfrac{\Delta\varphi_0}{2}\right|}.$$

Hence,

$$A = A_1 \left|\frac{\sin\dfrac{N\Delta\varphi_0}{2}}{\sin\dfrac{\Delta\varphi_0}{2}}\right| \quad \text{and} \quad I = I_1 \frac{\sin^2\dfrac{N\Delta\varphi_0}{2}}{\sin^2\dfrac{\Delta\varphi_0}{2}},$$

where $I_1 = A_1^2$ is the intensity of the vibration set up at point $M$
by each of the $N$ interfering waves separately.
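
The closed-form intensity $I = I_1 \sin^2(N\Delta\varphi_0/2)/\sin^2(\Delta\varphi_0/2)$ can be cross-checked by adding the $N$ vibrations directly as complex phasors, which is the algebraic counterpart of the vector diagram. The sketch below is illustrative: the function names are assumptions, and the limiting value $N^2 I_1$ is returned where the denominator vanishes.

```python
import cmath
import math


def n_wave_intensity(n_waves, dphi, i1=1.0):
    """Closed form I = I1 * sin^2(N*dphi/2) / sin^2(dphi/2)."""
    s = math.sin(dphi / 2.0)
    if abs(s) < 1e-12:  # limiting value where the ratio becomes 0/0
        return i1 * n_waves ** 2
    return i1 * (math.sin(n_waves * dphi / 2.0) / s) ** 2


def phasor_sum_intensity(n_waves, dphi, a1=1.0):
    """Intensity from adding N phasors of amplitude a1 with a constant phase step."""
    total = sum(a1 * cmath.exp(1j * k * dphi) for k in range(n_waves))
    return abs(total) ** 2


# Assumed sample: N = 5 waves, phase step 0.7 rad.
print(n_wave_intensity(5, 0.7), phasor_sum_intensity(5, 0.7))
```

The two routes agree to rounding error for any phase step, including steps at which the resultant vanishes.
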

32.4.3 The principal maxima in the interference of $N$ waves
(32.4.2) are at points $M$ that comply with the condition
$\Delta\varphi_0 = \pm 2n\pi$, where $n = 0, 1, 2, \ldots$ is the order of the
principal maximum. The amplitude and intensity of vibration at the
principal maxima are:

$$A_{max} = NA_1 \quad \text{and} \quad I_{max} = N^2 I_1.$$

Fig. 32.9

Interference minima ($A = 0$) comply with the condition
$\Delta\varphi_0 = \pm 2\pi p/N$, where $p$ may be any positive whole number
except a multiple of $N$.

The nature of the dependence of $I/I_1$ on $\Delta\varphi_0$ (32.4.2) is shown
in Fig. 32.9. Between each pair of adjacent interference minima
there is a maximum, either a principal or a subordinate
maximum. At high $N$ values the intensities of the subordinate maxima
are negligibly small compared to that of the principal maxima.
The values $\Delta\varphi_0 = \pm(2n\pi \pm 2\pi/N)$ correspond to two minima
that bound a principal maximum of the $n$th order. Therefore,
the "width" of the principal maximum, equal to $4\pi/N$, is
inversely proportional to the number $N$ of interfering waves, and its
intensity is proportional to $N^2$. The fact that the interference
pattern changes in such a manner as $N$ is varied fully complies with
the law of conservation of energy. The total energy of vibration at
all points of the screen on which the interference pattern is
observed is proportional to $N$.

32.4.4 If the number $N$ of interfering waves (32.4.2) is increased
indefinitely, and their amplitudes $A_i$ and phase differences
$\Delta\varphi_0$ are correspondingly reduced so that $NA_i$ and $N\Delta\varphi_0$
remain finite quantities, equal to $A_0$ and $\Delta\varphi$, then, in the limit,
the vector diagram (Fig. 32.8) assumes the form illustrated in
Fig. 32.10. Vector $A$, the amplitude of the resultant vibration,
closes the arc $BC$ of a circle. The length of this arc is equal to $A_0$
and its central angle $\angle BOC = \Delta\varphi$. Therefore, the radius of
the circle $OB = A_0/\Delta\varphi$, and the amplitude $A$ and intensity $I$
of the resultant vibration equal

$$A = A_0 \left|\frac{\sin\dfrac{\Delta\varphi}{2}}{\dfrac{\Delta\varphi}{2}}\right| \quad \text{and} \quad I = I_0 \frac{\sin^2\dfrac{\Delta\varphi}{2}}{\left(\dfrac{\Delta\varphi}{2}\right)^2},$$

where $I_0 = A_0^2$.

Interference minima are at points of the interference pattern
for which

$$\Delta\varphi = \pm 2m\pi \quad (\text{where } m = 1, 2, \ldots).$$

Interference maxima are at points for which $\Delta\varphi = \pm 2k_m\pi$,
where $m = 0, 1, 2, \ldots$ is the order of the maximum. The values
of factor $k_m$ are determined from the transcendental equation
$\tan k_m\pi = k_m\pi$. For the central, zero-order maximum $k_0 = 0$
and $\Delta\varphi = 0$. The amplitude and intensity of vibration at a
zero-order maximum are equal to $A_0$ and $I_0$. For all other maxima
($m \geq 1$) it can be assumed approximately that

$$k_m = (2m + 1)/2 \quad \text{and} \quad \Delta\varphi = \pm(2m + 1)\pi.$$

Accordingly, the ratio of the intensities for the $m$th and
zero-order maxima equals

$$\frac{I_m}{I_0} = \frac{4}{(2m + 1)^2\pi^2}.$$

TABLE 32.1

Order of the maximum, m    0    1        2        3        4
I_m/I_0                    1    0.045    0.016    0.008    0.005

This ratio decreases rapidly with an increase of $m$ (Table 32.1).
The shape of the curve showing the dependence of $I$ on $\Delta\varphi$ is
shown in Fig. 32.11.

Fig. 32.11
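
The approximate ratio $I_m/I_0 = 4/((2m+1)^2\pi^2)$ can be compared with the value obtained by solving the transcendental equation $\tan x = x$ (with $x = k_m\pi = \Delta\varphi/2$) numerically. The sketch below uses plain bisection on the equivalent equation $\sin x - x\cos x = 0$, which has no singularities; the function names and the choice of bisection are assumptions made for illustration.

```python
import math


def approx_ratio(m):
    """Approximate I_m / I_0 = 4 / ((2m + 1)^2 * pi^2), for m >= 1."""
    return 4.0 / ((2 * m + 1) ** 2 * math.pi ** 2)


def exact_half_phase(m, iterations=60):
    """Root of tan x = x in (m*pi, (m + 1/2)*pi) for m >= 1, found by bisection."""
    def g(x):
        return math.sin(x) - x * math.cos(x)  # zero exactly where tan x = x
    lo, hi = m * math.pi, (m + 0.5) * math.pi
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if g(lo) * g(mid) <= 0.0:
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)


def exact_ratio(m):
    """I_m / I_0 = (sin x / x)^2 evaluated at the exact position of the maximum."""
    x = exact_half_phase(m)
    return (math.sin(x) / x) ** 2


for m in range(1, 5):
    print(m, round(approx_ratio(m), 4), round(exact_ratio(m), 4))
```

The approximation slightly underestimates the exact maxima, but the difference shrinks as the order increases.
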


CHAPTER  33  DIFFRACTION  OF  LIGHT 

33.1  Huygens-Fresnel  Principle 

33.1.1 If the position of the wavefront (30.2.3) is known at a
certain instant $t$ of time, as well as the velocity $v$ of the wave, the
position of the wavefront at the subsequent instant $t + \Delta t$ can
be determined on the basis of the Huygens principle. According
to this principle, each point on the surface $S(t)$, through which
the wavefront passes at the instant $t$, can be regarded as a source
of secondary waves, and the sought-for position $S(t + \Delta t)$ of
the wavefront at the instant of time $t + \Delta t$ coincides with the
envelope of all the secondary waves at this later time. It is
assumed here that in a homogeneous medium the secondary waves
radiate only forward, i.e. in directions making acute angles
with the outward normals to the wavefront. In a homogeneous
isotropic medium, the secondary waves are spherical (Fig. 33.1).

33.1.2 The Huygens principle can be used to derive laws for the
reflection and refraction of light at the interface between two
media. The line $MN$ in Fig. 33.2 represents the plane interface
between two media in which the velocities of light equal $v_1$
and $v_2$. Incident on this interface at the angle $i$ is a plane wave
(rays 1 and 2). At the instant $t$ of time the wavefront (plane
$AB$) reaches the interface at point $A$. Hence point $A$ begins
to radiate secondary waves, propagating both in the first medium
(reflected wave) and in the second medium (transmitted wave).
During the time $\Delta t$ required for the incident wave to travel the
distance $BC$ (where $\Delta t = BC/v_1$), the front of the secondary
wave, radiated from point $A$, reaches points of a hemisphere in
the first medium having the radius $R_1 = v_1\Delta t = BC$, and points
of a hemisphere in the second medium having the radius
$R_2 = v_2\Delta t = v_2 BC/v_1$. The front of the reflected wave (rays $1'$
and $2'$), propagating at the angle $i'$ of reflection, is the plane $DC$,
tangent to a hemisphere of radius $R_1$ with its centre at point $A$.
Correspondingly, the front of the transmitted (refracted) wave
(rays $1''$ and $2''$), propagating at the refraction angle $r$, is the
plane $CE$, tangent to a hemisphere of radius $R_2$ with its centre
at point $A$. The law of light reflection, $i = i'$, follows from the
congruence of triangles $ACD$ and $ACB$. The law of light refraction
follows from the right triangles $ACB$ and $ACE$, which
have a common hypotenuse. Thus

$$\frac{\sin i}{\sin r} = \frac{BC}{AE} = \frac{v_1}{v_2} = n_{21},$$

where $n_{21}$ is the relative refractive index of the second and
first media (31.5.1).
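
The geometric argument above can be followed step by step in code: take the common hypotenuse $AC$, find $BC$ from the angle of incidence, let the secondary wave from $A$ expand for the time $\Delta t = BC/v_1$, and read the refraction angle off the right triangle $ACE$. The function name and the sample velocities are assumptions for illustration, and returning `None` when no tangent plane exists is a choice of this sketch, not part of the text.

```python
import math


def refraction_angle(incidence_rad, v1, v2):
    """Refraction angle from the Huygens construction, or None if AE would exceed AC."""
    ac = 1.0                             # common hypotenuse AC (arbitrary length)
    bc = ac * math.sin(incidence_rad)    # path still to be covered by the incident front
    dt = bc / v1                         # time for the incident wave to travel BC
    ae = v2 * dt                         # radius R2 of the secondary wave in medium 2
    if ae > ac:
        return None                      # no tangent plane CE: no refracted wave
    return math.asin(ae / ac)            # sin r = AE / AC


# Assumed example: light passing from vacuum (3e8 m/s) into glass (2e8 m/s) at 30 degrees.
r = refraction_angle(math.radians(30.0), 3.0e8, 2.0e8)
print(f"r = {math.degrees(r):.2f} degrees")
print(math.sin(math.radians(30.0)) / math.sin(r))  # equals v1 / v2 = n21
```
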

33.1.3 The Huygens principle is a purely geometrical one. It
indicates no method of calculating the amplitude of the wave
that is an envelope of the secondary waves. Therefore, the
Huygens principle is insufficient for deriving the laws of
propagation of light waves. An approximate method of solving this
problem, which is a development of the Huygens principle on
the basis of the idea proposed by Fresnel on the coherence of
secondary waves and their interference upon superposition, is
called the Huygens-Fresnel principle. This principle can be
expressed by the following series of propositions:

(a)  In  calculating  the  amplitude  of  light  vibration,  set  up  by 
source  S0,  at  arbitrary  point  M,  the  source  can  be  replaced  by 
a  system  of  secondary  sources.  These  sources  are  small  portions 
ds  of  any  closed  auxiliary  surface  S  arranged  so  that  it  encloses 
the  source  S0,  but  does  not  enclose  point  M  being  considered. 

(b)  The  secondary  sources  are  coherent  with  S0  and  with  one 
another.  Hence,  the  secondary  waves  they  set  up  interfere  upon 
superposition.  Interference  calculations  are  simplest  when  S 
is the wavefront for the light from source S0 because, in this
case,  the  phases  of  vibration  of  all  secondary  sources  are  the 
same. 

(c)  The  amplitude  dA  of  the  vibration  set  up  at  point  M  by 
a  secondary  source  is  proportional  to  the  ratio  of  the  area  ds 
of  the  corresponding  portion  of  wavefront  S  to  the  distance  r 
from this portion to point M and depends upon the angle $\alpha$
between the outward normal to the wavefront and the direction
of a line from the element ds of area to point M.

Thus

$$dA = f(\alpha)\,\frac{a\,ds}{r},$$

where $a$ is a quantity proportional to the amplitude of the primary
wave at points of the element ds of area; $f(\alpha)$ decreases
monotonically from 1 at $\alpha = 0$ to 0 at $\alpha \geq \pi/2$ (secondary
sources do not radiate backward)*.

(d)  If  part  of  surface  S  is  an  opaque  screen,  the  corresponding 
secondary  sources  (closed  by  the  screen)  do  not  radiate,  whereas 
the  rest  radiate  in  the  same  way  as  in  the  absence  of  screens.** 


* As shown by Kirchhoff, $f(\alpha) \propto (1 + \cos\alpha)$, i.e. the function
becomes zero only at $\alpha = \pi$; at small angles of diffraction $\alpha$,
however, this more exact relation is of no significance.

**  Actually,  the  material  of  the  opaque  screen  has  an  effect 
on  the  radiation  of  the  open  secondary  sources  that  are  close  to 
the  edge  of  the  screen  (at  a  distance  of  the  order  of  the  wavelength 
of the light).

33.1.4 On the basis of the Huygens-Fresnel principle, the law
of  rectilinear  propagation  of  light  in  a  homogeneous  medium  can 
be  substantiated  from  the  wave  point  of  view.  Assume  that  S0  is 
a  point  source  of  monochromatic  light  (Fig.  33.3)  and  M  is  the 
point  of  observation.  As  the  auxiliary  surface  S,  a  spherical 
surface  of  radius  R  is  chosen  so  that  the  distance  L  from  point  M 
to  this  sphere  (L  =  OM)  is  of  the  order  of  R.  Next,  surface  S 


is  divided  into  annular  portions  of  small  area,  called  Fresnel 
zones, as shown in Fig. 33.3, where $\lambda$ is the wavelength of the
light.

Vibrations set up at point M by two adjacent zones are opposite
in phase, because the path difference from like points of these
zones to point M is equal to $\lambda/2$. Consequently, the amplitude of
the resultant vibration at point M is equal to
$A = A_1 - A_2 + A_3 - A_4 + \ldots$, where $A_i$ is the amplitude of
vibration set up at point M by secondary sources located within the
limits of the $i$th zone. At $i \ll L/\lambda$, the areas of all the zones
are the same: $\sigma_i = \pi R L \lambda/(R + L)$, and very small (at
$R = L = 10$ cm and $\lambda = 5 \times 10^{-5}$ cm we obtain
$\sigma_i \approx 8 \times 10^{-4}$ cm$^2$). With an
increase in $i$, the distance $r_i$ from the zone to point M also
increases, as does the angle $\alpha_i$ between the normal to the surface of
the zone and the direction of a line from the zone to point M.
Therefore, according to the Huygens-Fresnel principle,
$A_1 > A_2 > A_3 > \ldots$ and $A_i \approx (A_{i-1} + A_{i+1})/2$. Consequently,
the amplitude of vibration at point M is equal to $A \approx A_1/2$,
i.e. the resultant effect of the whole open wavefront is equal to
one-half of the effect of the first (central) Fresnel zone, whose
radius is very small. Thus, it can be assumed that, in practice,
light is propagated from S0 to M in a straight line.
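
The two numerical claims of this section can be checked with a short
sketch. It evaluates the zone area $\sigma_i = \pi R L \lambda/(R + L)$ for the
values quoted in the text and sums the alternating series for a purely
hypothetical set of zone amplitudes that fall off linearly, so that
$A_i = (A_{i-1} + A_{i+1})/2$ holds exactly. The linear fall-off and the
function names are assumptions made only for illustration.

```python
import math

def zone_area(R, L, lam):
    """Area of one Fresnel zone, pi*R*L*lam/(R + L), valid for i << L/lam."""
    return math.pi * R * L * lam / (R + L)

def resultant_amplitude(zone_amplitudes):
    """Alternating sum A = A1 - A2 + A3 - ... over the open zones."""
    return sum(a if k % 2 == 0 else -a
               for k, a in enumerate(zone_amplitudes))

# Hypothetical amplitudes decreasing linearly from A1 = 1 to 0.
amplitudes = [1.0 - k / 1000 for k in range(1001)]

print(zone_area(10.0, 10.0, 5e-5))       # cm^2, for R = L = 10 cm
print(resultant_amplitude(amplitudes))   # close to A1/2
```

With these assumed amplitudes the sum comes out at one-half of the
first term, in line with $A \approx A_1/2$.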

33.1.5 If an opaque screen covering all the Fresnel zones except the
first for the point M of observation is placed in the path of
monochromatic light from source S0, the amplitude and intensity
of light at point M are doubled and quadrupled, respectively,
compared to their values in the absence of the screen: $A = A_1$
and $I = A_1^2$. Light can be intensified to a substantially greater
extent by means of a zone plate. This is a glass plate to which an
opaque coating has been applied in the form of rings which
cover only the even (or only the odd) Fresnel zones, i.e. every
other zone. A zone plate has the same effect on light as a converging
lens.


33.2  Fresnel  Diffraction 

33.2.1  The  diffraction  of  light  is  the  whole  set  of  phenomena  due 
to the wave nature of light and observed in its propagation in
a  medium  with  sharply  defined  optical  inhomogeneities  (for 
example,  in  passing  through  apertures  in  opaque  screens  or 
close  to  the  boundaries  of  opaque  bodies).  In  its  narrower  sense, 
diffraction refers to the bending of light around small opaque
obstacles,  i.e.  deviations  from  the  laws  of  geometrical  optics. 
Distinction  is  made  between  two  kinds  of  light  diffraction: 
Fresnel  diffraction,  or  diffraction  with  convergent  rays,  and  Fraun¬ 
hofer  diffraction,  or  diffraction  with  parallel  rays.  In  the  former, 
a  spherical  or  plane  wave  is  incident  on  the  obstacle  and  the 
diffraction pattern is observed on a screen at a finite distance
behind  the  obstacle.  In  the  latter,  a  plane  wave  is  incident  on 
the  obstacle  and  the  diffraction  pattern  is  observed  on  a  screen 
located  in  the  focal  plane  of  a  converging  lens  placed  in  the 
path  of  the  light  that  passes  by  the  obstacle.  In  Fresnel  diffrac¬ 
tion,  a  “diffraction  image”  of  the  obstacle  is  obtained  on  the 
screen;  in  Fraunhofer  diffraction,  the  “diffraction  image”  is  of  the 
distant  light  source. 

33.2.2 In the simplest Fresnel diffraction problems, the diffraction
pattern can be investigated by using the method of Fresnel
zones (33.1.4).

Example 1. Fresnel diffraction by a small circular aperture in an
opaque screen AB (Fig. 33.4). When the aperture is illuminated
by monochromatic light of wavelength $\lambda$, a system of alternating
dark and bright interference rings is observed on screen E,
which is parallel to AB. These rings have a common centre at
point O, located opposite the centre of the aperture. If for point O
an even number $2k$ (where $k = 1, 2, \ldots$) of Fresnel zones fit into
the aperture, there is a dark spot at point O; the amplitude of
light at point O is less than in the absence of a screen:

$$A \approx \tfrac{1}{2}(A_1 - A_{2k}) < A_1/2,$$

where $A_i$ is the amplitude corresponding to only the $i$th zone. If
the number of zones is odd ($2k + 1$), there is a bright spot
at point O and

$$A \approx \tfrac{1}{2}(A_1 + A_{2k+1}) > A_1/2.$$

If the aperture is illuminated by white light, a system of concentric
coloured rings is observed on screen E.

The number of Fresnel zones that fit into the aperture and the
degree of contrast of the interference pattern depend upon the
ratio of the diameter $d$ of the aperture to the distance $l$ between
screens AB and E. As $d/l$ is increased, the amplitude of light at
the centre of screen E approaches $A_1/2$ and the degree of contrast
of the interference rings is reduced.
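
A rough way to decide whether the centre of the pattern is bright or
dark is to count the zones that fit into the aperture. The sketch below
does this by dividing the aperture area by the zone area
$\pi R l \lambda/(R + l)$ of Sect. 33.1.4. This equal-area estimate, the
assumption of a point source at distance $R$ in front of the aperture, and
the function names are illustrative and are not stated in this form in
the text.

```python
def zones_in_aperture(d, R, l, lam):
    """Approximate number of Fresnel zones open in an aperture.

    d   -- aperture diameter
    R   -- distance from the (assumed) point source to the aperture
    l   -- distance from the aperture to the observation screen
    lam -- wavelength (all lengths in the same units)
    """
    rho = d / 2.0
    return rho ** 2 * (R + l) / (R * l * lam)

def centre_is_bright(zone_count):
    """Odd number of open zones -> bright centre, even -> dark centre."""
    return round(zone_count) % 2 == 1

m = zones_in_aperture(0.2, 100.0, 100.0, 5e-5)   # lengths in cm
print(m, centre_is_bright(m))
```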

33.2.3 Example 2. Fresnel diffraction by a small disk (opaque
round screen). The method of constructing open Fresnel zones
on the wavefront S of the incident monochromatic spherical
wave is illustrated in Fig. 33.5. The interference pattern on
screen E consists of alternating bright and dark rings with their
centre at point O, where the interference maximum (Poisson's
spot) is always observed. The amplitude of light at point O is
one-half of the amplitude $A_1$ corresponding to the effect at this
point of only the first open Fresnel zone: $A = A_1/2$.
When  the  disk  is  illuminated  by  white  light,  a  white  spot  is 
observed  at  the  centre  of  the  screen.  This  spot  is  surrounded  by 
a  system  of  concentric  coloured  rings. 

As  the  ratio  of  the  diameter  d  of  the  disk  to  its  distance  l  from 
screen  E  is  increased,  the  brightness  of  Poisson’s  spot  is  gradually 
reduced  and  the  next  dark  ring  becomes  broader,  forming  a  re¬ 
gion  of  shadow  beyond  the  disk. 


33.3  Fraunhofer  Diffraction 

33.3.1 Example 1. Diffraction of light by a narrow long slit
in an opaque screen (Fig. 33.6). The slit width BC = $b$, and its
length in the direction perpendicular to the plane of the figure
is $l \gg b$. Light is normally incident on the surface of the slit so
that vibrations at all points of the slit are in phase. The
diffraction pattern is observed on screen E, which is placed in the
focal plane of converging lens L. Parallel rays BM and CN,
travelling from the edges of the slit at the diffraction angle $\psi$
to the direction of incident light, are collected by the lens at its
secondary focus $F_\psi$. Lenses are made so that the paths of the
rays of light $BMF_\psi$ and $DNF_\psi$, where D is the foot of the
perpendicular dropped from point B to ray CN, are tautochronous
paths (32.3.2). Therefore, the result of the interference at point
$F_\psi$ of the screen depends upon the path difference
$CD = b \sin\psi$ and the wavelength $\lambda$ of the light.

(a) Approximate solution. The slit can be divided in width into
Fresnel zones, having the form of strips parallel to edge B, such
that the path difference from the edges of each strip is equal
to $\lambda/2$. The number of such Fresnel zones that fit into the slit
is equal to $2b\,|\sin\psi|/\lambda$. All the zones radiate light entirely
equally in the direction being considered, and the vibrations
set up at point $F_\psi$ by two adjacent zones are equal in amplitude
and opposite in phase. Therefore, when the number of zones is even,

$$b \sin\psi = \pm 2m\,\frac{\lambda}{2}, \quad \text{where } m = 1, 2, \ldots,$$

a diffraction minimum is observed (complete darkness).
When the number of zones is odd,

$$b \sin\psi = \pm (2m + 1)\,\frac{\lambda}{2}, \quad \text{where } m = 1, 2, \ldots,$$

a diffraction maximum, corresponding to the effect of one Fresnel
zone, is observed. The brightest central maximum is observed at the
principal focus $F_0$ of the lens ($\psi = 0$). With an increase in $m$ the
width of the Fresnel zones and the intensity of the maxima are
rapidly reduced.

(b) Exact solution. The slit is divided into a very great number
of identical very narrow strips, parallel to edge B. Secondary
waves radiated by these elements of the slit set up vibrations at
point $F_\psi$ that have the same small amplitudes, and their
initial phases continuously fill an interval of width
$\Delta\varphi = (2\pi b \sin\psi)/\lambda$. According to Sect. 32.4.4, the
amplitude and intensity of light at point $F_\psi$ equal

$$A_\psi = A_0 \left| \frac{\sin(\pi b \sin\psi/\lambda)}{\pi b \sin\psi/\lambda} \right|
\quad \text{and} \quad
I_\psi = I_0\,\frac{\sin^2(\pi b \sin\psi/\lambda)}{(\pi b \sin\psi/\lambda)^2},$$

where $A_0$ and $I_0$ are the amplitude and intensity at the central
maximum ($\psi = 0$).

The condition for diffraction minima of light is the same as in
the approximate solution: $b \sin\psi = \pm 2m\lambda/2$, where
$m = 1, 2, \ldots$. The condition for diffraction maxima is of the
form

$$\tan\frac{\pi b \sin\psi}{\lambda} = \frac{\pi b \sin\psi}{\lambda},$$

and it differs only slightly from the condition obtained by means
of the Fresnel zones method.
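
The exact single-slit result can be explored numerically. The sketch
below evaluates $I_\psi/I_0$ and finds the first few positive roots of
$\tan u = u$, with $u = \pi b \sin\psi/\lambda$, so that they can be compared
with the zone-method estimate $u = (2m + 1)\pi/2$. The bisection solver,
the function names and the sample slit width are choices made for this
illustration, not part of the handbook.

```python
import math

def slit_intensity(psi, b, lam, I0=1.0):
    """I_psi = I0 * sin^2(u)/u^2 with u = pi*b*sin(psi)/lam."""
    u = math.pi * b * math.sin(psi) / lam
    if abs(u) < 1e-12:          # central maximum, limit sin(u)/u -> 1
        return I0
    return I0 * (math.sin(u) / u) ** 2

def secondary_maximum_u(m, tol=1e-12):
    """m-th positive root of tan(u) = u (m = 1, 2, ...).

    Bisection on g(u) = sin(u) - u*cos(u), which changes sign
    between m*pi and (m + 1/2)*pi.
    """
    def g(u):
        return math.sin(u) - u * math.cos(u)
    lo, hi = m * math.pi, (m + 0.5) * math.pi
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if g(lo) * g(mid) <= 0:
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)

for m in (1, 2, 3):
    exact = secondary_maximum_u(m)
    zone_estimate = (2 * m + 1) * math.pi / 2
    print(m, exact, zone_estimate)

# Assumed sample values: b = 5e-6 m, lam = 5e-7 m, first minimum
print(slit_intensity(math.asin(0.1), 5e-6, 5e-7))
```

The exact roots lie slightly below the zone-method values, and the
difference shrinks as $m$ grows.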

33.3.2 If white light, instead of monochromatic light, falls on
the slit, the central maximum is white with iridescent colouring
along its edges. All the other interference fringes are coloured
because different angles $\psi$ and different points on the screen
correspond to both minima and maxima of the same orders $m$,
depending upon the wavelength $\lambda$.

As the width $b$ of the slit is reduced, the width of the central
maximum is increased because of the increase of angles
$\psi_1 = \pm\arcsin(\lambda/b)$, which correspond to minima of the first
order, limiting the central maximum. At $b < \lambda$ the illumination of
the screen is reduced monotonically from the centre (point $F_0$)
toward the edges. When the slit is very wide ($b \gg \lambda$), a bright
and sharp image of the light source, formed by lens L according
to the laws of geometrical optics, is seen on the screen.

33.3.3 Example 2. Diffraction of light by a circular aperture.
A plane monochromatic light wave is normally incident on the
aperture, i.e. perpendicular to the plane of the aperture. The
diffraction pattern is observed in the focal plane of a converging
lens arranged beyond the aperture so that its optical axis is
perpendicular to the plane of the aperture. The diffraction pattern
has the form of a bright spot at the principal focus $F_0$
of the lens and, concentric about this spot, alternating dark and
bright rings. The intensity of the bright rings is very low compared
to that, $I_0$, of the central maximum, and decreases with an
increase in their radius. For instance, the intensity of the
first-order maximum nearest to the central one is $I_1 < 0.02\,I_0$.
The diffraction angle $\psi_1$, corresponding to the first dark ring,
bounding the central maximum, complies with the condition

$$\sin\psi_1 = 1.22\,\frac{\lambda}{D},$$

where $D$ is the diameter of the aperture and $\lambda$ is the wavelength
of the light.

If light falls on the aperture at a small angle $\alpha$ to the normal to
the plane of the aperture, the interference pattern remains practically
unchanged, but its centre is displaced to a secondary
focus of the lens, corresponding to the angle $\psi = \alpha$.

33.3.4 Example 3. Diffraction of light by a one-dimensional
grating. A one-dimensional diffraction grating is a system of
a large number $N$ of parallel slits in a screen, the slits being
identical in width and separated by opaque strips, also of equal
width. Shown in Fig. 33.7 are only two adjacent slits of the
grating. The quantity $d = a + b$, where $a$ = CD is the width of an
opaque strip between two slits and $b$ = BC is the slit width, is
called the diffraction grating constant, or interval.

In analysing the diffraction pattern on screen E, mounted in
the focal plane of converging lens L, it is necessary to take
into account the interference of the secondary waves not only
from different portions of one slit, but from the different slits
as well. If a plane monochromatic wave is normally incident
on the grating, the vibrations at all points of the slits are in
phase.

Vibrations set up at the arbitrary point $F_\psi$ of the focal plane
of lens L by each of the slits coincide in amplitude ($A_1$) and
differ in phase. For each pair of adjacent slits the phase difference
$\Delta\varphi_0$ between these vibrations is the same. It depends upon
the wavelength $\lambda$ and the path difference from like points of the
slits (for instance, points B and D), i.e. on the length
$KD = d \sin\psi$, where K is the foot of the perpendicular dropped
from point B to the ray DN. Thus

$$\Delta\varphi_0 = \frac{2\pi d}{\lambda}\,\sin\psi.$$

Therefore, according to Sects. 32.4.2 and 33.3.1, the amplitude
and intensity of the resultant vibration at point $F_\psi$ equal

$$A_\psi = A_0 \left| \frac{\sin(\pi b \sin\psi/\lambda)}{\pi b \sin\psi/\lambda}
\cdot \frac{\sin(\pi N d \sin\psi/\lambda)}{\sin(\pi d \sin\psi/\lambda)} \right|$$

and

$$I_\psi = I_0\,\frac{\sin^2(\pi b \sin\psi/\lambda)}{(\pi b \sin\psi/\lambda)^2}
\cdot \frac{\sin^2(\pi N d \sin\psi/\lambda)}{\sin^2(\pi d \sin\psi/\lambda)},$$

where $A_0$ and $I_0$ are the amplitude and intensity of the vibration
at point $F_0$ (i.e. at $\psi = 0$) due to the effect of a single slit.
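
The grating intensity formula can be evaluated directly. The following
is a minimal sketch; the function name, the tolerance used to handle the
0/0 limits at $\psi = 0$ and at the principal maxima, and the sample
grating parameters are assumptions made for the example.

```python
import math

def grating_intensity(psi, b, d, N, lam, I0=1.0):
    """I_psi for N slits of width b and period d (normal incidence)."""
    s = math.sin(psi)
    u = math.pi * b * s / lam      # single-slit argument
    v = math.pi * d * s / lam      # half the phase step between slits
    single = 1.0 if abs(u) < 1e-9 else (math.sin(u) / u) ** 2
    den = math.sin(v)
    if abs(den) < 1e-9:
        multi = float(N * N)       # limit of sin^2(N v)/sin^2(v)
    else:
        multi = (math.sin(N * v) / den) ** 2
    return I0 * single * multi

# Assumed sample grating: b = 1 um, d = 3 um, N = 10, lam = 0.5 um
b, d, N, lam = 1e-6, 3e-6, 10, 5e-7
print(grating_intensity(0.0, b, d, N, lam))                      # zero order
print(grating_intensity(math.asin(lam / d), b, d, N, lam))       # first order
print(grating_intensity(math.asin(lam / (N * d)), b, d, N, lam)) # near zero
```

In units of the single-slit central intensity $I_0$, the zero-order
maximum equals $N^2$, and the higher orders are weighted by the
single-slit factor.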

33.3.5 The principal minima in the diffraction of light by
a diffraction grating occur at diffraction angles $\psi$ corresponding
to interference minima in diffraction by a single slit (33.3.1).
Thus

$$b \sin\psi = \pm m\lambda \quad (m = 1, 2, \ldots).$$

In these directions none of the slits provide light ("each slit
extinguishes itself").

The principal maxima (32.4.3) correspond to diffraction angles $\psi$
that comply with the condition

$$d \sin\psi = \pm n\lambda,$$

where $n = 0, 1, 2, \ldots$ is the order of the principal maximum.
Note. If certain values of $\psi$ simultaneously comply with the
conditions for both principal maxima and principal minima,
then the principal maxima corresponding to these values of $\psi$
are not observed. For example, if $d = 2b$, then all the even
principal maxima ($n = 2, 4, 6$, etc.) are absent.

The intensity of a principal maximum of the $n$th order is

$$I_n = I_0 \left( \frac{Nd}{\pi n b} \right)^2 \sin^2\frac{\pi n b}{d}.$$
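
The note on missing orders follows directly from the expression for
$I_n$: whenever $nb/d$ is a whole number the sine vanishes. A short
sketch, with assumed function names and with the $n = 0$ case taken as
the limit $I_0 N^2$, lists the absent orders for a given ratio $d/b$.

```python
import math

def principal_max_intensity(n, N, b, d, I0=1.0):
    """I_n = I0 * (N*d/(pi*n*b))^2 * sin^2(pi*n*b/d); n = 0 is the limit I0*N^2."""
    if n == 0:
        return I0 * N ** 2
    x = math.pi * n * b / d
    return I0 * (N / x) ** 2 * math.sin(x) ** 2

def missing_orders(b, d, n_max, tol=1e-9):
    """Orders 1..n_max whose intensity vanishes (to within tol, for N = 1)."""
    return [n for n in range(1, n_max + 1)
            if principal_max_intensity(n, 1, b, d) < tol]

print(missing_orders(1.0, 2.0, 6))   # the case d = 2b from the note
print(missing_orders(1.0, 3.0, 9))   # d = 3b, for comparison
```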


33.3.6 Between each two principal maxima, there are $N - 1$
additional minima that comply with the condition

$$d \sin\psi = \pm\frac{p\lambda}{N},$$

where $p$ may be any positive whole number with the exception
of $N$, $2N$, $3N$, etc. Accordingly, there are $N - 2$ additional
maxima whose intensity is negligibly low compared to that of
the principal maxima.

The angular "width" of a principal maximum of the $n$th order,
i.e. the difference between the values of the angles $\psi$ corresponding
to the additional minima bounding this maximum, is

$$\Delta\psi_n = \frac{2\lambda}{Nd \cos\psi_n} = \frac{2\lambda}{L \cos\psi_n},$$

where $\psi_n = \arcsin(n\lambda/d)$, and $L = Nd$ is the length of the
diffraction grating. For principal maxima of not particularly
high orders, angles $\psi_n$ are small and $\cos\psi_n \approx 1$, so that
$\Delta\psi_n \approx 2\lambda/L$.
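
For a quick estimate of how sharp the principal maxima are, the width
formula can be coded as follows. This is only a sketch; the function
name and the sample grating (1000 slits, 2 um period, 0.5 um light) are
assumptions.

```python
import math

def principal_max_width(n, N, d, lam):
    """Angular width 2*lam/(N*d*cos(psi_n)) of the n-th principal maximum, in radians."""
    s = n * lam / d
    if abs(s) >= 1.0:
        raise ValueError("order n is not observable: n*lam/d >= 1")
    psi_n = math.asin(s)
    return 2.0 * lam / (N * d * math.cos(psi_n))

for n in (0, 1, 2):
    print(n, principal_max_width(n, 1000, 2e-6, 5e-7))
```

The width grows slowly with the order and is inversely proportional to
the grating length $L = Nd$.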

33.3.7 With monochromatic light the diffraction pattern on
screen E in the focal plane of lens L (Fig. 33.7) has, at large $N$
values, the form of narrow and bright principal maxima separated
by practically dark and wide intervals. If the ratio $d/b$
is an irrational number, the intensities of the principal maxima
(33.3.5) decrease monotonically with an increase in the
order $n$ of the maximum.

When the grating is illuminated by white light, an uncoloured
central zero-order maximum is observed on the screen. On both
sides of this maximum, diffraction spectra of the 1st, 2nd, etc.
orders are observed. These spectra appear as iridescent bands in
which there is a continuous transition in colour from blue-violet
at the inner edge of the spectrum to red at the outer
edge.

33.3.8 Upon inclined incidence of light on a diffraction grating
(Fig. 33.8), the condition for the principal maxima is of the
form

$$d\,(\sin\psi - \sin i) = \pm n\lambda \quad (n = 0, 1, 2, \ldots),$$

where $i$ is the angle of incidence of light on the surface of the
grating. Frequently, the directions of the rays of light falling
on the grating and diffracted from it are specified by the angles
$\alpha_0$ and $\alpha$ which these rays make with the axis OX of coordinates,
arranged in the plane of the grating, perpendicular to the slits.
In this case, the condition for the principal maxima can be
rewritten in the form

$$d\,(\cos\alpha - \cos\alpha_0) = \pm n\lambda.$$


33.3.9  Two  screens  are  said  to  be  complementary  if  the  apertures 
in  one  of  them  exactly  coincide  in  shape,  size  and  relative  posi¬ 
tion  with  the  opaque  areas  in  the  other  screen,  and  vice  versa. 
Complementary,  for  example,  are  an  opaque  screen  in  the 
form of a circle (disk) of radius R and an opaque screen having
a circular aperture of the same radius R. On the basis of the
Huygens-Fresnel principle, it is possible to prove Babinet's
theorem  (also  called  Babinet's  principle).  This  theorem  states 
that  in  Fraunhofer  diffraction  by  any  screen,  the  intensity  of 
diffracted  light  in  any  direction,  except  that  of  the  propagation 
of  a  plane  wave  falling  on  the  screen,  should  be  the  same  as  in 
diffraction  by  the  complementary  screen. 

33.3.10 Example 4. Diffraction by a large number of identical
and identically oriented obstacles. The light intensity $I$ at an
arbitrary point M of the diffraction pattern, as in diffraction
by a one-dimensional grating (33.3.4), can be presented in the
form $I = f I_1$. Here $I_1$ is the intensity at point M upon diffraction
of the same incident plane wave by a single obstacle. The
function $f$ depends only upon the number and relative arrangement
of the obstacles.

If the obstacles are arranged quite randomly with respect to one
another and their number $N$ is large, then $f \approx N$. In this case,
the distribution of light intensity is the same as in diffraction
by a single obstacle, but the intensity at each point of the
diffraction pattern is $N$ times as much. For example, in diffraction
of light by a glass plate coated with spherical dust particles
of diameter $d$, a system of bright concentric interference
rings is observed. The sizes of these rings correspond to the
diffraction of light by an opaque disk of diameter $d$ or, in accordance
with Babinet's principle (33.3.9), by a round aperture of
the same diameter.


33.4 Diffraction by a Space Lattice

33.4.1 A space, or three-dimensional, diffraction grating is an
optically nonhomogeneous medium in which the inhomogeneities
are periodically repeated along all three spatial coordinates.
An example of a space diffraction grating is the crystal lattice
of a crystalline solid. Particles at the lattice points (atoms,
molecules or ions) serve as orderly arranged centres that coherently
scatter light falling on them. Assume that $d_1$, $d_2$ and $d_3$
are the lattice constants along the three coordinate axes $\xi$, $\eta$
and $\zeta$ that are oriented along three edges of the lattice intersecting
in one of its points. Then in Fraunhofer diffraction (33.2.1),
the principal maxima comply with the Laue conditions

$$d_1(\cos\alpha - \cos\alpha_0) = n_1\lambda, \quad
d_2(\cos\beta - \cos\beta_0) = n_2\lambda \quad \text{and} \quad
d_3(\cos\gamma - \cos\gamma_0) = n_3\lambda.$$

Here $\alpha_0$, $\beta_0$ and $\gamma_0$, and $\alpha$, $\beta$ and $\gamma$ are the angles
between the coordinate axes $\xi$, $\eta$ and $\zeta$, and the directions of
propagation of the incident and diffracted rays, respectively;
$n_1$, $n_2$ and $n_3$ are whole numbers determining the order of the
maximum, and $\lambda$ is the wavelength of the light. The Laue conditions
follow from the relations (33.3.8) for diffraction maxima in inclined
incidence of light on a one-dimensional diffraction grating.

33.4.2 Of the three angles $\alpha$, $\beta$ and $\gamma$ (and likewise for angles
$\alpha_0$, $\beta_0$ and $\gamma_0$) only two angles are independent because they
must comply with one geometric relationship whose specific
form depends upon the angles between the coordinate axes $\xi$, $\eta$
and $\zeta$. For example, if the coordinate axes are mutually
perpendicular, i.e. if the lattice is orthogonal, the geometric
relationship between $\alpha$, $\beta$ and $\gamma$ is of the form:

$$\cos^2\alpha + \cos^2\beta + \cos^2\gamma = 1.$$

Therefore, with an arbitrarily given direction of incidence on
a space lattice of monochromatic light with a given wavelength $\lambda$,
it is impossible, in general, to find values of $\alpha$, $\beta$ and $\gamma$ that
simultaneously comply with the geometric relationship and
Laue's three conditions. The only exception is the zero-order
maximum ($n_1 = n_2 = n_3 = 0$), for which $\alpha = \alpha_0$, $\beta = \beta_0$
and $\gamma = \gamma_0$.

In order to observe a diffraction maximum of the order ($n_1$, $n_2$
and $n_3$) at given values of the angles $\alpha_0$, $\beta_0$ and $\gamma_0$, it is
necessary for the incident light to have a certain definite wavelength.
For instance, in the case of an orthogonal lattice, the wavelength
should be

$$\lambda = -2\,\frac{\dfrac{n_1}{d_1}\cos\alpha_0 + \dfrac{n_2}{d_2}\cos\beta_0 + \dfrac{n_3}{d_3}\cos\gamma_0}
{\left(\dfrac{n_1}{d_1}\right)^2 + \left(\dfrac{n_2}{d_2}\right)^2 + \left(\dfrac{n_3}{d_3}\right)^2}.$$

If the wavelength $\lambda$ of the incident light is a fixed value, the
Laue conditions and the geometric relationship between angles $\alpha$,
$\beta$ and $\gamma$ can simultaneously be complied with by a suitable
choice of the direction of light incidence on the diffraction
grating, i.e. of angles $\alpha_0$, $\beta_0$ and $\gamma_0$.
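
The wavelength formula for an orthogonal lattice can be checked by
substituting the result back into the Laue conditions and the relation
$\cos^2\alpha + \cos^2\beta + \cos^2\gamma = 1$. The sketch below does
this for an assumed cubic lattice with unit spacing and light incident
along the $\xi$ axis; the function names and sample values are
illustrative only.

```python
def laue_wavelength(n, d, cos0):
    """Wavelength giving the maximum of order n = (n1, n2, n3).

    d    -- lattice constants (d1, d2, d3) of an orthogonal lattice
    cos0 -- direction cosines (cos a0, cos b0, cos g0) of the incident ray
    A non-positive result means this order cannot be observed
    for the chosen direction of incidence.
    """
    num = sum(ni / di * ci for ni, di, ci in zip(n, d, cos0))
    den = sum((ni / di) ** 2 for ni, di in zip(n, d))
    if den == 0:
        raise ValueError("zero order: the conditions hold for any wavelength")
    return -2.0 * num / den

def diffracted_cosines(n, d, cos0, lam):
    """Direction cosines of the diffracted ray from the Laue conditions."""
    return [ci + ni * lam / di for ni, di, ci in zip(n, d, cos0)]

n, d, cos0 = (-1, 1, 0), (1.0, 1.0, 1.0), (1.0, 0.0, 0.0)
lam = laue_wavelength(n, d, cos0)
cosines = diffracted_cosines(n, d, cos0, lam)
print(lam, cosines, sum(c * c for c in cosines))
```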

33.4.3 It follows from Laue's conditions that at $\lambda > 2d_{max}$,
where $d_{max}$ is the maximum of the values of $d_1$, $d_2$ and $d_3$,
there should be no maxima, except the zero-order one
($n_1 = n_2 = n_3 = 0$). Light of such wavelengths propagates in a medium
"without noticing" its inhomogeneities, i.e. without undergoing
diffraction. Hence, the condition $\lambda > 2d_{max}$ is called the
condition for optical homogeneity of the medium.

The crystal lattice constants of solids are considerably shorter
than the wavelengths of visible light ($d \approx 5 \times 10^{-10}$ m, whereas
$\lambda_{vis} \approx 5 \times 10^{-7}$ m). Hence, crystals are optically
homogeneous media for visible light.* At the same time, crystals are
natural diffraction lattices for X rays (31.4.4).

33.4.4 The diffraction of X rays by crystals can be interpreted as
the result of interference of X rays that are mirror reflected from
a system of parallel planes passing through the crystal lattice
points. They are called planes of atoms, or lattice planes, of the
crystal. The distance $d$ between two adjacent lattice planes is
called the interplanar spacing, or distance, and the angle $\theta$
between the incident ray and the lattice plane (Fig. 33.9) is called
the grazing angle. The path difference of rays reflected from two
adjacent lattice planes** is $\Delta = BC + BD = 2d \sin\theta$. Therefore,
according to Sect. 32.2.1, reflection is observed only in the
directions corresponding to diffraction maxima that comply
with the Wulff-Bragg condition:

$$2d \sin\theta = m\lambda,$$

where $m = 1, 2, \ldots$ is the order of the diffraction maximum.

* Molecular scattering of visible light (34.3.1) is possible in
crystals. It is associated with the impairment of their optical
homogeneity due to fluctuations in density.

** X rays are not refracted in crystals because the values of the
refractive index (31.5.1) of all crystals for electromagnetic
radiation of such high frequency are practically equal to unity.
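
As an illustration of the Wulff-Bragg condition, the sketch below lists
every grazing angle at which reflection is possible for a given
interplanar spacing and wavelength. The function name and the sample
values ($d = 2.0 \times 10^{-10}$ m, $\lambda = 1.54 \times 10^{-10}$ m) are
assumptions chosen for the example.

```python
import math

def bragg_angles(d, lam):
    """Grazing angles (degrees) satisfying 2*d*sin(theta) = m*lam, m = 1, 2, ..."""
    if d <= 0 or lam <= 0:
        raise ValueError("d and lam must be positive")
    angles = []
    m = 1
    while m * lam <= 2.0 * d:          # sin(theta) cannot exceed 1
        angles.append(math.degrees(math.asin(m * lam / (2.0 * d))))
        m += 1
    return angles

print(bragg_angles(2.0e-10, 1.54e-10))
```

The list is empty when $\lambda > 2d$, which is the single-spacing form of
the optical homogeneity condition of Sect. 33.4.3.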

33.4.5 The experimental method of investigating the atomic
structure of matter by analysing the diffraction of X radiation
as it passes through the substance being investigated is
called X-ray structure analysis. This method proves most
efficient in studying the structure of crystalline bodies.
The diffraction pattern, registered on photographic
film, is called the X-ray photograph, or pattern, of the specimen.
An X-ray photograph obtained in the diffraction of a beam of
"white" X radiation (with a continuous frequency spectrum) by
a monocrystal is called a Laue diffraction pattern. It consists of
discrete diffraction (Laue pattern) spots whose position is
determined by the Laue conditions (33.4.1). A Debye-Scherrer-Hull
powder photograph, or X-ray powder photograph, is an X-ray
photograph obtained in the diffraction of a beam of monochromatic
X radiation by a polycrystalline specimen (for instance,
a crystalline powder). Such a photographic pattern consists of
a system of concentric diffraction rings. The radii of the rings
are $r = l \tan 2\theta$, where $l$ is the distance from the specimen to
the plane of the X-ray powder photograph, perpendicular to the
incident ray, and angles $\theta$ comply with the Wulff-Bragg
condition (33.4.4).
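
The relation $r = l \tan 2\theta$ can be used in both directions: to predict
a ring radius from a grazing angle, or to recover the angle from a
measured ring. A minimal sketch follows; the function names and the
restriction to $2\theta < 90°$ (a flat film placed behind the specimen)
are assumptions of the example.

```python
import math

def powder_ring_radius(l, theta_deg):
    """Ring radius r = l*tan(2*theta) on a flat film at distance l."""
    two_theta = math.radians(2.0 * theta_deg)
    if not 0.0 < two_theta < math.pi / 2:
        raise ValueError("a flat film behind the specimen needs 0 < 2*theta < 90 deg")
    return l * math.tan(two_theta)

def grazing_angle_from_ring(l, r):
    """Inverse relation: theta (degrees) from a measured ring radius r."""
    return 0.5 * math.degrees(math.atan2(r, l))

r = powder_ring_radius(50.0, 20.0)       # l in mm, theta in degrees
print(r, grazing_angle_from_ring(50.0, r))
```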


33.5  Resolving  Power  of  Optical  Instruments 

33.5.1  The  image  of  the  object  in  any  optical  system  (telescope, 
microscope,  photographic  camera,  etc.)  is  obtained  by  means  of 
a  restricted  beam  of  light  admitted  into  the  instrument  by  the 
so-called  aperture  diaphragm.  Such  a  diaphragm  may  be  that  of 
a  photographic  camera,  lens  mount  of  a  telescope,  etc.  Reducing 
the diameter of the aperture diaphragm helps to decrease various
distortions of the image resulting from the use of wide beams
of  light  and  called  geometrical  aberrations  of  the  optical  system. 
But,  owing  to  diffraction  of  light  in  an  optical  instrument,  the 
image  of  a  bright  point  is  not  a  point,  but  a  bright  spot  sur¬ 
rounded  by  a  system  of  concentric  interference  rings  (alternating 
dark  and  bright  ones  in  the  case  of  monochromatic  light  and 
iridescent  in  the  case  of  white  light).  This  phenomenon  limits  the 
resolving power (resolution) of the optical instrument, i.e. its
capacity  to  provide  separate  images  of  two  points  of  the  object 
close  to  each  other. 

33.5.2 According to Rayleigh's criterion, the images of two
identical point sources of light can still be seen separately if the
central maximum of the diffraction pattern of one falls exactly
on the first minimum of the diffraction pattern of the other. It
follows from Sect. 33.3.3 that, in accordance with the Rayleigh
criterion, two close stars, observed in a telescope with
monochromatic light of wavelength $\lambda$, can be resolved (viewed
separately) if the angular separation (distance) between them is

$$\Delta\varphi \geq 1.22\,\lambda/D,$$

where $D$ is the diameter of the objective. The quantity
$(\Delta\varphi)_0 = 1.22\,\lambda/D$ is called the angular resolution
limit of the telescope. The reciprocal quantity $1/(\Delta\varphi)_0$ is
called the resolving power of the telescope. The resolving power of
a telescope increases in proportion to the diameter of its objective.
The condition for the resolution of a terrestrial telescope and
a photographic camera in viewing and photographing distant items
coincides with that for an astronomical telescope.

The angular resolution limit of the eye is determined by the
diffraction of light by the pupil ($D \approx 2$ mm) and the granular
structure of the retina. This limit is about one minute of arc.
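
The Rayleigh limit is easy to evaluate numerically. The following sketch computes the angular resolution limit (1.22 times the wavelength divided by the aperture diameter) for an assumed green wavelength of 550 nm, first for a hypothetical 100 mm telescope objective and then for a 2 mm pupil. The helper names are illustrative.

```python
import math

def angular_resolution_limit(wavelength, aperture_diameter):
    """Rayleigh angular resolution limit in radians: 1.22 * lambda / D."""
    return 1.22 * wavelength / aperture_diameter

def rad_to_arcsec(angle_rad):
    """Convert an angle from radians to seconds of arc."""
    return math.degrees(angle_rad) * 3600.0

wavelength = 550e-9  # assumed wavelength of green light, m

telescope = angular_resolution_limit(wavelength, 0.100)  # hypothetical 100 mm objective
eye = angular_resolution_limit(wavelength, 2e-3)         # pupil of about 2 mm

print(f"telescope: {rad_to_arcsec(telescope):.2f} arcsec")
print(f"eye:       {rad_to_arcsec(eye) / 60.0:.2f} arcmin")
```

For the 2 mm pupil the diffraction estimate comes out slightly above one minute of arc, in line with the figure quoted for the eye.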

33.5.3 The resolving power of a microscope is characterized by
the quantity $(\Delta l)_0$, the shortest distance between two points of
the object that are separately visible on the image. In the case
of a self-luminous object, all of whose points can be assumed to
be incoherent sources,

$$(\Delta l)_0 = \frac{0.61\,\lambda_0}{A},$$

where $\lambda_0$ is the wavelength of the light in vacuum, $A = n \sin u$
is the numerical aperture of the objective, $n$ is the refractive
index of the medium between the object and the objective, and
$u$ is one half of the aperture angle of the beam of light issuing
from a point of the object and entering the microscope objective
(i.e. the angle between the outermost ray entering the objective
and the optical axis). For non-self-luminous objects, the value
of $(\Delta l)_0$ depends upon the illumination conditions. But in this
case as well, $(\Delta l)_0 \sim \lambda_0/A$.

The resolving power of a microscope can be improved either
by reducing the wavelength $\lambda_0$, or by increasing the numerical
aperture $A$ (numerical aperture is abbreviated N.A.). The former
method is employed in ultraviolet and electron microscopy,
and the latter, in the immersion microscope, in which the space
between the objective and the object is filled with a transparent
liquid having a refractive index $n > 1$.
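
To see how much immersion helps, the sketch below evaluates the self-luminous-object limit (0.61 times the vacuum wavelength divided by the numerical aperture) for a dry objective and for an immersion objective with the same aperture half-angle. The half-angle of 65 degrees, the liquid index of 1.5 and the wavelength are assumed sample values, not data from the handbook.

```python
import math

def microscope_resolution(wavelength_vacuum, n_medium, half_angle_rad):
    """Smallest resolvable distance 0.61 * lambda_0 / A, with A = n * sin(u)."""
    numerical_aperture = n_medium * math.sin(half_angle_rad)
    return 0.61 * wavelength_vacuum / numerical_aperture

wavelength_vacuum = 550e-9       # assumed wavelength, m
half_angle = math.radians(65.0)  # assumed aperture half-angle u

dry = microscope_resolution(wavelength_vacuum, 1.0, half_angle)
immersion = microscope_resolution(wavelength_vacuum, 1.5, half_angle)

print(f"dry objective:       {dry * 1e9:.0f} nm")
print(f"immersion objective: {immersion * 1e9:.0f} nm")
```

At a fixed half-angle the resolvable distance shrinks by exactly the factor $n$ of the immersion liquid.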


33.6  Holography 

33.6.1 Holography is a technique for obtaining a three-dimensional
optical image of objects and is based on the phenomenon
of  wave  interference.  In  contrast  to  the  conventional  photographic 
method,  light-sensitive  fine-grained  photographic  emulsion  is 
used  to  record  the  relations,  not  only  between  the  amplitudes 
(or  their  squares,  i.e.  the  intensities)  of  light  waves  scattered  by 
various  small  portions  of  the  object’s  surface,  but  also  between 
the  phases  of  these  waves. 

The  essence  of  the  holographic  method  is  made  clear  in 
Fig.  33.10.  Photographic  plate  P  (Fig.  33.10a)  records  the 
interference  pattern  obtained  in  superposing  wave  1,  scattered 
by  the  object  Q  and  called  the  signal  wave,  or  object  beam,  and 
wave 2, coherent with wave 1 and having fixed values of amplitude
and phase. Wave 2, called the reference wave, or reference
beam,  is  emitted  by  the  same  source  of  light  that  illuminates  the 
object  and,  after  reflection  from  mirror  B,  falls  directly  on  the 
photographic  plate  P.  The  interference  pattern  recorded  on  the 
photographic  plate  after  it  is  developed  is  called  the  hologram  of 
object Q. It consists of a very fine and intricate pattern of
alternating interference maxima and minima in the blackening of
the  photographic  emulsion  and,  in  contrast  to  a  photographic 
image,  has  no  outward  resemblance  to  the  object. 

Holograms  are  obtained  by  the  interference  of  light  at  large  path 
differences, i.e. they require an extremely high degree of
coherence of the light (32.2.4). For this reason lasers (39.6.8) are
used in holography as the light source.

33.6.2 Reconstructing the image of the object from its hologram C
(Fig. 33.10b) is accomplished by passing reference wave 2,
from the same laser used to make the hologram, through it as
in the projection of a slide (diapositive). In this operation, the
orientation of the plate with the hologram with respect to the
reference wave should also be maintained. Wave 2 is diffracted
by the hologram. As a result of this diffraction, two
three-dimensional images of the object, a virtual and a real image,
are seen. The virtual image Q' is at the same place, with respect to
the hologram, where the object Q was in taking the photograph.
This image is visible in looking through the hologram as if it
were a window. The real image Q" is on the other side of the
hologram. It seems to be in the air in front of the hologram and
is a mirror image of the object.


Fig. 33.10


Ordinarily, the virtual image of the object is used; in visual
perception it is identical to the object itself. It not only has
the property of being three-dimensional, but its perspective
varies with the position of the eye of the observer with respect to
the hologram. For example, by moving the head along the
hologram, it is possible to see behind some item located in the
foreground of the holographic image.

33.6.3  The  interference  pattern  at  each  point  of  the  hologram  is 
determined  by  light  scattered  by  all  the  points  of  the  object. 
Hence,  any  portion  of  the  hologram  contains  information 
about  the  whole  object  and  can  enable  the  image  of  the  whole 
object  to  be  restored  if,  in  damaging  the  hologram,  only  this 
single  portion  remains  intact.  The  smaller  the  size  of  the  saved 
portion  of  the  damaged  hologram,  the  less  the  light  diffracted 
by  it  in  restoring  the  image.  The  luminance  is  correspondingly 
reduced  and  the  sharpness  of  the  holographic  image  of  the 
object  is  impaired.  Thus,  a  hologram  has  essential  advantages 
with respect to reliability of storage of information over
a conventional photograph or photographic negative, each element of
which  contains  information  only  about  the  portion  of  the  object 
depicted  on  it. 

Holographic  recording  of  information  is  distinguished  for  its 
high  capacity  and  compact  form.  A  great  many  holograms  can 
be  recorded  on  a  single  photographic  plate.  For  instance,  it  is 
sufficient,  for  this  purpose,  to  record  each  hologram  with 
a  different  value  of  the  angle  of  incidence  of  the  reference  wave 
on  the  photographic  plate. 

33.6.4 Holography can be used to obtain coloured three-dimensional
images of objects. To make such a “coloured” hologram,
use is made of the monochromatic light of three lasers with
three primary colours (for instance, red, green and blue).
The recording of the interference patterns, corresponding to
light of three wavelengths, is accomplished simultaneously or
consecutively on the same photographic plate. To reconstruct
the coloured three-dimensional image of the object it is necessary
to simultaneously direct onto the hologram the three reference
beams of monochromatic light that were used in recording.

33.6.5 Three-dimensional holograms have special properties not
possessed by the ordinary kind. They are obtained by using
thick-layer photographic emulsions. Recorded on
a three-dimensional hologram is a spatial interference pattern,
instead of a plane one. This pattern is obtained in superposing the
signal and reference waves. Such a hologram is similar to a spatial
diffraction grating (lattice). It is capable of separating out of the
incident white light monochromatic light of the wavelength
that was used in recording the hologram. Therefore, an image
recorded as a three-dimensional hologram can be reconstructed
by illuminating the hologram either by the corresponding
monochromatic light or by white light. If a three-dimensional
hologram is “coloured”, the coloured three-dimensional image can
be reconstructed by simply illuminating the hologram with
white light.


CHAPTER  34  ABSORPTION,  SCATTERING 
AND  DISPERSION  OF  LIGHT. 
VAVILOV-CHERENKOV  RADIATION 

34.1  Interaction  of  Light  With  Matter 

34.1.1  According  to  the  concepts  of  classical  electron  theory, 
the  variable  electromagnetic  field  of  a  light  wave,  propagating 
in  a  dielectric  medium,  sets  up  forced  vibrations  of  the  bound 
charges  (electrons  and  ions)  included  in  the  composition  of  the 
molecules of the medium. Consequently, each molecule of the
medium can be considered as a system of oscillators with
different angular frequencies of natural vibration (28.1.1). Ions are
considerably more massive than electrons and execute appreciable
vibration only as the effect of low-frequency (infrared)
radiation.  In  the  frequency  region  of  visible  and  ultraviolet 
light,  the  deciding  role  is  played  by  the  forced  vibration  of  the 
external,  more  weakly  bound  electrons  of  atoms  and  molecules. 
They  are  called  optical  electrons. 

34.1.2 Electrons and ions that execute forced vibrations by
the action of light radiate secondary light waves of the same
frequency. The average distance between the molecules of
a medium is only a small fraction of the coherence length (32.1.4)
of the light. Therefore, the secondary waves radiated by a great
number of neighbouring molecules are coherent and display
interference when superposed.
If the medium is homogeneous and isotropic (30.1.6), interference
results in the formation of a transmitted wave, whose phase
velocity depends upon the frequency and whose direction of
propagation coincides with that of the primary wave.

34.1.3  When  the  medium  is  optically  nonhomogeneous,  the 
superposition  of  the  primary  and  secondary  waves  results  in 
scattering  of  the  light  (34.3.1).  Finally,  when  light  falls  on  the 
interface  between  two  different  media,  interference  results  in 
the  formation  of  a  reflected  wave  in  addition  to  the  transmitted 
wave.  Thus,  a  more  or  less  considerable  layer  of  particles  of 
the  medium,  adjoining  the  reflecting  surface,  participates  in 
the formation of the reflected wave. Therefore, upon total
internal reflection (31.5.8), the electromagnetic field of the
light wave is not completely cut off at an interface with an
optically less dense medium, but partly penetrates it. The field
strength $E$, however, decreases very rapidly as the distance from
the interface increases, according to the equation

$$E \propto \exp\left[-\frac{2\pi z}{\lambda_2}\sqrt{\frac{\sin^2 i}{n_{21}^2} - 1}\,\right],$$

where $z$ is the distance from the interface, $i$ is the angle of
incidence ($i > i_{cr}$), $\lambda_2$ is the wavelength of light in the
second (optically less dense) medium, $i_{cr}$ is the critical angle
(31.5.8), and $n_{21}$ is the relative refractive index of the second
medium.
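
A quick estimate shows how shallow this penetration is. The sketch below computes the distance over which the field strength falls by a factor of $e$, assuming that the exponent has the form $(2\pi z/\lambda_2)\sqrt{\sin^2 i/n_{21}^2 - 1}$ with $\lambda_2$ the wavelength in the less dense medium. The glass-to-air values are assumed for illustration.

```python
import math

def penetration_depth(wavelength_2, n21, incidence_rad):
    """Distance from the interface at which E drops by a factor of e.

    wavelength_2 is the wavelength in the less dense medium, n21 < 1 the
    relative refractive index, incidence_rad the angle of incidence.
    """
    radicand = (math.sin(incidence_rad) / n21) ** 2 - 1.0
    if radicand <= 0.0:
        raise ValueError("angle of incidence does not exceed the critical angle")
    return wavelength_2 / (2.0 * math.pi * math.sqrt(radicand))

# Assumed example: light passing from glass (n = 1.5) towards air,
# wavelength in air 550 nm, angle of incidence 60 degrees.
depth = penetration_depth(550e-9, 1.0 / 1.5, math.radians(60.0))
print(f"1/e depth = {depth * 1e9:.0f} nm")
```

The depth is a fraction of a wavelength, and it grows without bound as the angle of incidence approaches the critical angle.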


34.2  Absorption  of  Light 

34.2.1  The  absorption  of  light  refers  to  the  reduction  in  the 
energy  of  a  light  wave  as  it  is  propagated  in  a  substance.  It 
results  from  the  conversion  of  energy  of  the  wave  into  internal 
energy (9.1.2) of the substance or into energy of secondary
radiation having a different spectral composition and other
directions of propagation (photoluminescence, Sect. 39.7.5). Light
absorption  may  lead  to  heating  of  the  substance,  excitation  and 
ionization  of  its  atoms  or  molecules,  photochemical  reactions  and 
other  processes  in  the  substance. 

The absorption of light is described by the Bouguer-Lambert law
(also called the Lambert law), which states that the intensity $I$
of a plane wave of monochromatic light decreases as it passes
through an absorbing medium according to the exponential
relation

$$I = I_0 e^{-\alpha x},$$

where $I_0$ and $I$ are the intensities of the light at the entrance to
and exit from a layer of the medium of thickness $x$, and $\alpha$ is the
absorption factor (linear coefficient of absorption), which depends
on the chemical nature and state of the substance and on the
wavelength $\lambda$ of the light.

Beer's law is valid for dilute solutions of an absorbing substance
in a nonabsorbing solvent: $\alpha = bc$, where $c$ is the concentration
of the solution and $b$ is a proportionality factor that is
independent of $c$. Beer's law does not hold for high concentrations
because of the effect of the interaction between the closely spaced
molecules of the absorbing substance.
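
The two laws combine into a one-line calculation. The sketch below evaluates the Bouguer-Lambert exponential decay of intensity with an absorption factor given by Beer's law as the product of the proportionality factor and the concentration. The numerical values of these two quantities are arbitrary sample values in mutually consistent units.

```python
import math

def transmitted_intensity(intensity_in, alpha, thickness):
    """Bouguer-Lambert law: intensity after a layer of the given thickness."""
    return intensity_in * math.exp(-alpha * thickness)

def beer_absorption_factor(b, concentration):
    """Beer's law for dilute solutions: alpha = b * c."""
    return b * concentration

b = 50.0  # assumed proportionality factor, 1/(m * concentration unit)
for concentration in (1.0, 2.0, 4.0):
    alpha = beer_absorption_factor(b, concentration)
    fraction = transmitted_intensity(1.0, alpha, 0.01)  # 1 cm layer
    print(f"c = {concentration}: transmitted fraction = {fraction:.3f}")
```

Doubling the concentration squares the transmitted fraction, which is a convenient check on whether a solution is still dilute enough for Beer's law to hold.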

34.2.2 According to the Bouguer-Lambert law, the equation of
a plane, linearly polarized, monochromatic light wave,
propagating in an absorbing medium along the positive direction
of the OX axis, is of the form:

$$E = E_0 e^{-\alpha x/2} \cos(\omega t - kx).$$

Here $E$ is the strength of the wave's electric field at points with
the coordinate $x$, $E_0$ is the amplitude of $E$ at points in the plane
$x = 0$, $\omega$ is the angular frequency of the light,
$k = 2\pi/\lambda = \omega n/c$ is the wavenumber, $\lambda$ is the wavelength
of the light in the medium, $c$ is the velocity of light in free space,
and $n$ is the refractive index of the medium.

In the exponential form the equation of this wave is of the form
(30.2.7):

$$\tilde{E} = E_0 e^{-\alpha x/2} e^{i(\omega t - kx)} = E_0 e^{i(\omega t - \tilde{n}\omega x/c)},$$

where $\tilde{n} = n\left(1 - i\,\dfrac{\alpha}{2k}\right) = n(1 - i\kappa)$ is the
complex refractive index of the medium ($i = \sqrt{-1}$), and
$\kappa = \alpha/2k = \alpha\lambda/4\pi$ is the absorptivity, or absorptive
power, which characterizes the decrease in the intensity and
amplitude of a plane wave as it propagates in a medium.
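
The equivalence of the damped-wave form and the compact form with a complex refractive index can be checked numerically. The sketch below assumes the complex index $n(1 - i\kappa)$ with $\kappa = \alpha/2k$ and compares the two expressions at one point and instant. All numerical values are arbitrary sample values.

```python
import cmath
import math

C_LIGHT = 299792458.0  # velocity of light in free space, m/s

def absorptivity(alpha, wavelength_medium):
    """kappa = alpha * lambda / (4 * pi), with lambda measured in the medium."""
    return alpha * wavelength_medium / (4.0 * math.pi)

def field_two_ways(e0, alpha, n, omega, x, t):
    """Return the complex field written in the damped and in the compact form."""
    k = omega * n / C_LIGHT
    kappa = alpha / (2.0 * k)
    damped = e0 * math.exp(-alpha * x / 2.0) * cmath.exp(1j * (omega * t - k * x))
    n_complex = n * (1.0 - 1j * kappa)
    compact = e0 * cmath.exp(1j * omega * (t - n_complex * x / C_LIGHT))
    return damped, compact

damped, compact = field_two_ways(e0=1.0, alpha=1.0e5, n=1.5,
                                 omega=3.0e15, x=1.0e-6, t=2.0e-15)
print(abs(damped - compact))
```

The printed difference is at the level of floating-point rounding, which confirms that the imaginary part of the complex index carries the attenuation.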

34.2.3 The dependence of the absorption factor $\alpha$ of a dielectric
on the light's wavelength $\lambda$, characterizing the light absorption
spectrum in this medium, is related to the resonance phenomenon
upon  forced  vibration  of  the  electrons  in  the  atoms  and  the 
atoms  in  the  molecules  of  the  dielectric.  Dielectrics  absorb  light 
more  or  less  selectively:  absorption  is  great  only  in  frequency 
regions  close  to  the  natural  frequencies  of  vibrations  of  the 
electrons  in  the  atoms  and  the  atoms  in  the  molecules.  This 
phenomenon  of  resonance  absorption  of  light  is  displayed  most 
clearly  in  dilute  monatomic  gases  (for  example,  the  vapours  of 
most  metals),  which  have  a  line  spectrum  of  light  absorption. 
Discrete  frequencies  of  intensive  light  absorption  coincide  with 
the  frequencies  of  the  self-radiation  of  excited  atoms  in  these 
gases. 

Systems of closely spaced lines, forming absorption bands, are
observed in gases having polyatomic molecules. The structure
of the absorption bands is determined by the composition and
structure of the molecules. Liquid and solid dielectrics have
continuous absorption spectra, consisting of comparatively wide
absorption bands, within which the absorption factor $\alpha$ reaches
a quite high value and varies smoothly in accordance with the
wavelength $\lambda$. This kind of dependence of $\alpha$ on $\lambda$ in condensed
media is due to the strong interaction between the particles of
the medium, leading to the appearance of a great number of
additional resonance frequencies.

34.2.4 At sufficiently high intensities of light, deviations from
the Bouguer-Lambert law (34.2.1) are observed: the absorption
factor of a dielectric medium begins to depend upon $I$,
decreasing with an increase in $I$. This phenomenon, inexplicable
within the framework of classical light absorption theory, is readily
interpreted in the quantum theory of the interaction of light
with matter. In the absorption of light, a part of the molecules
of the medium goes over into an excited state. The excited
molecules cannot participate in further light absorption until they
return, after expending their surplus energy, to the unexcited
(“normal”) state. The higher the intensity of the light and the
longer the average lifetime $\langle\tau\rangle$ of a molecule in the excited
state, the greater the fraction of excited molecules in the medium.
If this fraction of excited molecules is negligible, light is
absorbed in accordance with the Bouguer-Lambert law. Otherwise,
$\alpha$ decreases with an increase in the intensity of the light.

It is feasible to obtain a nonequilibrium state of the medium,
in which the fraction of excited molecules is so large that the
absorption factor of the medium becomes negative. This
phenomenon is made use of in quantum oscillators of radio waves and
light (39.8.8).

34.2.5 Metals in a condensed state contain a huge number of
conduction electrons and, therefore, have high electrical
conductance. When subjected to light, the conduction electrons
execute variable motion and radiate secondary waves. As
a result of the superposition of the primary wave, falling on the
surface of the metal, and the secondary waves, an intense
reflected wave and a relatively weak wave that enters the metal
are formed. The reflection coefficient (31.5.6) can reach 95 per cent
and even more. It depends upon the surface finish of the metal,
its electrical conductance and the light frequency. The refracted
wave is rapidly absorbed in the metal. Its energy is consumed
on the production of Joule heat, evolved by the conduction
currents initiated by the action of light in the thin layer of
metal adjacent to the surface. In the frequency region of infrared
radiation, the optical properties of metals are determined mainly
by the conduction electrons. But in the region of visible
light and especially ultraviolet radiation, bound electrons, in
the ions of the metal, begin to play an appreciable role. This
leads to a reduction in the reflection coefficient and to its
pronounced dependence on the frequency.


34.3 Scattering of Light

34.3.1  The  scattering  of  light  is  the  phenomenon  in  which  light 
is  transformed  by  a  substance.  It  is  accompanied  by  a  change 
in  the  direction  of  light  propagation,  and  is  manifested  as  an 
extrinsic  glow  of  the  substance.  This  glow  is  due  to  the  forced 
vibration  of  the  electrons  in  the  atoms  of  the  scattering  medium 
by  the  action  of  the  incident  light.  Light  scattering  occurs  in 
an  optically  nonhomogeneous  medium,  whose  refractive  index 
varies  irregularly  from  point  to  point  due  either  to  fluctuations 
in  the  density  of  the  medium  or  to  the  presence  in  it  of  small 
foreign particles. In the former case, the light scattering is said
to be molecular or Rayleigh scattering; in the latter, light
scattering in a turbid medium. Examples of turbid media are aerosols
(smoke and fog), emulsions and colloidal solutions.

34.3.2 The scattering of light in turbid media by particles whose
size is small compared to the wavelength $\lambda$ is called the Tyndall
effect. A system of electrons, executing forced vibration in the
atoms of electrically isotropic particles of small size
$r_0 \approx (0.1 \text{ to } 0.2)\,\lambda$, is equivalent to one oscillating
electrical dipole (linear harmonic oscillator). This dipole oscillates
at the frequency $\nu$ of the light incident on it, and the intensity of
the light it radiates is proportional to $\nu^4$ (31.3.3). Hence, the
Rayleigh law is valid for scattered light. It states that the
intensity $I$ of scattered light is inversely proportional to the
fourth power of the wavelength: $I \propto \lambda^{-4}$. When white light
passes through a finely divided turbid medium, short-wave
(dark-to-light blue) light predominates in the scattered light, and
long-wave (yellow-to-red) light in the transmitted light. This
explains, for instance, the light-blue colour of the sky and the
yellowish-red colour of the rising and setting sun.
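
The strength of the inverse-fourth-power dependence is easiest to appreciate with numbers. The sketch below compares the scattered intensities at two wavelengths under the Rayleigh law, assuming equal incident intensities. The values 450 nm and 650 nm are taken as representative blue and red wavelengths.

```python
def rayleigh_intensity_ratio(wavelength_a, wavelength_b):
    """Ratio I(a) / I(b) of scattered intensities when I is proportional to lambda**-4."""
    return (wavelength_b / wavelength_a) ** 4

blue = 450e-9  # assumed representative blue wavelength, m
red = 650e-9   # assumed representative red wavelength, m
ratio = rayleigh_intensity_ratio(blue, red)
print(f"blue light is scattered about {ratio:.1f} times more strongly than red")
```

A modest difference in wavelength thus becomes a severalfold difference in scattered intensity.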

In scattering of natural light (34.1.1), the dependence of the
intensity of the scattered light on the scattering angle $\theta$ is of
the form

$$I_\theta = I_{\pi/2}\,(1 + \cos^2\theta).$$

Here $I_\theta$ and $I_{\pi/2}$ are the intensities of light scattered at
angles of $\theta$ and $\pi/2$ to the direction of the primary light beam
falling on a turbid medium. Light scattered at the arbitrary angle
$\theta$ is partly polarized (35.1.1); that scattered at the angle $\pi/2$
is completely linearly polarized (31.1.7): vector $\mathbf{E}$ of the field
set up by this light is perpendicular to the plane passing through
the incident and scattered rays.
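
The angular law can be tabulated directly. The sketch below evaluates the factor $1 + \cos^2\theta$ relative to the intensity scattered at right angles. The function name and the chosen angles are illustrative.

```python
import math

def scattered_intensity(theta, intensity_at_right_angle=1.0):
    """Intensity scattered at angle theta: I_(pi/2) * (1 + cos(theta)**2)."""
    return intensity_at_right_angle * (1.0 + math.cos(theta) ** 2)

for degrees in (0, 45, 90, 135, 180):
    value = scattered_intensity(math.radians(degrees))
    print(f"theta = {degrees:3d} deg: I / I_(pi/2) = {value:.2f}")
```

The pattern is symmetric about the right-angle direction, so forward and backward scattering are equally strong for such small particles.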

34.3.3 As the size $r_0$ of the inhomogeneities in a turbid medium
is increased, the laws of light scattering are changed. At $r_0 > \lambda$,
the dependence of $I_\theta$ on $\theta$ is of complex form, the intensity of
the forward scattering of light (in the directions $\theta < \pi/2$)
being greater than for backward scattering. This phenomenon is
called the Mie effect. In this case, light scattered at the angle
$\theta = \pi/2$ is only partly polarized. The dependence of the
scattered light intensity $I$ on the wavelength $\lambda$ is of the form
$I \propto \lambda^{-p}$, where $p < 4$ and decreases with an increase in
$r_0$. At $r_0 \gg \lambda$, the spectral compositions of the scattered and
incident light practically coincide. This explains, for instance, the
white colour of the clouds.

34.3.4 Molecular scattering in pure media, not containing any
foreign impurities, is due to inhomogeneities that occur in the
process of random thermal motion of the particles of the medium.
These inhomogeneities are associated with fluctuations in density
(11.6.1), whereas in media having anisotropic (polar) molecules,
the inhomogeneities are also due to fluctuations in the
orientation of these molecules (fluctuations of anisotropy). In
true solutions, light may be scattered by fluctuations in
concentration. Under ordinary conditions the size of the regions of
the medium, corresponding to any appreciable fluctuations, is much
less than the wavelength of visible light. Therefore, the
dependence of the scattered light intensity on the wavelength
$\lambda$ and angle $\theta$, as well as on the kind of polarization in
molecular scattering, is analogous to the corresponding laws for the
Tyndall effect. But, in contrast to the Tyndall effect, the intensity
of molecular scattering of light depends upon the temperature of
the medium, increasing with the temperature.


34.4 Normal and Anomalous Light Dispersion

34.4.1 The dispersion of light is the dependence of its phase
velocity $v$ in a medium on its frequency $\nu$. According to Sect.
31.5.1, $v = c/n$, where $c$ is the velocity of light in free space,
and $n$ is the refractive index of the medium. Since $c$ is a universal
constant, the same for electromagnetic waves of any frequency,
the existence of the dispersion of light in a medium is due to
the fact that its refractive index $n$ depends upon the frequency
$\nu$. This dependence is readily detected, for example, in passing
a beam of white light through a prism made of some transparent
medium. Observed on the screen, mounted behind the
prism, is an iridescent band (Fig. 34.1) called a prismatic, or
dispersion, spectrum.


Fig. 34.1


34.4.2 The dependence of the refractive index $n$ of a medium on
the light frequency $\nu$ is neither linear nor monotonic. The region
of $\nu$ values in which $dn/d\nu > 0$, i.e. $n$ increases with $\nu$,
corresponds to normal dispersion of light. Normal dispersion is found
for substances transparent to light. For example, ordinary glass
is transparent to visible light and, in this frequency region,
normal dispersion of light is observed in glass. A case of normal
dispersion of light is illustrated in Fig. 34.1.

The dispersion of light is said to be anomalous if $dn/d\nu < 0$,
i.e. the refractive index of the medium decreases with an
increase in $\nu$. Anomalous dispersion is found in frequency regions
that correspond to intense absorption of light in the given
medium (34.2.1). In ordinary glass, for instance, these bands are
in the infrared and ultraviolet parts of the spectrum.

34.4.3 Depending upon the kind of dispersion light undergoes,
its group velocity $u$ can be either more or less than its phase
velocity $v$. According to Sect. 30.4.3, the group velocity is
related to the angular frequency $\omega$ of the wave and to its
wavenumber $k$ by the equation $u = d\omega/dk$. Since $\omega = 2\pi\nu$ and
$k = 2\pi/\lambda = 2\pi n\nu/c$,

$$u = \frac{c}{n + \nu\,\dfrac{dn}{d\nu}} = \frac{v}{1 + \dfrac{\nu}{n}\,\dfrac{dn}{d\nu}}.$$

In normal dispersion the group velocity is less than the phase
velocity ($u < v$). In the case of anomalous dispersion $u > v$
and, in particular, if $n + \nu\,dn/d\nu < 1$, then $u > c$. This does
not contradict the statement of the special theory of relativity
that the velocity of any signal (including a light signal) cannot
exceed $c$ (5.1.3). The concept of group velocity correctly
describes only a signal whose “shape”, i.e. the distribution of
amplitudes and energies along its “length”, does not change when the
signal travels through the medium. But for light, this condition
is complied with only approximately; the narrower the frequency
spectrum of the signal and the less the light dispersion in the
medium, the more exactly the condition is complied with. In
the frequency regions corresponding to anomalous dispersion,
the group velocity does not coincide with the velocity of the
signal because, owing to considerable dispersion of light, the
“shape” of the signal changes rapidly as it propagates in the
medium.
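
The relation between group and phase velocity can be tried out on a model dispersion curve. The sketch below evaluates $u = c/(n + \nu\,dn/d\nu)$ with a numerical derivative. The model $n(\nu) = a + b\nu^2$ and its coefficients are hypothetical and serve only to provide a case of normal dispersion.

```python
C_LIGHT = 299792458.0  # velocity of light in free space, m/s

def model_index(nu):
    """Hypothetical normal-dispersion model n = a + b * nu**2 (assumed coefficients)."""
    return 1.5 + 1.0e-32 * nu ** 2

def group_velocity(index_of_nu, nu, rel_step=1e-4):
    """Group velocity c / (n + nu * dn/dnu), using a central difference for dn/dnu."""
    step = nu * rel_step
    dn_dnu = (index_of_nu(nu + step) - index_of_nu(nu - step)) / (2.0 * step)
    return C_LIGHT / (index_of_nu(nu) + nu * dn_dnu)

nu = 5.0e14  # frequency in the visible region, Hz
phase = C_LIGHT / model_index(nu)
group = group_velocity(model_index, nu)
print(f"phase velocity v = {phase:.4e} m/s")
print(f"group velocity u = {group:.4e} m/s")
```

Because the model index rises with frequency, the computed group velocity falls below the phase velocity, as stated for normal dispersion.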


34.5 Classical Electron Theory of Light Dispersion

34.5.1 Optically transparent media are nonmagnetic ($\mu_r \approx 1$),
so that their refractive index (31.5.1) is
$n = \sqrt{\varepsilon_r} = \sqrt{1 + \chi_e}$, where $\varepsilon_r$ and $\chi_e$
are the relative permittivity (dielectric constant) and the
dielectric (or electric) susceptibility of the medium
(18.2.4). Therefore, the dispersion of light can be regarded
as the dependence of $\varepsilon_r$ and $\chi_e$ on the frequency of the
variable electromagnetic field of the light, causing electronic
polarization of the medium (18.2.2). If each atom (or molecule) of
the medium contains one optical electron (34.1.1), the polarization
vector of the medium (18.2.3) is $\mathbf{P}_e = -e n_0 \mathbf{r}$, where $-e$
is the charge of the electron, $\mathbf{r}$ is its displacement from the
equilibrium position, and $n_0$ is the concentration of atoms (or
molecules) of the medium. On the other hand (18.2.4),
$\mathbf{P}_e = \varepsilon_0 \chi_e \mathbf{E}$, where $\varepsilon_0$ is the electric
constant (14.2.7), and $\mathbf{E}$ is the strength of the electric field
of the light.

34.5.2 An optical electron executes forced vibrations under the
effect of the following forces:

(a) restoring quasi-elastic force (40.3.5)
$\mathbf{F}_{rest} = -m\omega_0^2 \mathbf{r}$, where $m$ and $\omega_0$ are the mass of
the electron and the angular frequency of its free undamped
vibrations;

(b) force of resistance $\mathbf{F}_{res} = -2\beta m\,d\mathbf{r}/dt$, where $\beta$
is the damping factor of free vibrations of the electron;

(c) driving force $\mathbf{F} = -e\mathbf{E}$, exerted on the electron by
a variable field of strength $\mathbf{E}$.

The equation of forced vibrations is

$$\frac{d^2\mathbf{r}}{dt^2} + 2\beta\,\frac{d\mathbf{r}}{dt} + \omega_0^2\,\mathbf{r} = -\frac{e\mathbf{E}}{m}.$$


In the case of linearly polarized monochromatic light with the
angular frequency $\omega$, the field strength is
$\mathbf{E} = \mathbf{E}_0 \cos \omega t$, where $\mathbf{E}_0 = \text{const}$ is the
amplitude vector. If, in addition, the medium does not absorb light,
then $\beta = 0$ and the steady-state forced vibration of the optical
electron is executed according to the relation

$$\mathbf{r} = -\frac{e\mathbf{E}}{m(\omega_0^2 - \omega^2)}.$$

Here the polarization vector of the medium is

$$\mathbf{P}_e = \frac{n_0 e^2}{m(\omega_0^2 - \omega^2)}\,\mathbf{E} \quad\text{and}\quad \chi_e = \frac{n_0 e^2}{\varepsilon_0 m(\omega_0^2 - \omega^2)}.$$

The dependence of the refractive index of the medium on $\omega$ is
of the form

$$n^2 = 1 + \frac{n_0 e^2}{\varepsilon_0 m(\omega_0^2 - \omega^2)}.$$

34.5.3 At values of $\omega$ close to $\omega_0$, the absorption of light in
the medium can no longer be neglected by assuming that $\beta = 0$. In
an absorbing medium (i.e. at $\beta \neq 0$), vibrations of the optical
electron and of vector $\mathbf{P}_e$ are out of phase with respect to the
oscillations of the field strength $\mathbf{E}$ (29.2.2):

$$r = A \cos(\omega t + \varphi_0),$$

where

$$A = -\frac{eE_0}{m\sqrt{(\omega_0^2 - \omega^2)^2 + 4\beta^2\omega^2}} \quad\text{and}\quad \tan\varphi_0 = -\frac{2\beta\omega}{\omega_0^2 - \omega^2}.$$

Accordingly

$$P_e = \frac{n_0 e^2 E_0 \cos(\omega t + \varphi_0)}{m\sqrt{(\omega_0^2 - \omega^2)^2 + 4\beta^2\omega^2}}.$$

Introduced  to  describe  the  properties  of  a  light-absorbing  medi¬ 
um,  along  with  the  complex  refractive  index  (34.2.2)  h  = 

—  n  ( 1  —  ix),  are  the  complex  dielectric  susceptibility  % e  and  the 
complex  dielectric  constant  er: 


Xe  — 


and  er  =  l  +  Xe.  whereas  n2  —  l  +  Xe- 


.1/1.5]  ELECTRON  THEORY  OF  LIGHT  DISPERSION 


491 


Here  Pe  and  E  are  complex  values  of  the  polarization  and 
strength  of  the  field: 


~  re0e2£0gi((ji+<f») 

m  V  (0)2  —  o)2)2  +  4[l2o)2 

so  that 

rai  =  re2  (1  —  ix)2  =  l  -f 


and  E=  E0eiwt  , 


8  *711 


Y  (o)§  —  0)2)2  +  4P20 


e,(Po. 


n 2  (1  —  x2)  =  1  - 


n 0e*  cos  cp0 


e0m  Y  K  — 0)2)2 +  4p20)2 


=  1  + 


re0e2  (cog  —  co2) 


e0m  [(to| — co2)2  +  4j52o>2]  ’ 


2  n2x  = 


re0e2  sin  cp0 


2ra0e2f5o) 


s0m 


Y (o)2—o)2)2  +  4P2o)2  a0m  [(g)2  — o)2)2  +  4j32a)2]  • 
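The two real relations for $n^2(1 - \kappa^2)$ and $2n^2\kappa$ can be solved for $n$ and $\kappa$ by taking the complex square root of $\tilde{n}^2$. The sketch below is one possible way to do this in Python; the function name `complex_index` and the numerical values of $n_0$, $\omega_0$ and $\beta$ are assumptions made only for illustration.

```python
import cmath

EPS0 = 8.8541878128e-12      # electric constant, F/m
E_CHARGE = 1.602176634e-19   # elementary charge, C
M_E = 9.1093837015e-31       # electron mass, kg


def complex_index(omega, omega0, beta, n0):
    """Return (n, kappa) such that n (1 - i kappa) = sqrt(1 + chi_e)."""
    denom = (omega0**2 - omega**2) ** 2 + 4.0 * beta**2 * omega**2
    pref = n0 * E_CHARGE**2 / (EPS0 * M_E)
    real_part = 1.0 + pref * (omega0**2 - omega**2) / denom   # n^2 (1 - kappa^2)
    imag_part = 2.0 * pref * beta * omega / denom             # 2 n^2 kappa
    # Principal square root: positive real part, negative imaginary part.
    n_tilde = cmath.sqrt(complex(real_part, -imag_part))
    n = n_tilde.real
    kappa = -n_tilde.imag / n
    return n, kappa


# Assumed, illustrative parameters.
N0 = 2.5e25        # concentration of atoms, 1/m^3 (assumption)
OMEGA0 = 2.0e16    # natural angular frequency, rad/s (assumption)
BETA = 1.0e14      # damping factor, 1/s (assumption)

for omega in (0.5 * OMEGA0, OMEGA0 - BETA, OMEGA0, OMEGA0 + BETA):
    n, kappa = complex_index(omega, OMEGA0, BETA, N0)
    print(f"omega = {omega:.4e}  n = {n:.6f}  kappa = {kappa:.3e}")
```

In this sketch $\kappa$ is largest at $\omega = \omega_0$ and falls off rapidly away from the resonance, which is why $\beta$ can be neglected far from $\omega_0$.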


34.5.4 In the classical electron theory of light dispersion in
gases, each molecule of the gas is dealt with as a system of $q$
linear oscillators. If $\omega_{0j}$ and $\beta_j$ are the natural angular
frequency and damping factor of the $j$th oscillator, then

$$n^2(1 - \kappa^2) = 1 + \frac{n_0 e^2}{\varepsilon_0 m} \sum_{j=1}^{q} \frac{(\omega_{0j}^2 - \omega^2)\, f_j}{(\omega_{0j}^2 - \omega^2)^2 + 4\beta_j^2\omega^2}.$$

The dimensionless coefficient $f_j$ characterizes the contribution of
the $j$th oscillator to the dispersion and absorption of light. It
is called the oscillator strength. In the classical theory of
dispersion, the values of $\omega_{0j}$ and $f_j$ are assumed to be known from
experimental investigations.

For gases, $\kappa \ll 1$ and $n$ differs only slightly from unity, so that
$n^2 - 1 = (n + 1)(n - 1) \approx 2(n - 1)$. Therefore, the dependence
of $n$ on $\omega$ is of the form

$$n = 1 + \frac{n_0 e^2}{2\varepsilon_0 m} \sum_{j=1}^{q} \frac{(\omega_{0j}^2 - \omega^2)\, f_j}{(\omega_{0j}^2 - \omega^2)^2 + 4\beta_j^2\omega^2}.$$

The curve of this equation is shown in Fig. 34.2.

Anomalous dispersion is observed close to each of the frequencies
$\omega_{0j}$.
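The approximate formula for $n(\omega)$ in a gas lends itself to a direct numerical check. The following Python sketch sums the contributions of several oscillators; the function name `gas_refractive_index` and all oscillator parameters are hypothetical values chosen for illustration, since the handbook assumes $\omega_{0j}$ and $f_j$ to be taken from experiment.

```python
EPS0 = 8.8541878128e-12      # electric constant, F/m
E_CHARGE = 1.602176634e-19   # elementary charge, C
M_E = 9.1093837015e-31       # electron mass, kg


def gas_refractive_index(omega, n0, oscillators):
    """n = 1 + n0 e^2 / (2 eps0 m) * sum_j f_j (w0j^2 - w^2) / [(w0j^2 - w^2)^2 + 4 b_j^2 w^2].

    oscillators: iterable of (omega0j, beta_j, f_j) tuples.
    """
    total = 0.0
    for omega0j, beta_j, f_j in oscillators:
        diff = omega0j**2 - omega**2
        total += f_j * diff / (diff**2 + 4.0 * beta_j**2 * omega**2)
    return 1.0 + n0 * E_CHARGE**2 / (2.0 * EPS0 * M_E) * total


# Assumed, illustrative data: two oscillators per molecule.
N0 = 2.5e25
OSCILLATORS = [(2.0e16, 1.0e14, 0.6), (3.0e16, 2.0e14, 0.4)]

for omega in (1.0e16, 1.99e16, 2.01e16, 2.5e16):
    print(f"omega = {omega:.3e}  n = {gas_refractive_index(omega, N0, OSCILLATORS):.7f}")
```

Between $\omega_{0j} - \beta_j$ and $\omega_{0j} + \beta_j$ the computed $n$ decreases with increasing $\omega$, which corresponds to the region of anomalous dispersion.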


34.6  Vavilov-Cherenkov  Radiation 

34.6.1 Vavilov-Cherenkov radiation (or effect) is the radiation of
light, differing from luminescence (39.7.1), emitted in the
motion of charged particles in a substance at velocities $V$ greater
than the phase velocity $v$ of light in this substance. The
condition for the existence of this radiation is $c/n < V < c$, where $c$
is the velocity of light in free space (or in vacuum), and $n > 1$
is the refractive index of the substance.



In the process of Vavilov-Cherenkov radiation, the energy and
velocity of the free emitting particle are reduced, i.e. the particle
is retarded. But, in contrast to braking radiation (bremsstrahlung)
of slowly moving charged particles (31.3.4), resulting from
their change in velocity, the reduction in the velocity of particles
in Vavilov-Cherenkov radiation is itself a consequence of this
radiation. In other words, if the loss of energy of the particle,
consumed in emitting Vavilov-Cherenkov radiation, could be
replenished in some way and the particle continued travelling
in the substance at a “faster-than-light” velocity ($V > v$),
Vavilov-Cherenkov radiation would still be observed, whereas
there would be no braking radiation in this case.

34.6.2 A charged particle gives rise to short-duration polarization
(18.2.2) of the substance in the vicinity of the points
through which it travels. Hence, the molecules of the medium
lying on the path of the particle temporarily become
coherent sources (30.5.1) of elementary electromagnetic waves
that interfere upon superposition.

If $V < v = c/n$, the elementary waves cancel one another.
Assume that the charged particle travels at the velocity $V$
(where $V < v$) along the $OX$ axis (Fig. 34.3) and is at points
$A$ and $B$ at the instants of time $t$ and $t + \Delta t$, respectively. The
distance between $A$ and $B$ is $l = V\Delta t$. The path difference of
elementary waves, radiated from points $A$ and $B$ in the arbitrary
direction $\mathbf{n}$, making the angle $\alpha$ with vector $\mathbf{V}$, is

$$\Delta = DF = (v - V\cos\alpha)\,\Delta t = l\left(\frac{v}{V} - \cos\alpha\right).$$

It is possible, for each value of the wavelength $\lambda$, to find a value
$l = l_{\alpha\lambda}$ at which $\Delta = \lambda/2$, so that the elementary waves cancel
one another. Thus

$$l_{\alpha\lambda} = \frac{\lambda}{2\left(\dfrac{v}{V} - \cos\alpha\right)}.$$

When $l = l_{\alpha\lambda}$, radiation in the direction $\mathbf{n}$ from any point $M$
of portion $AB$ of the path of the charged particle is cancelled
upon interference with radiation in the same direction from
a like point $N$ on the adjacent portion $BC = AB = l_{\alpha\lambda}$, the
distance between points $M$ and $N$ being $MN = l_{\alpha\lambda}$. Therefore,
upon motion of a charged particle at uniform velocity in a
straight line in a substance at a “slower-than-light” velocity,
the particle does not emit radiation.

34.6.3 If a particle travels in a substance at a “faster-than-light”
velocity $V > v = c/n$, the value of $l_{\alpha\lambda}$, complying with
the condition for the cancelling of elementary waves,

$$l_{\alpha\lambda} = \frac{\lambda}{2\left(\dfrac{v}{V} - \cos\alpha\right)},$$

can be found for all values of angle $\alpha$, with the exception of

$$\theta = \arccos\frac{v}{V} = \arccos\frac{c}{nV}.$$

For the direction $\alpha = \theta$, the path difference of the elementary
waves emitted by any two points $A$ and $B$ of the path of the
charged particle (Fig. 34.3) equals zero. Thus

$$\Delta = DF = (v - V\cos\theta)\,\Delta t = 0.$$

Consequently, elementary waves, propagating in the direction
$\alpha = \theta$, reinforce one another upon interference, forming a
resultant radiation in this direction. This is Vavilov-Cherenkov
radiation. Light, emitted at each small portion of the path of
a charged particle, is propagated along the elements of a cone,
whose vertex $O$ (Fig. 34.4) is located on this portion and whose
axis coincides with the path of the particle. The elements make
the angle $\theta = \arccos(c/nV)$ with the axis. The light is polarized
so that vector $\mathbf{E}$ is normal to the surface of the cone, and
vector $\mathbf{H}$ is tangent to it.
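The interference argument of 34.6.2 and 34.6.3 can be checked numerically by adding up the elementary waves from many equally spaced points of the path. The Python sketch below is only an illustration under simplifying assumptions (equal amplitudes, a finite straight path, observation at infinity); the function names and the values $n = 1.33$ and $V = 0.99c$ are assumptions, not data from the handbook.

```python
import cmath
import math


def cherenkov_angle(v_over_c, n):
    """Return theta = arccos(c / (n V)) in radians, or None unless c/n < V < c."""
    if not (1.0 / n < v_over_c < 1.0):
        return None
    return math.acos(1.0 / (n * v_over_c))


def relative_amplitude(alpha, v_ratio, n_sources=2000, step=0.05):
    """Resultant amplitude of the elementary waves in direction alpha, divided by n_sources.

    v_ratio is v/V (phase velocity over particle velocity). Sources are spaced
    `step` wavelengths apart along the path, so neighbouring waves differ in
    phase by 2*pi*step*(v/V - cos(alpha)), following the path difference above.
    """
    dphi = 2.0 * math.pi * step * (v_ratio - math.cos(alpha))
    total = sum(cmath.exp(1j * k * dphi) for k in range(n_sources))
    return abs(total) / n_sources


N_MEDIUM = 1.33    # assumed refractive index (roughly that of water)
V_OVER_C = 0.99    # assumed particle velocity in units of c

theta = cherenkov_angle(V_OVER_C, N_MEDIUM)
print(f"theta = {math.degrees(theta):.2f} deg")
print("amplitude at alpha = theta:      ", relative_amplitude(theta, math.cos(theta)))
print("amplitude at alpha = theta + 0.2:", relative_amplitude(theta + 0.2, math.cos(theta)))
print("amplitude for V < v (v/V = 1.5): ", relative_amplitude(0.3, 1.5))
```

Only in the direction $\alpha = \theta$ do all the terms add in phase; in the other cases the normalized sum is close to zero for a long path.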


CHAPTER  35  POLARIZATION  OF  LIGHT 

35.1  Polarization  of  Light  in  Reflection  and  Refraction 
at  the  Interface  Between  Two  Dielectric  Media 

35.1.1 Light emitted by ordinary (nonlaser) sources is made up
of a set of a great many plane polarized wave trains, or packets
(32.1.3), whose electric vectors $\mathbf{E}$ vary along all possible
directions perpendicular to the ray (30.2.1). Light is said to be
natural, or unpolarized, if not one of the above-indicated
directions of vibration is predominant. In natural light, the
resultant strength $\mathbf{E}$ oscillates at each point of the field in a direction
that rapidly and randomly varies in a plane perpendicular to
the ray.

Light is said to be partially polarized if in it there is a
predominant direction of oscillation of vector $\mathbf{E}$. Partially polarized
light can be regarded as a collection (“mixture”) of natural and
linearly polarized light, simultaneously propagating in a single
direction.

35.1.2 The polarization of light is the separation of linearly
polarized light out of natural or partially polarized light. Special
devices, called polarizers, are used for this purpose. Their
principle of operation is based on the polarization of light upon
being reflected or refracted by the interface between two
dielectric media, as well as on the phenomena of birefringence (35.2.1)
and dichroism (35.2.10). The same devices can be utilized as
analyzers, i.e. devices to determine the kind and degree of
polarization of light.

Assume that linearly polarized light, whose electric vector $\mathbf{E}_p$ is
along the line p-p and oscillates with the amplitude $A_p$, falls
on the analyzer perpendicularly to the plane of Fig. 35.1.
Assume further that the electric vector $\mathbf{E}_a$ of the light passed by the
analyzer is along line a-a, which makes the angle $\alpha$ with line
p-p. The incident light can be represented in the form of two
waves, linearly polarized in mutually perpendicular planes
(31.1.8). The wave whose electric vector $\mathbf{E}_1$ oscillates along
the direction perpendicular to a-a, with the amplitude
$A_1 = A_p \sin\alpha$, cannot pass through the analyzer. But the other wave,
whose electric vector $\mathbf{E}_2$ oscillates along the direction of a-a,
with the amplitude $A_2 = A_p \cos\alpha$, passes completely through the
analyzer. Consequently, the amplitude of the light emerging from
the analyzer is

$$A_a = A_2 = A_p \cos\alpha.$$

The intensities $I_p$ and $I_a$ of the linearly polarized light incident
on and emerging from the analyzer, respectively, are related by
the Malus law:

$$I_a = I_p \cos^2\alpha.$$

The principal plane of a polarizer (or analyzer) is the plane of
polarization [plane of oscillation, according to previous terminology
(31.1.7)] of light passed through the polarizer (or analyzer).
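
The Malus law is easy to tabulate. The short Python sketch below evaluates $I_a = I_p \cos^2\alpha$ for a few angles; the function name `malus_intensity` and the unit incident intensity are assumptions made for illustration.

```python
import math


def malus_intensity(i_p, alpha):
    """Intensity passed by an ideal analyzer: I_a = I_p * cos(alpha)**2 (alpha in radians)."""
    return i_p * math.cos(alpha) ** 2


# Incident intensity taken as 1 in arbitrary units (assumption).
for degrees in (0, 30, 45, 60, 90):
    i_a = malus_intensity(1.0, math.radians(degrees))
    print(f"alpha = {degrees:2d} deg  I_a / I_p = {i_a:.4f}")
```

The amplitude falls as $\cos\alpha$ while the intensity falls as $\cos^2\alpha$, so at $\alpha = 60°$ the analyzer passes half the amplitude but only a quarter of the intensity.
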
35.1.3 In investigating the laws of light polarization as a result
of reflection and refraction of natural light, the light can be
conveniently dealt with as a combination of linearly polarized
waves, of equal intensities, of two types: s- and p-waves (31.5.4).
The reflection coefficient (31.5.6) of the s-waves ($R_s$) is always
greater than that ($R_p$) of the p-waves. Therefore, in contrast to
incident natural light, reflected and transmitted (refracted)
light is partially polarized. Oscillations of the electric field
vector $\mathbf{E}$ of the s-type (perpendicular to the plane of incidence)
predominate in reflected light, whereas oscillations of the p-type (in
the plane of incidence) predominate in transmitted light.
Brewster’s law states that reflected light is completely linearly
polarized at the angle of incidence $i = i_{Br}$, complying with the
condition $\tan i_{Br} = n_{21}$, where $n_{21}$ is the relative refractive
index of a medium reflecting light.

The angle $i_{Br}$ is called Brewster’s angle. When $i = i_{Br}$ the
reflected and refracted rays are perpendicular to each other and
the reflection coefficient of the p-wave is $R_p = 0$ (31.5.6).
Hence, only s-type waves are reflected. But their reflection
coefficient is considerably less than unity (about 0.15 for glass).
Thus, the transmitted light is only partially polarized.
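The figures quoted above can be reproduced with the standard Fresnel formulas for the reflection coefficients. These formulas are not written out in this section (the text refers to 31.5.6 for them), so the Python sketch below should be read as an illustration under that assumption; the function names and the value $n_{21} = 1.5$ for glass are also assumptions. The sketch requires $n_{21} > 1$, so that no total internal reflection occurs.

```python
import math


def brewster_angle(n21):
    """Brewster's angle from tan(i_Br) = n21, in radians."""
    return math.atan(n21)


def reflection_coefficients(i, n21):
    """Return (R_s, R_p) for the angle of incidence i (radians), assuming n21 > 1."""
    r = math.asin(math.sin(i) / n21)       # angle of refraction from Snell's law
    cos_i, cos_r = math.cos(i), math.cos(r)
    r_s = (cos_i - n21 * cos_r) / (cos_i + n21 * cos_r)   # amplitude ratio, s-wave
    r_p = (n21 * cos_i - cos_r) / (n21 * cos_i + cos_r)   # amplitude ratio, p-wave
    return r_s**2, r_p**2


N21 = 1.5   # assumed relative refractive index of glass
i_br = brewster_angle(N21)
r_s, r_p = reflection_coefficients(i_br, N21)
print(f"i_Br = {math.degrees(i_br):.2f} deg  R_s = {r_s:.4f}  R_p = {r_p:.2e}")
```

For $n_{21} = 1.5$ this gives $R_s \approx 0.148$ and $R_p$ equal to zero within rounding error, in agreement with the value of about 0.15 quoted for glass.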

35.1.4 Polarization of transmitted light can be enhanced by
subjecting the light to a number of consecutive reflections and
refractions. This is done in a stack consisting of several identical
and parallel plates of some transparent dielectric (for instance,
glass), set at the Brewster angle to the incident light beam.
If the number of plates in the stack is sufficiently large, then
the light passing through the stack is found to be practically
linearly polarized (p-type). In the absence of light absorption
in the stack, the intensities $I_s$ and $I_p$ of the reflected and
transmitted linearly polarized light are the same and equal to
one-half of the intensity $I_0$ of the incident natural light. Thus

$$I_s = I_p = \tfrac{1}{2} I_0.$$


35.1.5 According to the conceptions of classical electron theory,
the formation of the reflected wave is due to the secondary waves
emitted by the molecules, the oscillators of the light reflecting
medium (34.1.3). Oscillators (vibrating electric dipoles),
whose axis is perpendicular to the plane of incidence, correspond
to a wave of the s-type. These oscillators are shown in Fig. 35.2
by black dots on the refracted and reflected rays. As is evident
from the polar directional diagram (Fig. 31.4), such oscillators
should radiate intensively in all directions lying in the plane
of incidence, i.e. they should participate in the formation of
both the reflected and the refracted s-waves.

Oscillators, whose axis lies in the plane of incidence and is
perpendicular to the refracted ray (shown in Fig. 35.2 by
transverse dashes), correspond to a wave of the p-type. Oscillators do
not radiate along their axis (Fig. 31.4), and at $i = i_{Br}$ the
reflected ray is perpendicular to the refracted one, thereby being
parallel to the axes of these oscillators. Hence, at $i = i_{Br}$
these oscillators do not radiate in the direction of the reflected
ray, making no contribution to the reflected wave. Consequently,
the reflected light is completely linearly polarized (s-type
waves).

Fig. 35.2


35.2  Birefringence  (Double  Refraction)

35.2.1 Most crystals are optically anisotropic (30.1.6). Their
relative dielectric permittivity (dielectric constant) and
refractive index depend on the direction of the electric vector $\mathbf{E}$ of the
light wave. Observed in optically anisotropic crystals is the
phenomenon of double refraction, or birefringence, which consists
in the splitting of a light ray, falling on a surface of the crystal,
into two refracted rays. The birefringence of light in a crystal
of Iceland spar ($\mathrm{CaCO_3}$) is illustrated in Fig. 35.3.

35.2.2 The optic axis of a crystal is the direction in an optically
anisotropic crystal along which light propagates without
undergoing birefringence. It should be emphasized that the optic
axis of a crystal is not a single special straight line similar, for
instance, to the axis of symmetry of a body. It only characterizes
a certain direction in the crystal and can be passed through
any point of the crystal.

Fig. 35.3  Fig. 35.4

Depending upon their type of symmetry, optically anisotropic
crystals can be either uniaxial or biaxial, i.e. they can have
either one or two optic axes. An example of a uniaxial crystal is
Iceland spar, whose optic axis coincides in direction with the
diagonal $M_0N_0$ of the crystal (Fig. 35.3).

The principal plane (or principal section) of a uniaxial crystal
for a certain ray is the plane passing through this ray and
intersecting the optic axis.

35.2.3 In a uniaxial crystal, one of the rays formed in
birefringence (double refraction) obeys the ordinary laws of light
refraction (31.5.3). It lies in the plane of incidence and complies with
Snell’s law. It is therefore called the ordinary, or O ray. The
second ray is called the extraordinary or E ray because it
generally does not lie in the plane of incidence and does not obey
Snell’s law. For example, even in the case of normal incidence
of light on a plate cut from a uniaxial crystal, the extraordinary
ray is refracted (Fig. 35.4). Its refractive angle $r_e$ depends on
how the surface of the plate is oriented with respect to the optic
axis of the crystal. It may be equal to zero only in two cases:
(a) if the surface of the plate is perpendicular to the optic axis
(light propagates in the plate along the optic axis and is
thereby not subject to birefringence); (b) if the surface of the
plate is parallel to the optic axis (light propagates in the plate
perpendicular to the optic axis).

In a biaxial crystal both refracted rays behave like extraordinary
ones.

35.2.4 Birefringence indicates that a light wave incident on
an optically anisotropic crystal produces two waves, propagating
through the crystal, generally in different directions. In
a uniaxial crystal, they are called the ordinary and extraordinary
waves. The ordinary and extraordinary rays indicate the
directions of the Umov-Poynting vectors (31.2.3) of the
corresponding waves in the crystal, i.e. the direction of energy
transfer by these waves.

The ordinary and extraordinary waves are linearly polarized
(31.1.7).* In the ordinary wave, vector $\mathbf{E}$ is perpendicular to
the principal plane of the crystal for the ordinary ray. The
electric vector $\mathbf{E}$ of the extraordinary wave lies in the principal
plane of the crystal for the extraordinary ray. The directions
of vectors $\mathbf{E}$ in the ordinary and extraordinary waves are
conditionally shown in Fig. 35.4 by black dots on the ordinary
ray and transverse dashes on the extraordinary ray (it is
assumed that both rays, and optic axis MN intersecting them, lie
in the plane of the figure).

* Frequently, mention is made of linear polarization of the
ordinary and extraordinary rays, actually meaning the polarization
of the waves corresponding to them.

35.2.5 The ray velocity of the wave, or simply ray velocity, in an
optically anisotropic crystal is the velocity $\mathbf{v}$ of energy transfer
by the wave. In a uniaxial crystal, the velocity $v_o$ of the ordinary
ray is numerically equal in all directions: $v_o = c/n_o$, where $c$ is
the velocity of light in free space (or in vacuum), and $n_o = \mathrm{const}$
is the refractive index of the crystal for the ordinary ray.
Accordingly, the velocity $v_e$ of the extraordinary ray is numerically equal
to $v_e = c/n_e$, where $n_e$ is the refractive index of the crystal for
the extraordinary ray. The values of $n_e$ and $v_e$ depend on the
direction of the extraordinary ray with respect to the optic axis
of the crystal. For a ray propagating along the optic axis,
$n_e = n_o$ and $v_e = v_o$. The value of $n_e$ differs to the greatest
extent from that of $n_o$ for a direction perpendicular to the optic
axis: $n_e = n_{e0}$.

35.2.6 The ray surface of a wave in a crystal is the locus of the
heads of vectors $\mathbf{v}$ of the ray velocity of the wave that extend
from a certain point $O$ in the crystal in all possible directions.

Fig. 35.5

In a uniaxial crystal, the ray surface of the ordinary wave has
the shape of a sphere, whereas that of the extraordinary wave is
an ellipsoid of revolution about the optic axis MN passing
through point $O$. The ellipsoid and sphere contact each other
at the points of intersection with optic axis MN. When $n_e > n_o$,
the ellipsoid is inscribed in the sphere (Fig. 35.5a), whereas if
$n_e < n_o$ the ellipsoid is circumscribed about the sphere
(Fig. 35.5b). In the former case, the uniaxial crystal is said to
be optically positive; in the latter, optically negative.

35.2.7 Birefringence in a uniaxial crystal can be explained
and the directions of the ordinary and extraordinary rays can be
found by using the Huygens graphical method. Assume that
a plane unpolarized light wave is incident on the plane
surface ab of a uniaxial optically negative crystal (or a plate cut out
of this crystal) at the angle $i$ (Fig. 35.6). The optic axis MN
of the crystal, passing through point A of surface ab, lies in
the plane of the figure and makes the angle $\gamma$ with plane ab.

Fig. 35.6

At the instant $t$ of time being considered, the front AD of the
incident wave has reached point A on the surface of the crystal,
and this point becomes the source of two linearly polarized
elementary secondary waves, ordinary and extraordinary, in the
crystal. At the instant of time $t + \Delta t$, where $\Delta t$ is the time
required for the incident light to travel the distance DK, the
disturbance, propagating from point A in the form of the
ordinary elementary wave, reaches the points of a sphere of radius
$v_o \Delta t$ having its centre at point A. The disturbance, propagating
from point A in the form of the extraordinary elementary wave,
reaches points at the same time on the surface of the ellipsoid
of revolution that contacts the sphere of radius $v_o \Delta t$ at point L
of its intersection with optic axis MN. This ellipsoid is
geometrically similar to the ray surface of the extraordinary wave in
the crystal (35.2.5).

Planes $KC_o$ and $KC_e$, perpendicular to the plane of the figure
and tangent, respectively, to the sphere and the ellipsoid,
indicate, according to the Huygens principle (33.1.1), the
positions, at the instant of time $t + \Delta t$, of the fronts of the
ordinary and extraordinary waves that actually propagate in
a uniaxial crystal. Straight lines from point A to the points
of tangency B and F indicate the directions of the ordinary and
extraordinary rays. Both rays lie in the plane of incidence, but
the extraordinary ray is not orthogonal to the wave surface $KC_e$.
The ordinary and extraordinary waves are linearly polarized
in mutually perpendicular planes. The directions of the electric
vectors $\mathbf{E}_o$ and $\mathbf{E}_e$ in the ordinary and extraordinary waves are
shown in Fig. 35.6 by black dots and transverse dashes located
on the corresponding rays.

Note. If the optic axis MN of the crystal does not lie in the plane
of incidence of the light, then, in general, the extraordinary
ray also does not lie in the plane of incidence. Consequently,
the angle between the planes of polarization of the ordinary
and extraordinary waves differs slightly from a right angle.

35.2.8. The construction for the ordinary and extraordinary
rays in the case of normal incidence of light on the surface of
an optically negative uniaxial crystal is shown in Fig. 35.7.
Here ab is the position of the incident wavefront at the instant $t$
of time, and $C_oC_o'$ and $C_eC_e'$ are the positions of the fronts of the
ordinary and extraordinary waves in the crystal at the instant
of time $t + \Delta t$.

It is assumed that the optic axis MN lies in the plane of
incidence and makes the angle $\gamma$, differing from 0 and $\pi/2$, with
the refracting surface ab. It is evident from Fig. 35.7 that the
ordinary ray is an extension of the incident one and that the
extraordinary ray is refracted by the angle $r_e \neq 0$.

Considered in Fig. 35.8 is a case in which light is normally
incident on the plane surface ab of an optically negative
uniaxial crystal, whose optic axis MN is parallel to ab.
Here, as is evident from the construction, the extraordinary ray
is not refracted at the surface ab and coincides in direction with
the ordinary and incident rays. But the velocities of the ordinary
and extraordinary rays in the crystal in this direction differ and
are respectively equal to (35.2.5): $v_o = c/n_o$ and $v_e = c/n_{e0}$.
Therefore, the two rays (waves) travel the same
distance $d$ in the crystal with an optical-path difference
(32.3.2):

$$\Delta s = d\,(n_o - n_{e0}).$$

35.2.9. The paths of the rays in a polarizing prism are shown
in Fig. 35.9. This prism is cut out of a crystal of Iceland spar
so that its sides AB and CD are parallel to its optic axis MN.
Then the prism is cut apart along its diagonal plane AC and
glued together along these diagonal surfaces by a thin layer of
an optically isotropic transparent substance called Canada
balsam. The crystal of Iceland spar is uniaxial and optically
negative; its refractive indices (35.2.5) are: $n_o = 1.658$ and
$n_{e0} = 1.486$. The refractive index of Canada balsam is $n_{cb} = 1.550$,
i.e. Canada balsam is a medium less optically dense than the
material of the prism for the ordinary ray and more optically
dense than this material for the extraordinary ray. Light falls
on the prism normal to its side AB (ray S in Fig. 35.9). The
ordinary and extraordinary rays propagate in the prism without
being refracted until they reach the layer AC of Canada balsam.
The dimensions of the prism are selected so that the angle of
incidence $i$ of the ordinary ray on surface AC is greater than the
critical angle for total internal reflection (31.5.8). Therefore,
the ordinary wave is completely reflected by the layer of Canada
balsam (ray o in Fig. 35.9). The extraordinary wave passes freely
through the layer of Canada balsam and the second half of the
polarizing prism. Thus, the polarizing prism can be used as a
polarizer (35.1.2).

Fig. 35.9
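The condition on the angle of incidence can be made quantitative with the refractive indices quoted in 35.2.9. The Python sketch below computes the critical angle for the ordinary ray at the boundary with Canada balsam and confirms that no critical angle exists for the extraordinary ray; the function name `critical_angle` is an assumption, and the relation $\sin i_{cr} = n_{outside}/n_{inside}$ is the usual condition for total internal reflection referred to in (31.5.8).

```python
import math

N_O = 1.658     # ordinary index of Iceland spar (from 35.2.9)
N_E0 = 1.486    # extraordinary index perpendicular to the optic axis (from 35.2.9)
N_CB = 1.550    # index of Canada balsam (from 35.2.9)


def critical_angle(n_inside, n_outside):
    """Critical angle (radians) for total internal reflection, or None if it cannot occur."""
    if n_outside >= n_inside:
        return None   # the second medium is optically denser: no total reflection
    return math.asin(n_outside / n_inside)


i_cr = critical_angle(N_O, N_CB)
print(f"ordinary ray: critical angle = {math.degrees(i_cr):.1f} deg")
print("extraordinary ray:", critical_angle(N_E0, N_CB))
```

With these indices the ordinary ray is totally reflected for angles of incidence on AC above roughly 69.2°, which is the constraint that the choice of prism dimensions has to satisfy.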

35.2.10. All birefringent crystals absorb light to one or another
degree. This absorption is anisotropic: the absorption factor
(34.2.1) depends upon the orientation of the electric vector of
the light wave and on the direction of light propagation in the
crystal, as well as on the wavelength. This phenomenon is
called dichroism, or pleochroism, because it is manifested in the
different colouring of the crystals in different directions. An
example of a strongly dichroic crystal is tourmaline, a uniaxial
crystal in which the ordinary ray is absorbed much more
intensely than the extraordinary ray. Even more pronounced
dichroism is displayed by crystals of herapathite, which is used to
make thin films, called polaroids, that convert natural light into
linearly polarized light.


35.3  Interference  of  Polarized  Light 

35.3.1. The wave trains that make up natural light, whose planes
of polarization have all possible orientations with respect to the
ray, are incoherent because they correspond to the radiation of
various independent atoms of the light source. All these trains
participate in the formation of the ordinary and extraordinary
waves that propagate in a uniaxial crystal when natural light
falls on it. But the contribution of each separate train to these
two waves is, in general, not the same. It is greater to the wave
whose plane of polarization makes the smaller angle $\alpha$ with the
plane of polarization of the wave train. In other words, the
ordinary and extraordinary waves are generated mainly by
different wave trains included in the composition of natural light.
Consequently, the ordinary and extraordinary waves, propagated
in a uniaxial crystal on which natural light falls, are not coherent.

35.3.2. The ordinary and extraordinary waves, propagated in
a uniaxial crystal upon the incidence of linearly polarized light
[obtained from natural light by means, for example, of a
polarizing prism (35.2.9) or some other polarizer] are coherent with
each other. This is due to the fact that the planes of polarization
of all the wave trains included in the composition of the
incident light have the same orientation.

Assume that a parallel beam of light, after passing through
polarizer P (Fig. 35.10), is normally incident on the plane surface ab
of plane-parallel plate B, cut out of a uniaxial crystal parallel
to its optic axis MN (axis MN is parallel to plane ab). Shown
in Fig. 35.11 is vector $\mathbf{A}_i$ of the amplitude of the $i$th wave train.
This vector is oriented along line p-p, corresponding to the
direction of oscillation of the electric vector in the light emerging
from the polarizer. The contributions of the $i$th wave train to
the ordinary and extraordinary waves are characterized by the
amplitudes $A_{io} = A_i \sin\alpha$ and $A_{ie} = A_i \cos\alpha$, whose ratio
$A_{io}/A_{ie} = \tan\alpha$ is the same for all trains. In particular,
if $\alpha = \pi/4$, then $A_{io} = A_{ie}$, that is, the pairwise coherent trains,
polarized in mutually perpendicular planes, have the same
intensity.

Fig. 35.10

35.3.3. At the entrance to crystal plate B (35.3.2), the electric
vectors $\mathbf{E}_o$ and $\mathbf{E}_e$ of the ordinary and extraordinary waves
oscillate in phase and their vector sum is equal to the electric vector
of linearly polarized monochromatic incident light:
$\mathbf{E}_p = \mathbf{E}_o + \mathbf{E}_e$. In the plate the ordinary and extraordinary waves
propagate at different velocities (35.2.8). Therefore, at the exit
from the plate (of thickness $d$), the mutually perpendicular
electric vectors $\mathbf{E}_o'$ and $\mathbf{E}_e'$ of the ordinary and extraordinary waves
oscillate with a phase difference of

$$\Delta\varphi = \frac{2\pi\,\Delta s}{\lambda_0} = \frac{2\pi d}{\lambda_0}\,(n_o - n_{e0}),$$

where $\Delta s$ is the optical-path difference of these waves (35.2.8),
and $\lambda_0$ is the wavelength of light in vacuum. Consequently, as
a result of its passing through the plate, the light, in the general
case, becomes elliptically polarized (31.1.7): the head of vector
$\mathbf{E}' = \mathbf{E}_o' + \mathbf{E}_e'$ describes an ellipse lying in a plane
perpendicular to the beam. If $\alpha$ is the angle between the direction of
oscillation of vector $\mathbf{E}_p$ and the optic axis MN of the plate, then
amplitudes $A_o$ and $A_e$ of vectors $\mathbf{E}_o$ and $\mathbf{E}_e$ equal: $A_o = A_p \sin\alpha$
and $A_e = A_p \cos\alpha$, where $A_p$ is the amplitude of vector $\mathbf{E}_p$.
In the absence of light absorption in the plate, the amplitudes of
vectors $\mathbf{E}_o'$ and $\mathbf{E}_e'$ are also equal to $A_o$ and $A_e$.

35.3.4.  In  accordance  with  the  thickness  d  of  the  plate,  several 
special  cases  are  possible. 

(a) A quarter-wave plate is one whose thickness satisfies the relation
$d\,(n_o - n_{e0}) = \pm (m + 1/4)\,\lambda_0$, where $m = 0, 1, 2, \ldots$;
the plus sign corresponds to an optically negative crystal, and the
minus sign to an optically positive one (35.2.6). At the exit from such
a plate the oscillations of vectors $\mathbf{E}_o'$ and $\mathbf{E}_e'$
have a phase difference of $\pi/2$. If, in addition, $\alpha = \pi/4$,
the light emerging from the plate is circularly polarized (31.1.7).

(b) For a half-wave plate: $d\,(n_o - n_{e0}) = \pm (m + 1/2)\,\lambda_0$.
At the exit from such a plate, the oscillations of vectors
$\mathbf{E}_o'$ and $\mathbf{E}_e'$ have a phase difference of $\pi$.
Light emerging from the plate remains linearly polarized. But the
directions of oscillation of vectors $\mathbf{E}_p$ and $\mathbf{E}'$ of
the incident and transmitted light are symmetrical with respect to the
principal plane of the plate (Fig. 35.12).

(c) For a full-wave plate: $d\,(n_o - n_{e0}) = \pm m\lambda_0$. As a
result of passing through the plate, the light remains linearly
polarized in the same plane as the incident light.
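
The special cases above can be checked numerically. The following is a minimal sketch, not part of the handbook: the function names are illustrative, the extraordinary component is taken along the optic axis, and the quartz indices in the usage note are approximate textbook values assumed only for the example.

```python
import cmath
import math


def phase_difference(d, n_o, n_e0, lambda0):
    """Phase difference of 35.3.3: 2*pi*d*(n_o - n_e0)/lambda0."""
    return 2.0 * math.pi * d * (n_o - n_e0) / lambda0


def field_after_plate(alpha, dphi, a_p=1.0):
    """Complex amplitudes (E_e', E_o') at the exit from the plate.

    The extraordinary amplitude is a_p*cos(alpha), the ordinary one
    a_p*sin(alpha); the ordinary wave carries the extra phase dphi.
    Absorption in the plate is neglected.
    """
    e_e = a_p * math.cos(alpha)
    e_o = a_p * math.sin(alpha) * cmath.exp(1j * dphi)
    return e_e, e_o


def classify(alpha, dphi, tol=1e-9):
    """Return 'linear', 'circular' or 'elliptical' for the emerging light."""
    e_e, e_o = field_after_plate(alpha, dphi)
    cross = e_e.conjugate() * e_o
    if abs(cross.imag) < tol:          # components in phase or antiphase
        return "linear"
    if abs(abs(e_e) - abs(e_o)) < tol and abs(cross.real) < tol:
        return "circular"              # equal amplitudes, quadrature phase
    return "elliptical"
```

For instance, with assumed indices n_o = 1.5443 and n_e0 = 1.5534 (roughly those of quartz for yellow light, an optically positive crystal), a plate of thickness lambda0 / (4 * (n_e0 - n_o)) gives a phase difference of -pi/2, and `classify(math.pi / 4, dphi)` reports circular polarization.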

35.3.5.  Coherent  waves  emerging  from 
crystal  plate  B  (Fig.  35.10)  cannot 
display  interference  because  they  are 
polarized  in  mutually  perpendicular 
planes.  Therefore,  one  more  polar¬ 
izing  prism,  the  analyzer  A  (Fig.  35.13) 
is  mounted  beyond  plate  B.  The  ana¬ 
lyzer  separates  out  of  the  incident 
coherent  waves,  the  components  po¬ 
larized  in  one  plane.  It  thus  provides 
the  conditions  necessary  for  obtain¬ 
ing  the  interference  of  these  waves. 

The results of this interference depend upon the phase difference
$\Delta\varphi$ acquired by the ordinary and extraordinary waves in the
plate, on the ratio of the amplitudes of these waves and on the
angle $\beta$ between the principal planes of the analyzer and
polarizer (35.1.2).

For example, if the angle between the principal plane of the
polarizer and optic axis MN of the plate is $\alpha = \pi/4$, the
amplitudes and intensities of the ordinary and extraordinary waves
are the same. Assume in this case that monochromatic light of
the wavelength $\lambda_0$ in vacuum is incident on the plate. The two
following limiting cases may occur:

$$\Delta\varphi = \frac{2\pi d}{\lambda_0}\,(n_o - n_{e0}) =
\begin{cases} \pm 2m\pi \\ \pm (2m + 1)\,\pi \end{cases}$$

(where $m = 0, 1, 2, \ldots$).


In the first case, corresponding to a full-wave plate, the light
falling on the analyzer is linearly polarized in the principal
plane of the polarizer. Therefore, at $\beta = 0$ (the analyzer is
mounted parallel to the polarizer), the intensity $I_a$ of the light
passing through the analyzer is maximal, whereas at $\beta = \pi/2$
(the analyzer and polarizer are crossed), $I_a = 0$, i.e. at
$\beta = 0$ an interference maximum is observed, whereas at
$\beta = \pi/2$ a minimum is observed.

In the second case, corresponding to a half-wave plate, the light
falling on the analyzer is linearly polarized in a plane making
the angle $2\alpha = \pi/2$ with the principal plane of the polarizer.
Therefore, at $\beta = 0$ an interference minimum is observed,
whereas at $\beta = \pi/2$ a maximum is observed.
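
The two limiting cases can be reproduced with a short calculation. This is a sketch under stated assumptions: the analyzer is treated as an ideal projector onto its principal plane, absorption is neglected, the angle between the analyzer and the polarizer is measured in the same sense as the angle between the polarizer and the optic axis, and the function name is illustrative.

```python
import cmath
import math


def analyzer_intensity(alpha, beta, dphi, i_p=1.0):
    """Intensity behind the analyzer for a plate between two polarizers.

    alpha: angle between the polarizer plane and the optic axis of the plate
    beta:  angle between the principal planes of analyzer and polarizer
    dphi:  phase difference acquired in the plate
    i_p:   intensity of the light leaving the polarizer
    """
    # Project the extraordinary (cos) and ordinary (sin) components
    # onto the analyzer direction, which makes the angle alpha + beta
    # with the optic axis.
    amplitude = (math.cos(alpha) * math.cos(alpha + beta)
                 + math.sin(alpha) * math.sin(alpha + beta)
                 * cmath.exp(1j * dphi))
    return i_p * abs(amplitude) ** 2
```

With alpha = pi/4, a full-wave plate (dphi = 2*pi) gives the full intensity for parallel polarizers and none for crossed ones, while a half-wave plate (dphi = pi) gives the opposite result.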

If linearly polarized white light is incident on plate B (Fig. 35.13),
then in observing the light through the analyzer, the plate is
seen to be coloured. Upon rotating the analyzer about the beam,
i.e. in varying angle $\beta$, the colouring changes. This is due to
the fact that the value of the phase difference $\Delta\varphi$,
determining the result of interference, depends upon the wavelength of
the light. A plate whose thickness $d$ is not the same at various
places is seen in white light to be oddly coloured, with each coloured
interference line (isochromate) passing through points of equal
thickness $d$. A similar pattern is observed in a plate which is of
equal thickness at all points, but in which the value of the difference
$(n_o - n_{e0})$ varies. In this case, each isochromate passes through
points of the plate corresponding to the same value of $(n_o - n_{e0})$.

Fig. 35.13



35.4  Artificial  Optical  Anisotropy 

35.4.1 An optically isotropic transparent body becomes anisotropic
if it is subjected to mechanical deformation. This phenomenon is
sometimes called photoelasticity. Upon linear (uniaxial) tension or
compression of an isotropic body along the OX axis, it acquires the
optical properties of a uniaxial crystal (35.2.2), whose optic axis is
parallel to OX. The difference between the refractive indices for the
ordinary ($n_o$) and extraordinary ($n_{e0}$) rays, in the direction
perpendicular to axis OX, is proportional to the normal stress
$\sigma$ (40.3.3). Thus

$$n_o - n_{e0} = k\sigma,$$

where $k$ is the proportionality factor depending upon the properties
of the body.

35.4.2  The  electrooptical  Kerr  effect  is  the  appearance  of  optical 
anisotropy  in  a  transparent  isotropic  solid,  liquid  or  gaseous 
dielectric  upon  being  placed  in  an  external  electric  field.  By 
the  action  of  the  uniform  electric  field,  the  dielectric  is  polar¬ 
ized  and  acquires  the  optical  properties  of  a  uniaxial  crystal 
whose optic axis coincides in direction with the field strength
vector $\mathbf{E}$. The difference between the refractive indices of a
polarized dielectric for extraordinary and ordinary rays of
monochromatic light, propagating perpendicular to the direction of
vector $\mathbf{E}$, satisfies the Kerr law:

$$n_{e0} - n_o = B\lambda_0 E^2,$$

where $\lambda_0$ is the wavelength of the light in vacuum, and $B$ is
the Kerr constant. The value of $B$ depends upon the nature of the
substance, wavelength $\lambda_0$ and the temperature, decreasing, as
a rule, as the temperature is raised. The sign of the difference
$(n_{e0} - n_o)$ is independent of the direction of the field. For the
majority of substances $B > 0$, so that they are similar with
respect to their optical properties in a uniform electric field
to optically positive uniaxial crystals (35.2.6).
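
Combining the Kerr law with the phase difference of 35.3.3 for a path of length l gives a phase difference of magnitude 2*pi*B*l*E^2. The sketch below illustrates this; the function names are hypothetical, and the Kerr constant in the usage note is an assumed order-of-magnitude value for a strongly polar liquid, not a figure taken from the handbook.

```python
import math


def kerr_birefringence(b_kerr, lambda0, e_field):
    """n_e0 - n_o from the Kerr law: B * lambda0 * E**2."""
    return b_kerr * lambda0 * e_field ** 2


def kerr_phase_difference(b_kerr, path_length, e_field):
    """Magnitude of the phase difference over the path: 2*pi*B*l*E**2.

    Follows from 2*pi*l*(n_e0 - n_o)/lambda0 with the Kerr law inserted;
    the wavelength cancels out.
    """
    return 2.0 * math.pi * b_kerr * path_length * e_field ** 2


def half_wave_field(b_kerr, path_length):
    """Field strength at which the phase difference reaches pi."""
    return 1.0 / math.sqrt(2.0 * b_kerr * path_length)
```

For an assumed B = 2.2e-12 m/V^2 and a 5 cm path, `half_wave_field` returns a field of roughly 2e6 V/m, at which the cell acts as a half-wave plate.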

35.4.3  The  Cotton-Mouton  effect  is  the  appearance  of  optical 
anisotropy  in  certain  isotropic  substances  (liquids,  glasses  and 
colloids)  when  they  are  placed  in  a  strong  external  magnetic 
field.  In  a  uniform  magnetic  field,  the  substance  acquires  the 
optical  properties  of  a  uniaxial  crystal,  whose  optic  axis  coin¬ 
cides  in  direction  with  the  magnetic  field  strength  vector  H.  The 
difference  between  the  refractive  indices  of  the  substance  for 
extraordinary and ordinary rays of monochromatic light, propagating
in the direction perpendicular to vector $\mathbf{H}$, is proportional
to $H^2$. Thus

$$n_{e0} - n_o = C\lambda_0 H^2,$$

where $C$ is the Cotton-Mouton constant, and $\lambda_0$ is the
wavelength of light in vacuum. The value of constant $C$ depends upon
the nature of the substance, wavelength $\lambda_0$ and the temperature.


35.5  Rotation  of  the  Plane  of  Polarization 

35.5.1 As linearly polarized light passes through certain substances
that are said to be optically active, the plane of polarization
of the light (31.1.7) rotates about the direction of the ray.
Optical  activity  is  displayed  by  certain  crystals  (quartz,  cin¬ 
nabar,  etc.),  pure  liquids  and  solutions  (turpentine,  a  solution 
of  sugar  in  water,  etc.).  All  substances  that  are  optically  active 
in  the  liquid  state,  possess  the  same  property  in  the  crystalline 
state  as  well.  But  some  substances  that  are  optically  active  in 
the  crystalline  state,  lose  this  activity  in  the  liquid  state.  Conse¬ 
quently,  optical  activity  can  be  due  either  to  the  structure  of  the 
molecules  of  the  substance  themselves  or  to  the  arrangement 
of  the  particles  in  the  crystal  lattice. 

35.5.2 In optically active crystals and pure liquids, the angle
$\varphi$ of rotation of the plane of polarization of light is
proportional to the thickness $l$ of the layer of the substance through
which light passes. Thus $\varphi = \alpha l$. The proportionality
factor $\alpha$ is called the rotatory power, or specific rotation. The
rotatory power depends upon the nature of the substance, the
temperature and the wavelength $\lambda_0$ of the light in vacuum. The
dependence of $\alpha$ on $\lambda_0$ is called the rotary, or rotatory,
dispersion. Far from the band of light absorption by the substance,
rotary dispersion obeys Biot's law: $\alpha \propto \lambda_0^{-2}$.

35.5.3  The  majority  of  optically  active  crystals  exist  in  two 
versions.  As  light  passes  through  a  crystal  of  one  version,  said 
to  be  dextrorotatory  or  right-handed,  the  plane  of  polarization 
is  rotated  to  the  right,  i.e.  clockwise  (for  an  observer  looking 
toward  the  oncoming  beam).  In  the  passing  of  light  through  a 
crystal  of  the  other  version,  said  to  be  levorotatory  or  left-handed, 
the  plane  of  polarization  is  rotated  to  the  left  (counterclockwise). 
The  values  of  the  rotatory  power  for  the  two  versions  of  the  same 
optically  active  crystal  differ  only  in  sign. 

35.5.4 The angle of rotation of the plane of polarization of light
in travelling a path of length $l$ in an optically active solution is

$$\varphi = [\alpha]\,c\,l = [\alpha]\,D K l.$$

Here $c$ is the volume-mass concentration of the optically active
substance in the solution (in kg/m3), $D$ is the density of the
solution, and $K = c/D$ is the weight-part concentration, i.e.
the ratio of the mass of the optically active substance to the
mass of the whole solution. The proportionality factor $[\alpha]$ is
called the rotatory power, or specific rotation, of the solution.
The value of $[\alpha]$ depends on the nature of the optically active
substance and of the solvent, the wavelength of the light and
the temperature.
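
The relation between rotation angle and concentration is the basis of measuring the concentration of an optically active solution. A minimal sketch follows; the function names are illustrative and the specific rotation used in the usage note (0.665 deg·m2/kg, roughly that of cane sugar in water for yellow light) is an assumed value.

```python
def rotation_angle(specific_rotation, concentration, path_length):
    """Angle of rotation: [alpha] * c * l."""
    return specific_rotation * concentration * path_length


def concentration_from_rotation(phi, specific_rotation, path_length):
    """Invert the same relation to find the concentration c."""
    return phi / (specific_rotation * path_length)


def weight_part_concentration(concentration, density):
    """K = c / D, the mass fraction of the optically active substance."""
    return concentration / density
```

With the assumed specific rotation, a 100 kg/m3 solution in a 0.2 m tube rotates the plane of polarization by about 13.3 degrees, and `concentration_from_rotation(13.3, 0.665, 0.2)` recovers the concentration.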

35.5.5  When  subjected  to  the  action  of  an  external  magnetic 
field,  an  optically  inactive  medium  acquires  the  capacity  to 
rotate  the  plane  of  polarization  of  light  propagating  along  the 
direction of the field. This phenomenon is known as the Faraday
effect, or magnetic rotation of the plane of polarization of light.
The angle $\varphi$ of rotation of the plane of polarization is
proportional to the path length $l$ of the light in the substance and
to the strength $H$ of the magnetic field. Thus $\varphi = VlH$. The
proportionality factor $V$ is called the Verdet constant. It depends
upon the nature of the substance and the wavelength of the light.

The  direction  of  magnetic  rotation  of  the  plane  of  polarization 
(for  an  observer  looking  along  the  magnetic  field)  is  the  same 
whether  the  light  propagates  along  the  direction  of  vector  H 
or  in  the  opposite  direction.  In  this  feature  the  Faraday  effect 
differs  from  the  rotation  of  the  plane  of  polarization  of  light 
in  natural  optically  active  media. 
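
The difference between magnetic and natural rotation shows up when the beam is reflected back through the same layer. The sketch below is only a bookkeeping illustration under the assumption that all angles are referred to one fixed observer looking along the field; the function name and the numerical values are hypothetical.

```python
def net_rotation(per_pass, directions, magnetic):
    """Total rotation seen by a fixed observer after several passes.

    per_pass:   rotation acquired in one pass (V*l*H or alpha*l)
    directions: +1 for a pass along the field (or axis), -1 against it
    magnetic:   True for the Faraday effect, False for natural activity
    """
    total = 0.0
    for direction in directions:
        # Magnetic rotation keeps its sense relative to the field;
        # natural rotation keeps its sense relative to the beam, so it
        # changes sign for the fixed observer when the beam is reversed.
        total += per_pass if magnetic else per_pass * direction
    return total
```

For a forward and a return pass, `net_rotation(0.1, [+1, -1], True)` doubles the angle, whereas `net_rotation(0.1, [+1, -1], False)` gives zero.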


CHAPTER  36  THERMAL  RADIATION 


36.1  Thermal  Radiation.  Kirchhoff's  Law 

36.1.1  All  bodies  emit  electromagnetic  waves  to  one  or  another 
degree.  Highly  heated  bodies,  for  instance,  glow,  whereas  at 
ordinary  temperatures  they  are  the  source  of  only  invisible 
infrared  radiation. 

Electromagnetic  radiation,  emitted  by  a  substance  and  produced 
at  the  expense  of  its  internal  energy  (9.1.2)  is  called  thermal, 
or  heat,  radiation.  It  depends  only  on  the  temperature  and  the 
optical properties of the radiating bodies. If the energy consumption
of a body on thermal radiation is not replenished by supplying
more heat to the body, its temperature gradually drops
and the thermal radiation is reduced.


Radiative heat exchange (heat exchange through radiation) is the
spontaneous  process  of  transferring  energy  in  the  form  of  heat 
from  a  more  heated  body  to  a  less  heated  one.  It  is  accomplished 
by  thermal  radiation  and  the  absorption  of  electromagnetic 
waves  by  these  bodies. 

36.1.2 Thermal radiation is the only kind that can be in a state
of thermodynamic equilibrium (8.3.3) with matter. In equilibrium,
the energy consumption on thermal radiation is compensated for by the
absorption of the same amount of energy by the body owing to the
radiation falling on it. Radiative (radiation) equilibrium is set up
in an adiabatically closed system (i.e. one that does not exchange
heat with the external medium), all bodies of which are at the same
temperature.

It follows from the second law of thermodynamics (11.3.2) that
radiative equilibrium is independent of the material of the bodies
that make up the closed thermodynamically equilibrium system. The
volume density of the energy of radiation equilibrium and its
frequency distribution are universal functions of the temperature. As
a matter of fact, if this were not so, it would be possible to take
two adiabatically closed systems, A and B (Fig. 36.1), at the same
temperature $T_A = T_B = T$, and accomplish radiative heat exchange
between them by making a small hole in the heatproof wall separating
them. If the volume density of energy $E_d^A$ of system A is greater
than that $E_d^B$ of system B, i.e. $E_d^A > E_d^B$, then, owing to
heat exchange between the systems, the radiant energy in system A and
its temperature should decrease and the radiant energy in system B
and its temperature should increase until the values of the volume
energy densities in systems A and B become equal: $E_d^A = E_d^B$.
But, in this case, $T_B' > T_A'$, so that the process being considered
contradicts the second statement of the second law of thermodynamics
(11.3.2). Consequently, $E_d^A$ cannot be greater than $E_d^B$. In
exactly the same way, $E_d^B$ cannot be greater than $E_d^A$, i.e.
$E_d^A = E_d^B = E_d(T)$ is a universal function of the temperature.

Fig. 36.1

36.1.3 Serving as the spectral characteristic of radiative equilibrium
is the spectral density of the volume energy density of this
radiation. Thus

$$\rho(\nu, T) = \frac{dE_d}{d\nu},$$

where $dE_d$ is the energy of radiative equilibrium with frequencies
from $\nu$ to $\nu + d\nu$ enclosed within unit volume of the radiation
field. The volume energy density of this field is

$$E_d = \int_0^\infty \rho(\nu, T)\,d\nu.$$

Radiation equilibrium is isotropic, i.e. it is not polarized and
all directions of its propagation are equally probable. The energy
$dE$ of radiation equilibrium in vacuum at frequencies from
$\nu$ to $\nu + d\nu$ incident in unit time on unit area of the surface
of each of the bodies of a thermodynamically equilibrium system
is equal to

$$dE = \frac{c}{4}\,\rho(\nu, T)\,d\nu,$$

where $c$ is the velocity of light in free space (or in a vacuum).

36.1.4 The radiant exitance* (total emissive power) of a body is
the physical quantity $M_e$, numerically equal to the energy of
electromagnetic waves of all possible frequencies (or wavelengths)
from 0 to $\infty$** radiated in unit time from unit area of a body's
surface.

* Formerly called the radiant emittance.

** It is practically sufficient to be restricted to the range of
frequencies and wavelengths of optical radiation (31.4.3).

The emissive power (spectral density of radiant exitance) of a body
is the physical quantity numerically equal to the ratio of the
energy $dE$ radiated in unit time from unit area of the surface of
the body by electromagnetic waves in a narrow range of frequencies
from $\nu$ to $\nu + d\nu$ (or wavelengths in vacuum from $\lambda$ to
$\lambda + d\lambda$), to the width of this range:

$$r_\nu = \frac{dE}{d\nu} \quad\text{or}\quad r_\lambda = \frac{dE}{d\lambda}, \qquad r_\lambda = \frac{c}{\lambda^2}\,r_\nu,$$

where $c$ is the velocity of light in vacuum. The values of $r_\nu$
(or $r_\lambda$) depend upon the frequency (or wavelength), temperature,
chemical composition of the body and the condition of its surface.

The radiant exitance of a body is related to $r_\nu$ and $r_\lambda$ by
the equations

$$M_e = \int_0^\infty r_\nu\,d\nu = \int_0^\infty r_\lambda\,d\lambda.$$


36.1.5 The absorptance (monochromatic absorption coefficient)
of a body is the dimensionless quantity $a_\nu$ which indicates what
fraction of the energy of electromagnetic waves of frequencies
from $\nu$ to $\nu + d\nu$, incident on the surface of the body, is
absorbed by it. Thus

$$a_\nu = \frac{dE_{abs}}{dE_{inc}}.$$

The value of $a_\nu$ depends upon the frequency, temperature,
chemical composition of the body and the condition of its surface.

A blackbody (denoted by a superscript asterisk) is one that completely
absorbs all incident radiation, regardless of its direction,
spectral composition and polarization, without reflecting
or transmitting any: $a_\nu^* = 1$. Though a blackbody is a theoretical
ideal, it can be nearly realized in practice by an almost complete
enclosure with a small opening (Fig. 36.2). Light entering
the enclosure through hole O undergoes repeated reflection from
the walls. In the process, the energy of the incident light is
practically completely absorbed by the walls of the enclosure
regardless of the material of which they are made. The emissive power
of a blackbody is denoted in the following by $r_\nu^*$ (or
$r_\lambda^*$) and its radiant exitance by $M_e^*$.

A greybody is one whose absorptance is less than unity and is
independent of the frequency (or wavelength) of the light, the
directions of its propagation and polarization:

$$\frac{da_\nu^{gr}}{d\nu} = 0 \quad\text{and}\quad a_\nu^{gr} = a^{gr}.$$



Fig. 36.2

36.1.6 According to the principle of detailed balance any microscopic
process in an equilibrium system should proceed at the same rate as
the reverse process. This principle of statistical physics enables a
relation to be found between the emissive power $r_\nu$ and the
absorptance $a_\nu$ of any opaque body. Assume that the body is part
of a thermodynamically equilibrium system at the temperature $T$. The
energy radiated per unit time from unit area of the surface of the
body being considered in the frequency range from $\nu$ to
$\nu + d\nu$ is $dE_{rad} = r_\nu\,d\nu$. During the same time, the
same portion of the surface of the body absorbs a part of the energy
of the radiation equilibrium (36.1.3) incident on this surface. It is
equal to $dE_{abs} = a_\nu\,(c/4)\,\rho(\nu, T)\,d\nu$. Since,
according to the principle of detailed balance, $dE_{rad} = dE_{abs}$,

$$\frac{r_\nu}{a_\nu} = \frac{c}{4}\,\rho(\nu, T).$$

This equation expresses Kirchhoff's law (for radiation), which
states that the ratio of the emissive power of a body to its
absorptance does not depend upon the nature of the body and is
equal to the emissive power $r_\nu^*$ of a blackbody at the same
temperature and frequency values.

The dependence of $r_\nu^*$ on $\nu$ and $T$ is called Kirchhoff's
function:

$$r_\nu^* = f(\nu, T) = \frac{c}{4}\,\rho(\nu, T).$$

36.1.7 It follows from Kirchhoff's law that the radiant exitance
(36.1.4) of a body is equal to

$$M_e = \int_0^\infty a_\nu r_\nu^*\,d\nu.$$

In particular, the radiant exitance of a greybody is

$$M_e^{gr} = a^{gr} M_e^*,$$

where $M_e^*$ is the radiant exitance of a blackbody at the same
temperature. For a nongrey body

$$M_e = \beta M_e^*,$$

where $\beta$ is the total degree of blackness of the body. It depends
on the material of the body, the condition of its surface and the
temperature. For all bodies except a blackbody, $\beta < 1$.

36.1.8 Radiation equilibrium at the temperature $T$ is identical
to thermal radiation of a blackbody at the same temperature.
Therefore, it is often called blackbody radiation. The relation
between the radiant exitance of a blackbody and the volume density
of the energy of blackbody radiation is of the form

$$M_e^* = \int_0^\infty r_\nu^*\,d\nu = \frac{c}{4}\,E_d = \frac{c}{4}\int_0^\infty \rho(\nu, T)\,d\nu.$$



36.2  Stefan-Boltzmann  and  Wien  Laws 

36.2.1 The Stefan-Boltzmann law states that the radiant exitance
of a blackbody is proportional to the fourth power of its
absolute temperature. Thus

$$M_e^* = \sigma T^4,$$

where $\sigma = 5.67 \times 10^{-8}$ W·m^-2·K^-4 is the Stefan-Boltzmann
constant. This law can be derived theoretically by considering
the radiative equilibrium in an enclosed cavity by thermodynamic
methods.
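
As a numerical illustration of the fourth-power dependence, the sketch below evaluates the radiant exitance of a blackbody and of a body with a given total degree of blackness (36.1.7). The function names and the sample temperatures are illustrative assumptions.

```python
SIGMA = 5.67e-8  # Stefan-Boltzmann constant, W m^-2 K^-4


def blackbody_exitance(temperature):
    """Radiant exitance of a blackbody: sigma * T**4 (T in kelvins)."""
    return SIGMA * temperature ** 4


def body_exitance(temperature, blackness):
    """Radiant exitance of a real body: blackness * sigma * T**4."""
    return blackness * blackbody_exitance(temperature)
```

At 1000 K a blackbody radiates about 5.67e4 W from each square metre of its surface; doubling the absolute temperature increases this sixteenfold.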

36.2.2 The dependence of the emissive power $r_\nu^*$ of a blackbody
on the frequency $\nu$ for several constant values of the temperature
is shown in Fig. 36.3. In the region of low frequencies,
$r_\nu^* \propto \nu^2 T$, whereas for the region of high frequencies
(the right-hand branch of the curves far away from the maxima)
$r_\nu^* \propto \nu^3 \exp(-a_1 \nu/T)$, where $a_1$ is a constant
factor.

The radiant energy of a blackbody is nonuniformly distributed
in its spectrum. A blackbody displays almost no radiation in
the regions of very high and very low frequencies. As the temperature
is raised, the maximum of $r_\nu^*$ is shifted toward the
higher frequencies in accordance with the law: $\nu_m = b_1 T$, where
$\nu_m$ is the frequency corresponding to maximum $r_\nu^*$ at the
temperature $T$, and $b_1$ is a constant factor.

Fig. 36.3    Fig. 36.4

The dependence of the emissive power of a blackbody
$r_\lambda^* = c\,r_\nu^*/\lambda^2$ (36.1.4) on the wavelength
$\lambda$ is shown in Fig. 36.4. As the temperature of the blackbody is
raised, the maximum of $r_\lambda^*$ is shifted toward the shorter
wavelengths in accordance with the Wien displacement law:

$$\lambda_m = \frac{b}{T},$$

where $b = 2.9 \times 10^{-3}$ m·K is the Wien constant.

36.2.3 All attempts to substantiate theoretically, within the
framework of classical physics, the experimentally found form of
Kirchhoff's function $r_\nu^* = f(\nu, T)$, shown in Fig. 36.3, failed.
Using thermodynamic methods, the Wien formula was found:

$$r_\nu^* = \nu^3\,\varphi\!\left(\frac{\nu}{T}\right),$$

where $\varphi(\nu/T)$ is an unknown function of $\nu/T$.

Rayleigh and Jeans applied the laws of electrodynamics and
equipartition of energy in an equilibrium system (10.6.4) to
obtain the Rayleigh-Jeans law:

$$r_\nu^* = \frac{2\pi\nu^2}{c^2}\,kT,$$

where $k$ is Boltzmann's constant (8.4.5).

The Rayleigh-Jeans law agreed with experimental data only
in the region of low frequencies. Moreover, it leads to the absurd
conclusion that at any temperature the radiant exitance
$M_e^*$ of a blackbody and the volume density of energy $E_d$ of
radiation equilibrium are infinitely large. This conclusion, arrived
at by classical physics in the problem of explaining the
spectral distribution of radiation equilibrium, has been aptly
named the "ultraviolet catastrophe".


36.3  Planck's  Formula 

36.3.1 The volume density of energy of radiation equilibrium
(blackbody radiation) in a closed cavity, as well as the energy
distribution of this radiation in frequency, are independent of
the material of the cavity walls and are fully determined by the
temperature. Therefore, an infinite system of harmonic oscillators
with all possible natural frequencies can be taken as a theoretical
model of a blackbody. Each of these oscillators corresponds
to a monochromatic component of blackbody radiation. Let
$\langle E_\nu \rangle$ be the average energy of an oscillator with the
natural frequency $\nu$. Then, according to calculations, the emissive
power of a blackbody is

$$r_\nu^* = \frac{2\pi\nu^2}{c^2}\,\langle E_\nu \rangle.$$

If $kT$ is taken for $\langle E_\nu \rangle$, as follows from the
classical law of the equipartition of energy (10.6.4)*, then the
preceding expression for $r_\nu^*$ coincides with the Rayleigh-Jeans
law (36.2.3).

36.3.2 Planck was able to find the correct expression for the
average energy $\langle E_\nu \rangle$ of an oscillator and for the
Kirchhoff function by introducing the quantum hypothesis, which was
entirely alien to classical physics. It is assumed in classical
physics that the energy of any system can vary in a continuous manner,
i.e. that it can take any arbitrarily close consecutive values.
According to Planck's quantum hypothesis, the energy
$E_\nu$ of an oscillator can have only definite discrete (quantized)
values, equal to a whole number of elementary portions, called
energy quanta $E_{\nu 0}$. Thus

$$E_\nu = nE_{\nu 0}, \quad\text{where } n = 0, 1, 2, \ldots.$$

If it is assumed that the oscillator distribution in possible
discrete energy states is described by the Boltzmann distribution
law (10.4.2), the probability $p_n$ of finding the oscillator in a
state with the energy $nE_{\nu 0}$ at the temperature $T$ is equal to

$$p_n = C \exp[-nE_{\nu 0}/kT].$$

Here $k$ is the Boltzmann constant (8.4.5) and $C$ is a constant
factor determined from the normalization condition:

$$\sum_{n=0}^{\infty} p_n = 1, \quad\text{i.e.}\quad C = \frac{1}{\sum_{n=0}^{\infty} \exp[-nE_{\nu 0}/kT]}.$$


* One oscillatory degree of freedom of an oscillator accounts for
twice as much energy, on an average, as one degree of translational
or rotary motion, because the oscillator has not only
kinetic, but potential energy as well. On an average, this potential
energy is equal to the kinetic energy.

The average value of the oscillator energy is

$$\langle E_\nu \rangle = \sum_{n=0}^{\infty} p_n\,nE_{\nu 0} = E_{\nu 0}\,\frac{\sum_{n=0}^{\infty} n \exp[-nE_{\nu 0}/kT]}{\sum_{n=0}^{\infty} \exp[-nE_{\nu 0}/kT]},$$

from which

$$\langle E_\nu \rangle = E_{\nu 0}\,\frac{\sum_{n=0}^{\infty} n \exp(-n\xi)}{\sum_{n=0}^{\infty} \exp(-n\xi)} = -E_{\nu 0}\,\frac{d}{d\xi} \ln \sum_{n=0}^{\infty} \exp(-n\xi),$$

where $\xi = E_{\nu 0}/kT$. Since

$$\sum_{n=0}^{\infty} \exp(-n\xi) = \frac{1}{1 - \exp(-\xi)}$$

and

$$-\ln \sum_{n=0}^{\infty} \exp(-n\xi) = \ln\,(1 - e^{-\xi}),$$

then

$$\langle E_\nu \rangle = \frac{E_{\nu 0}}{e^{E_{\nu 0}/kT} - 1} \quad\text{and}\quad r_\nu^* = \frac{2\pi\nu^2}{c^2}\,\frac{E_{\nu 0}}{e^{E_{\nu 0}/kT} - 1}.$$

Comparing this expression for $r_\nu^*$ with the Wien formula (36.2.3),
it follows that a quantum of energy is equal to

$$E_{\nu 0} = h\nu,$$

where $h$ is a universal constant, called Planck's constant
(Appendix II).

Planck's radiation formula for the emissive power $r_\nu^*$ of a
blackbody is

$$r_\nu^* = \frac{2\pi\nu^2}{c^2}\,\frac{h\nu}{e^{h\nu/kT} - 1}.$$

Accordingly, the spectral density of radiation equilibrium (36.1.3)
is equal to

$$\rho(\nu, T) = \frac{8\pi\nu^2}{c^3}\,\frac{h\nu}{e^{h\nu/kT} - 1}.$$



36.3.3 At low frequencies ($h\nu \ll kT$), $\exp(h\nu/kT) - 1 \approx
h\nu/kT$ and the Planck radiation formula coincides with the
Rayleigh-Jeans law (36.2.3).
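
The low-frequency limit can be checked numerically by comparing the two formulas. The sketch below is an illustration only; the function names are hypothetical and the constants are rounded values.

```python
import math

H = 6.62e-34   # Planck constant, J s
K = 1.38e-23   # Boltzmann constant, J/K
C = 3.0e8      # velocity of light in vacuum, m/s


def planck_emissive_power(nu, temperature):
    """Planck's formula for the emissive power of a blackbody."""
    x = H * nu / (K * temperature)
    # expm1(x) = exp(x) - 1, accurate also for very small x
    return 2.0 * math.pi * nu ** 2 / C ** 2 * H * nu / math.expm1(x)


def rayleigh_jeans_emissive_power(nu, temperature):
    """Rayleigh-Jeans law: (2*pi*nu**2/c**2) * k*T."""
    return 2.0 * math.pi * nu ** 2 / C ** 2 * K * temperature
```

At 300 K and a frequency of 1e9 Hz the two expressions differ by less than 0.1 per cent, whereas at 1e15 Hz the Planck value is smaller than the Rayleigh-Jeans value by many orders of magnitude.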

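Following from Planck's radiation formula is the Stefan-Boltzmann
law (36.2.1):

$$M_e^* = \int_0^\infty r_\nu^*\,d\nu = \frac{2\pi h}{c^2} \int_0^\infty \frac{\nu^3\,d\nu}{e^{h\nu/kT} - 1} = \frac{2\pi k^4 T^4}{c^2 h^3} \int_0^\infty \frac{\eta^3\,d\eta}{e^{\eta} - 1} = \frac{2\pi^5 k^4}{15 c^2 h^3}\,T^4.$$

The Planck constant is related to the Stefan-Boltzmann constant
$\sigma$ (36.2.1) as follows:

$$h = \sqrt[3]{\frac{2\pi^5 k^4}{15 c^2 \sigma}} = 6.62 \times 10^{-34}\ \mathrm{J \cdot s}.$$
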
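
The relation between the constants can be verified directly. The following sketch evaluates the dimensionless integral by the midpoint rule and computes each constant from the other; the function names, the integration limits and the rounded constants in the usage note are assumptions made for the example.

```python
import math


def bose_integral(upper=60.0, steps=200000):
    """Midpoint-rule estimate of the integral of eta**3/(exp(eta)-1)
    from 0 to infinity; the tail beyond `upper` is negligible."""
    width = upper / steps
    total = 0.0
    for i in range(steps):
        eta = (i + 0.5) * width
        total += eta ** 3 / math.expm1(eta)
    return total * width


def stefan_boltzmann_constant(h, k, c):
    """sigma = 2*pi**5*k**4 / (15*c**2*h**3)."""
    return 2.0 * math.pi ** 5 * k ** 4 / (15.0 * c ** 2 * h ** 3)


def planck_constant(sigma, k, c):
    """h as the cube root of 2*pi**5*k**4 / (15*c**2*sigma)."""
    return (2.0 * math.pi ** 5 * k ** 4 / (15.0 * c ** 2 * sigma)) ** (1.0 / 3.0)
```

The numerical integral agrees with pi**4/15, and with h = 6.626e-34 J·s, k = 1.381e-23 J/K and c = 2.998e8 m/s the first function returns a value within one per cent of 5.67e-8 W·m^-2·K^-4.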


36.3.4 The Planck formula for the emissive power of a blackbody
as a function of the wavelength is

$$r_\lambda^* = \frac{2\pi c^2 h}{\lambda^5}\,\frac{1}{e^{hc/k\lambda T} - 1}.$$

The wavelength $\lambda_m$ corresponding to maximum $r_\lambda^*$ is
determined from the transcendental equation

$$x e^x - 5 e^x + 5 = 0, \quad\text{where } x = \frac{hc}{\lambda_m kT}.$$

The root of this equation is $x = 4.965$, and it satisfies the Wien
displacement law (36.2.2):

$$\lambda_m T = b, \quad\text{where } b = \frac{hc}{4.965\,k} = 2.9 \times 10^{-3}\ \mathrm{m \cdot K}.$$
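
The nonzero root of the transcendental equation is easily found numerically. The sketch below uses plain bisection, which is one possible method rather than the one used in the handbook; the function names, the bracketing interval and the rounded constants are assumptions.

```python
import math


def wien_equation(x):
    """Left-hand side of x*exp(x) - 5*exp(x) + 5 = 0."""
    return x * math.exp(x) - 5.0 * math.exp(x) + 5.0


def wien_root(lo=1.0, hi=10.0, tol=1e-12):
    """Bisection for the nonzero root; the function changes sign on [lo, hi]."""
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if wien_equation(lo) * wien_equation(mid) <= 0.0:
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)


def wien_constant(h, k, c):
    """b = h*c / (x*k), with x the root found above."""
    return h * c / (wien_root() * k)
```

The root comes out close to 4.965, and with h = 6.626e-34 J·s, k = 1.381e-23 J/K and c = 2.998e8 m/s the Wien constant is about 2.9e-3 m·K.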


36.4  Optical  Pyrometry 

36.4.1 Optical pyrometry is the name given to miscellaneous
techniques  for  measuring  high  temperatures  based  on  the  laws 
of  thermal  radiation.  The  instruments  used  for  this  purpose  are 
called  pyrometers. 

Radiation  pyrometers  record  the  integrated  radiation  of  the  hea¬ 
ted  body  being  investigated;  optical  pyrometers  record  the  ra¬ 
diation  of  a  heated  body  in  one  or  two  narrow  portions  of  the 
spectrum. 

36.4.2 The radiant flux $\Phi_e$ is the average power of optical
radiation (31.4.3) during a time considerably greater than a period
of oscillation of the electromagnetic field of the light.

The irradiance $E_e$ (radiant flux density) of a surface is the ratio
of the flux of incident radiation on this surface to its area. Thus
$E_e = d\Phi_e/dS$, where $d\Phi_e$ is the radiant flux incident on a
portion of the surface of area $dS$.

The radiant intensity $I_e$ of a radiative body is the radiant flux
emitted by the body within a unit solid angle. Thus
$I_e = d\Phi_e/d\Omega$, where $d\Phi_e$ is the radiant flux within the
solid angle $d\Omega$.

36.4.3 The radiance $L_e$ of the portion $dS$ of a radiative surface
in a given direction is the ratio of the radiant intensity $dI_e$
of area $dS$ in the direction being considered to the area of the
projection of $dS$ on a plane perpendicular to the direction. Thus

$$L_e = \frac{dI_e}{dS \cos\varphi},$$

where $\varphi$ is the angle between the specified direction and the
normal to the surface of area $dS$.

The spectral density of the radiance is the ratio of the radiance
$dL_e$, corresponding to a narrow band of the optical spectrum,
to the width of this band. Thus

$$b_\nu = \frac{dL_e}{d\nu} \quad\text{and}\quad b_\lambda = \frac{dL_e}{d\lambda}.$$


36.4.4  A  source  of  optical  radiation  is  said  to  obey  Lambert's 
law,  or  to  be  a  cosine  radiator,  if  its  radiance  Le,  as  well  as  its 
spectral  densities  bv  and  bx  are  the  same  in  all  directions,  i.e. 
are  independent  of  angle  q>.  A  blackbody  could  be  called  a  cosine 
radiator. 

The  radiance  of  a  cosine  radiator  and  its  spectral  densities  are 
related  to  its  radiant  exitance  and  its  spectral  densities  (36.1.4) 
us  follows: 


MP 


36.4.5 In optical pyrometry, distinction is made between the
radiation, brightness and colour temperatures of a body.

The radiation temperature $T_r$ of a body is the temperature of
a blackbody at which its radiance $L_e^*$ is equal to the radiance
$L_e$ of the given body. If the body being investigated is a cosine
radiator, whose total degree of blackness is $\varepsilon$ (36.1.7), then,
from the condition that $L_e(T) = L_e^*(T_r)$, where $T$ is the true
temperature of the body, it follows that

$$\varepsilon M_e^*(T) = M_e^*(T_r) \quad\text{and}\quad T \geq T_r.$$

36.4.6 The brightness (luminance) temperature $T_b$ of a body is
the temperature of a blackbody at which its spectral density
$b_\lambda^*$ of radiance for a definite wavelength $\lambda_0$ is equal to the
spectral density $b_\lambda$ of radiance of the given body for the same
wavelength: $b_\lambda(\lambda_0, T) = b_\lambda^*(\lambda_0, T_b)$. Usually $\lambda_0 = 660$ nm (red
light).

For a cosine radiator, whose absorptance (36.1.5) for light of
the wavelength $\lambda_0$ at the temperature $T$ of the body is equal to
$a_\lambda(\lambda_0, T)$, it follows from Kirchhoff's law (36.1.6) and from
Planck's formula (36.3.4) that

$$a_\lambda(\lambda_0, T)\, r_\lambda^*(\lambda_0, T) = r_\lambda^*(\lambda_0, T_b)$$

and

$$a_\lambda(\lambda_0, T)\left[\exp\left(\frac{c_2}{\lambda_0 T_b}\right) - 1\right] = \exp\left(\frac{c_2}{\lambda_0 T}\right) - 1,$$

where $c_2 = hc/k$. Since $a_\lambda(\lambda_0, T) \leq 1$, then $T \geq T_b$.
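The corrections from a measured radiation or brightness temperature to the true temperature can be sketched as follows. This is a hedged illustration: the function names are hypothetical, and the radiation-temperature correction assumes that the radiant exitance of a blackbody is proportional to the fourth power of its temperature (the Stefan-Boltzmann law), which is not restated in this section.

```python
import math

H = 6.62607015e-34   # Planck's constant, J*s
C = 2.99792458e8     # velocity of light in free space, m/s
K = 1.380649e-23     # Boltzmann's constant, J/K
C2 = H * C / K       # constant c2 = hc/k, m*K


def true_temp_from_radiation_temp(t_r, blackness):
    """True temperature from the radiation temperature.

    Assumes a blackbody exitance proportional to T**4, so that
    blackness * T**4 = T_r**4.
    """
    return t_r / blackness ** 0.25


def true_temp_from_brightness_temp(t_b, absorptance, wavelength=660e-9):
    """Solve a*[exp(c2/(lam*T_b)) - 1] = exp(c2/(lam*T)) - 1 for T."""
    rhs = absorptance * math.expm1(C2 / (wavelength * t_b))
    return C2 / (wavelength * math.log1p(rhs))
```

With an absorptance (or degree of blackness) equal to unity both functions return the measured temperature unchanged, and for smaller values they return a higher true temperature, in agreement with the two inequalities above.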

36.4.7 The colour temperature $T_c$ of a body is the temperature of
a blackbody at which the relative distributions of the spectral
density of the radiance $b_\lambda^*$ of the blackbody and that $b_\lambda$ of the
body being considered are as close as possible to one another
in the visible region of the spectrum. Thus

$$\frac{b_\lambda(\lambda_1, T)}{b_\lambda(\lambda_2, T)} = \frac{b_\lambda^*(\lambda_1, T_c)}{b_\lambda^*(\lambda_2, T_c)}.$$

For a cosine radiator

$$\frac{r_\lambda(\lambda_1, T)}{r_\lambda(\lambda_2, T)} = \frac{r_\lambda^*(\lambda_1, T_c)}{r_\lambda^*(\lambda_2, T_c)}.$$

As a rule, $\lambda_1 = 655$ nm (red light) and $\lambda_2 = 470$ nm (green
light). The colour temperature of a greybody (36.1.5) coincides
with its true temperature and can be found by Wien's displacement
law (36.2.2).


CHAPTER  37  FUNDAMENTALS  OF  QUANTUM 
OPTICS 

37.1  External  Photoelectric  Effect 
(Photoemissive  Effect) 

37.1.1 Quantum optics is the branch of optics that studies
phenomena in which the quantum properties of light are manifested.
Such phenomena include thermal radiation (36.1.1), the
photoelectric effect, the Compton effect (37.3.1) and photochemical
processes.

The photoelectric effect in gases consists in the ionization of the
atoms and molecules of the gas due to the action of light, and
is usually called photoionization.

In condensed bodies (liquids and solids) distinction is made
between the external and internal photoelectric effects.

The external photoelectric effect (photoemission) is the emission
of electrons by a substance subject to the action of light.
Electrons ejected from the substance in the external photoelectric
effect are called photoelectrons, and the electric current formed
by them in their ordered motion in the external electric field
is called the photoelectric current.

The internal photoelectric effect is the redistribution of electrons
among the energy states in solid and liquid semiconductors and
dielectrics due to the action of light. It is manifested in the
change in the concentration of current carriers in a medium
(20.1.4) and leads to photoconduction or the barrier-layer
photoelectric effect. Photoconduction is an increase in the electrical
conductivity of a substance due to light falling on it. The
barrier-layer photoelectric effect is the generating of an emf
(photo-emf) by the action of light in a system consisting of a
semiconductor and metal in contact or two different kinds of
semiconductors [for example at a p-n junction (41.11.7)].

37.1.2 Shown in Fig. 37.1 is an installation for investigating
the external photoelectric effect in metals. Light enters through
window D and falls on the surface of cathode K, which is inside
a vacuum tube and is called the photoelectric cathode. The shape
of the curve, representing the dependence of the photoelectric
current $I$ in the tube on the potential difference $U$ between the
anode A and cathode K upon constant irradiance $E_e$ of the cathode
(36.4.2) by monochromatic light, is shown in Fig. 37.2.

The existence of a photoelectric current at negative values of $U$
from 0 to $-U_0$ indicates that the photoelectrons have a certain
initial velocity when emitted from the cathode, and the
corresponding kinetic energy. The maximum initial velocity $v_{\max}$ of
the photoelectrons is related to the retarding voltage (retarding
potential) $U_0$ by the equation

$$\frac{m v_{\max}^2}{2} = eU_0,$$


where  e  and  m  are  the  absolute  values  of  the  charge  and  the  mass 
of  the  electron. 

The photoelectric current increases with $U$ only to a definite
limiting value $I_s$ called the saturation photoelectric current. At
this value, all the electrons escaping from the cathode, due to
the effect of light, reach the anode. If $n_{sec}$ is the number of
photoelectrons emitted from the cathode per second, then

$$I_s = e n_{sec}.$$

Fig. 37.1   Fig. 37.2

37.1.3  The  external  photoelectric  effect  obeys  the  following 
laws. 

1. The Stoletov law states that at a constant spectral composition
of the light incident upon the photoelectric cathode, the
saturation photoelectric current is proportional to the irradiance
(36.4.2) of the cathode:

$$I_s \propto E_e \quad\text{and}\quad n_{sec} \propto E_e.$$

2. For a given photoelectric cathode, the initial velocity of the
photoelectrons depends upon the frequency of the light, but
not on its intensity.

3. A photoelectric threshold exists for each photoelectric cathode,
i.e. the minimum frequency $\nu_0$ of light at which the external
photoelectric effect is still feasible. This frequency $\nu_0$ depends
upon the material of the photoelectric cathode and the condition
of its surface.

The second and third laws cannot be interpreted on the basis
of the classical electromagnetic theory of light. According to
this theory the removal of conduction electrons from the metal
is a result of their "swaying" in the electromagnetic field of the
light wave so that they are "loosened". This effect should increase
with the light intensity and with the irradiance of the
photoelectric cathode, which is proportional to the intensity.

37.1.4 Only the quantum theory of light enabled the laws of
the external photoelectric effect to be properly explained. In
a further development of Planck's ideas on the quantization
of the energy of atoms (oscillators) (36.3.2), Einstein proposed
the hypothesis that light is not only radiated, but also propagates
in space and is absorbed by matter in the form of separate
discrete quanta of electromagnetic radiation, which were named
photons. All photons of monochromatic light of the frequency
$\nu$ have the same energy $E_{ph} = h\nu$, where $h$ is Planck's constant,
and travel in space at the velocity $c$ of light in vacuum (or in
free space). In the absorption of light each absorbed photon
transmits all of its energy to a particle of the absorbing
substance. In the external photoelectric effect, for example, a
conduction electron of the metal, in absorbing a photon, receives
all of its energy $h\nu$. The work function $\Phi$ (41.11.1) is the
minimum energy that it is necessary to impart to the electron for
it to escape from a metal. Hence, Einstein's photoelectric emission
equation, expressing the law of the conservation of energy
in the photoelectric effect, is of the form

$$h\nu = \Phi + \frac{m v_{\max}^2}{2}.$$

37.1.5 The second law of the photoelectric effect follows directly
from Einstein's equation:

$$\frac{m v_{\max}^2}{2} = eU_0 = h\nu - \Phi.$$

Thus, $v_{\max}$ and $U_0$ depend only upon the frequency of the light
and the work function in emitting an electron from the
photoelectric cathode.

The maximum initial kinetic energy of the photoelectrons depends
upon the light frequency according to a linear relation.
It becomes zero at the frequency $\nu_0$ corresponding to the
photoelectric threshold. Thus

$$\nu_0 = \frac{\Phi}{h}.$$

Consequently, the photoelectric threshold depends only on the
electron work function of the metal.
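The linear dependence of the maximum kinetic energy on frequency, and the threshold at which it vanishes, can be sketched in a few lines. The function names are hypothetical, and the 2.3 eV work function used in the usage lines is an arbitrary illustrative value, not data from the handbook.

```python
H = 6.62607015e-34          # Planck's constant, J*s
E_CHARGE = 1.602176634e-19  # absolute value of the electron charge, C


def threshold_frequency(work_function):
    """Photoelectric threshold nu_0 = Phi / h (work function in joules)."""
    return work_function / H


def max_kinetic_energy(frequency, work_function):
    """Einstein's equation: m*v_max**2/2 = h*nu - Phi.

    Returns 0.0 below the threshold, where no photoelectrons are emitted.
    """
    return max(H * frequency - work_function, 0.0)


def retarding_voltage(frequency, work_function):
    """Retarding voltage U_0 from e*U_0 = m*v_max**2/2, in volts."""
    return max_kinetic_energy(frequency, work_function) / E_CHARGE


# Usage with an assumed (illustrative) work function of 2.3 eV.
phi = 2.3 * E_CHARGE
nu_0 = threshold_frequency(phi)
u_0 = retarding_voltage(2.0 * nu_0, phi)   # light at twice the threshold
```

Because the photon energy at twice the threshold frequency is twice the work function, the retarding voltage in the usage lines equals the work function expressed in volts.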

37.1.6 The photoelectric effect is inertialess, i.e. the emission
of photoelectrons begins immediately, as soon as light with a
frequency $\nu \geq \nu_0$ falls on the photoelectric cathode. This feature
of the external photoelectric effect is still another confirmation
of the quantum nature of interaction between light and matter.
According to classical wave conceptions, quite considerable
time is required for an electromagnetic wave of given intensity
to impart sufficient energy to the electron for its emission from
the metal.

37.1.7 At very high intensities of light, achieved by means
of lasers (39.8.8), a multiple-photon, or nonlinear, photoelectric
effect is observed. In this case, the electron can simultaneously
receive the energy of $N$ photons rather than that of one photon.
Then the equation of the law of conservation of energy in the
external photoelectric effect under the action of light of
frequency $\nu$ is of the form

$$Nh\nu = \Phi + \frac{m v_{\max}^2}{2}.$$

The photoelectric threshold of the $N$-photon photoelectric effect
is

$$(\nu_0)_N = \frac{\Phi}{Nh}.$$

37.2 Mass and Momentum of the Photon. Light Pressure

37.2.1 The mass of the photon (37.1.4) having the energy
$E_{ph} = h\nu$, where $\nu$ is the frequency of the light and $h$ is Planck's
constant, can be found from the mass-energy relation (5.7.2).
Thus

$$m_{ph} = \frac{h\nu}{c^2},$$

where $c$ is the velocity of light in free space (or in a vacuum).
The photon always travels with the velocity $c$, and its rest mass
(5.6.1) equals zero.

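37.2.2 The momentum $p_{ph}$ of the photon is numerically equal to

$$p_{ph} = m_{ph}c = \frac{h\nu}{c} = \frac{h}{\lambda},$$

where $\lambda$ is the wavelength of the light in vacuum. Since the
wavenumber is $k = 2\pi/\lambda$,

$$p_{ph} = \hbar k \quad\text{and}\quad \mathbf{p}_{ph} = \hbar\mathbf{k},$$

where $\hbar = h/2\pi$, and $\mathbf{k}$ is the wave vector (30.2.7).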
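As a numerical illustration of these relations, the sketch below evaluates the energy, mass and momentum of a photon of a given vacuum wavelength. The function names and the 500 nm wavelength in the usage line are assumptions chosen for the example.

```python
import math

H = 6.62607015e-34        # Planck's constant, J*s
C = 2.99792458e8          # velocity of light in free space, m/s
HBAR = H / (2.0 * math.pi)


def photon_energy(wavelength):
    """E_ph = h*nu = h*c/lambda, in joules."""
    return H * C / wavelength


def photon_mass(wavelength):
    """m_ph = h*nu/c**2, in kilograms."""
    return photon_energy(wavelength) / C ** 2


def photon_momentum(wavelength):
    """p_ph = h/lambda, in kg*m/s."""
    return H / wavelength


# Usage: a photon of wavelength 500 nm (assumed example value).
lam = 500e-9
p = photon_momentum(lam)
```

The three expressions for the momentum, $m_{ph}c$, $h/\lambda$ and $\hbar k$, agree to within rounding error for any wavelength.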

37.2.3 Light exerts a pressure on the bodies that reflect or absorb
it. In quantum optics, light pressure is interpreted as the
result of the transfer of momentum to bodies by photons as the
light is reflected or absorbed. The light pressure $p$ on the plane
surface ab of a body (Fig. 37.3) is equal to the numerical value
of the normal component of the total momentum transferred
by photons to the body on unit area of the surface being
considered during unit time.

Fig. 37.3

Assume that monochromatic light of frequency $\nu$ is incident
upon surface ab at the angle $i$ (Fig. 37.3) and that $n_{sec}$ is the
number of photons incident per second upon unit area of surface
ab. If $R$ is the coefficient of reflection (31.5.6) of light from
the surface being considered, then, of the $n_{sec}$ photons, $Rn_{sec}$
are reflected and $(1 - R)\,n_{sec}$ are absorbed. The photons being
reflected transfer a total momentum to the body, directed normal
to the surface ab and numerically equal to

$$R n_{sec} \frac{2h\nu}{c} \cos i.$$

The photons being absorbed transfer a total momentum to the
body, whose component normal to surface ab is numerically
equal to

$$(1 - R)\, n_{sec} \frac{h\nu}{c} \cos i.$$

Thus the light pressure equals

$$p = R n_{sec} \frac{2h\nu}{c} \cos i + (1 - R)\, n_{sec} \frac{h\nu}{c} \cos i = (1 + R)\, n_{sec} \frac{h\nu}{c} \cos i.$$

If $n_0$ is the photon concentration of the incident light, then
$n_{sec} = n_0 c \cos i$ and $n_0 h\nu = \langle w \rangle$ is the average value of the
volume energy density of the light. Hence

$$p = n_0 h\nu (1 + R) \cos^2 i = \langle w \rangle (1 + R) \cos^2 i.$$

Thus,  the  pressure  of  light  can  be  equally  well  explained  by 
either  wave  theory  (31.2.6)  or  quantum  theory. 
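The equivalence of the photon-counting and energy-density expressions for the light pressure can be checked with a short sketch. The function names and the numerical inputs in the usage lines are hypothetical values chosen only for illustration.

```python
import math

H = 6.62607015e-34   # Planck's constant, J*s
C = 2.99792458e8     # velocity of light in free space, m/s


def pressure_from_photon_rate(n_sec, frequency, reflectance, angle):
    """p = (1 + R) * n_sec * (h*nu/c) * cos(i); n_sec per unit area per second."""
    return (1.0 + reflectance) * n_sec * H * frequency / C * math.cos(angle)


def pressure_from_energy_density(energy_density, reflectance, angle):
    """p = <w> * (1 + R) * cos(i)**2; <w> in J/m**3."""
    return energy_density * (1.0 + reflectance) * math.cos(angle) ** 2


# Usage with assumed values: photon concentration, frequency, R and angle.
n_0, nu, refl, inc = 1.0e15, 5.0e14, 0.6, 0.3
n_sec = n_0 * C * math.cos(inc)              # photons per m**2 per second
p_quantum = pressure_from_photon_rate(n_sec, nu, refl, inc)
p_density = pressure_from_energy_density(n_0 * H * nu, refl, inc)
```

For a perfect mirror at normal incidence the second function returns twice the energy density, and for a perfectly absorbing surface it returns the energy density itself.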


37.3  Compton  Effect 

37.3.1 The Compton effect is the change in the wavelength of
X rays when they are scattered by a substance containing
lightweight atoms. The wavelength $\lambda'$ of the radiation, scattered at
the angle $\theta$ to the direction of propagation of the primary
monochromatic radiation of wavelength $\lambda$, is longer than $\lambda$ by the
amount $\Delta\lambda$, which depends only on angle $\theta$. Thus

$$\Delta\lambda = \lambda' - \lambda = 2\lambda_C \sin^2 \frac{\theta}{2}.$$

The constant $\lambda_C = 2.43 \times 10^{-12}$ m is called the Compton
wavelength of the electron. The Compton effect cannot be explained
on the basis of the classical wave theory of light.

37.3.2 According to quantum theory, the Compton effect is
the result of elastic collision of the X-ray photon with a free or
effectively free electron (in light atoms, the binding energy of
an electron with the atom is substantially less than the energy
of an X-ray photon). In this collision the photon transfers a
part of its energy and a part of its momentum to the electron
in accordance with the laws of the conservation of energy and
momentum. If the electron was initially at rest, it follows from
the law of conservation of energy that

$$h\nu + m_0 c^2 = h\nu' + mc^2, \qquad \text{(a)}$$

where $\nu = c/\lambda$ and $\nu' = c/\lambda'$ are the frequencies of the incident
and scattered radiation, $m_0 c^2$ is the rest energy of the electron
(5.7.3), and $mc^2$ is the total energy of the electron (5.7.2) after
the collision (energy of the Compton recoil electron).

Fig. 37.4

It follows from the law of conservation of momentum that

$$\mathbf{p}_{ph} = m\mathbf{v} + \mathbf{p}'_{ph},$$

or, in accordance with Fig. 37.4,

$$(mv)^2 = \left(\frac{h\nu}{c}\right)^2 + \left(\frac{h\nu'}{c}\right)^2 - 2\frac{h^2 \nu\nu'}{c^2} \cos\theta, \qquad \text{(b)}$$

where $v$ is the velocity of the Compton recoil electron.

The mass of the recoil electron is related to its velocity $v$ by
the equation (5.6.1)

$$m = \frac{m_0}{\sqrt{1 - (v/c)^2}}. \qquad \text{(c)}$$

Combining equations (a), (b) and (c), we obtain

$$m_0 c^2 (\nu - \nu') = 2h\nu\nu' \sin^2 \frac{\theta}{2}$$

or

$$\Delta\lambda = \lambda' - \lambda = \frac{2h}{m_0 c} \sin^2 \frac{\theta}{2}.$$

Thus, the Compton wavelength of the electron is

$$\lambda_C = \frac{h}{m_0 c}.$$

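37.3.3 The kinetic energy of the recoil electron (5.7.1) is

$$E_k = (m - m_0) c^2 = h(\nu - \nu') = \frac{h\nu\nu'}{c} (\lambda' - \lambda)$$

or

$$E_k = h\nu \frac{\Delta\lambda}{\lambda + \Delta\lambda} = h\nu \frac{2\alpha \sin^2 \frac{\theta}{2}}{1 + 2\alpha \sin^2 \frac{\theta}{2}},$$

where $\alpha = \lambda_C/\lambda$, and $h\nu$ is the energy of the incident photon.

The energy $E_k$ is maximal at $\theta = \pi$. Thus

$$E_{k\,\max} = \frac{2\alpha h\nu}{1 + 2\alpha}.$$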
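A brief sketch of the recoil energy as a function of scattering angle is given below. The function name and the 0.071 nm wavelength in the usage line are assumptions made for illustration.

```python
import math

H = 6.62607015e-34     # Planck's constant, J*s
C = 2.99792458e8       # velocity of light in free space, m/s
M0 = 9.1093837015e-31  # rest mass of the electron, kg
LAMBDA_C = H / (M0 * C)


def recoil_kinetic_energy(wavelength, theta):
    """E_k = h*nu * 2a*sin(theta/2)**2 / (1 + 2a*sin(theta/2)**2), a = lambda_C/lambda."""
    alpha = LAMBDA_C / wavelength
    s = 2.0 * alpha * math.sin(theta / 2.0) ** 2
    return (H * C / wavelength) * s / (1.0 + s)


# Usage: backscattering of X rays of wavelength 0.071 nm (assumed value).
lam = 7.1e-11
e_max = recoil_kinetic_energy(lam, math.pi)
```

The energy vanishes for forward scattering and grows monotonically with the angle up to its maximum at backscattering.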

37.3.4 When an electron is tightly bound to its atom, a photon
scattered by the electron transfers energy and momentum, not
to the electron, but to the atom as a whole. The mass of the atom
is many times greater than that of the electron. Hence, only
a small part of the photon's energy is transferred to it, so that
the wavelength $\lambda'$ of the scattered radiation practically does not
differ from the wavelength $\lambda$ of the incident radiation. The
percentage of electrons that are tightly bound in their atoms
increases with the atomic number of the element and, consequently,
with the atomic mass. Therefore, the heavier the atoms
of the scattering substance, the greater the relative intensity
of the unshifted component ($\lambda' = \lambda$) in the scattered radiation.

37.3.5 In contrast to the scattering of photons, accomplished
by both free and bound electrons, only bound electrons can absorb
photons. For example, in the external photoelectric effect,
a photon is absorbed by a bound electron, which expends a part
of the energy it receives on the work function (i.e. in doing work
to escape from the substance), the work function being a measure
of how strongly the electron was bound in the substance.
The absorption of a photon by a free electron is impossible,
because this process would contradict the laws of conservation
of energy and momentum. This can be readily shown by considering
the absorption of a photon by a stationary free electron.
It follows from the laws of conservation of energy and momentum
that the following two relations must be simultaneously
complied with:

$$(m - m_0) c^2 = h\nu \quad\text{and}\quad mv = \frac{h\nu}{c}, \quad\text{where } m = \frac{m_0}{\sqrt{1 - (v/c)^2}}.$$

But these equations can be satisfied simultaneously only at $v = 0$.
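The incompatibility can be made explicit by eliminating $h\nu$ from the two relations. The short derivation below is an added worked step that uses only the relations stated above:

```latex
\begin{aligned}
(m - m_0)c^2 = h\nu = mvc
  \;&\Rightarrow\; m\,(c - v) = m_0 c \\
  &\Rightarrow\; \frac{1 - v/c}{\sqrt{1 - (v/c)^2}} = 1 \\
  &\Rightarrow\; \sqrt{\frac{1 - v/c}{1 + v/c}} = 1
  \;\Rightarrow\; v = 0 .
\end{aligned}
```

With $v = 0$ the first relation gives $h\nu = 0$, i.e. no photon has been absorbed at all.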


37.4  Wave-Particle  Duality  of  the  Properties  of  Light 

37.4.1 Such phenomena as interference (30.5.2) and diffraction
of light (33.2.1) are convincing proof of the wave nature of
light. At the same time, the laws of thermal radiation equilibrium
(36.1.2), the photoelectric effect (37.1.1) and the Compton
effect (37.3.1) can be successfully interpreted only on the
basis of quantum concepts of light as being a flux of discrete
photons (37.1.4). But the wave and quantum (particle) methods
of describing light mutually complement, rather than contradict,
each other, because light simultaneously possesses both
wave- and particlelike properties. Light is a dialectic unity of
these opposite properties.

37.4.2 The principal equations relating the wavelike properties
of light (frequency $\nu$ and wavelength $\lambda$ in vacuum) to its
particlelike properties (the energy $E_{ph}$ and momentum $p_{ph}$
of a photon) are:

$$E_{ph} = h\nu \quad\text{and}\quad p_{ph} = \frac{h}{\lambda}.$$

The wavelike properties of light play the decisive role in the
laws of its propagation, interference, diffraction, and polarization;
whereas the particlelike properties play no less important
a role in the interaction of light with matter. The longer
the wavelength of light, the less the momentum and energy of
the photon, and the more difficult it is to detect the quantum
properties of light. For example, the external photoelectric
effect occurs only at energies of a photon greater than or equal
to the work function in releasing an electron from a substance
(37.1.5). The shorter the wavelength of electromagnetic radiation,
the higher the energy and momentum of the photons, and
the more difficult it is to detect the wavelike properties of this
radiation. For example, X rays are diffracted only by an
extremely "thin" diffraction grating, the crystal lattice of a
solid (33.3.4).

37.4.3 Quantum optics employs the statistical approach in
dealing with the laws of light propagation. According to this
approach, the diffraction of monochromatic light by some obstacle
(for instance, by a diffraction grating) consists in the
redistribution of the photons in space caused by this obstacle.
The probability that the photons will hit various points on
a screen mounted behind the obstacle is not the same. This
explains why a diffraction pattern appears on the screen: the
irradiance $E_e$ (36.4.2) of some small portion $dS$ of the screen's
surface is proportional to the number $dn_{sec}$ of photons incident
upon this portion ($dS$) per second. Consequently, $E_e$ is proportional
to the probability of photons hitting unit area of the
screen's surface at the point being considered. On the other
hand, according to wave conceptions, $E_e$ is proportional to the
square of the amplitude $A$ of the light at the same point on the
screen. Hence, the square of the amplitude of the light wave
at some point in space is a measure of the probability of a photon
being at this point.

37.4.4 Experiments in light diffraction indicate that in changing
the intensity of a light flux incident on an obstacle, the
appearance of the diffraction pattern, i.e. the relation between
the irradiances at various points of the screen, remains unchanged.
This indicates that wavelike properties are inherent in
each separate photon, and not only in an assembly of a great
number of simultaneously travelling photons. As a photon passes
through an optical system it is impossible to indicate the
particular point of the screen it will hit. It is feasible only to
speak of the probability $dw$ that the photon will hit some small
portion $dS$ of the surface of the screen.


PART  SIX  ATOMIC 
AND  MOLECULAR  PHYSICS 


CHAPTER  38  ELEMENTS  OF  QUANTUM  MECHANICS 


38.1  Wave-Particle  Dualism  of  the  Properties 
of  Particles  of  Matter 

38.1.1 Quantum mechanics* deals with the physics of atoms,
molecules and their associations, crystals in particular, as well
as atomic nuclei and elementary particles. Objects of the microcosm
studied by quantum mechanics have linear dimensions
of the order of $10^{-6}$ to $10^{-13}$ cm. If the particles travel at
velocities $v \ll c$, where $c$ is the velocity of light in free space,
nonrelativistic quantum mechanics is applied; at $v \sim c$, relativistic
quantum mechanics is resorted to.**

38.1.2 Quantum mechanics is based on Planck's concepts concerning
the discrete nature of changes in the energy of atoms
(36.3.2), Einstein's ideas on photons (37.1.4), and data on the
quantum nature of certain physical quantities (momentum and
energy, for instance) that characterize, under definite conditions,
the states of particles of the microcosm.

38.1.3 The basic concept of quantum mechanics is that the
wave-particle dualism of properties, established for light (37.4.1),
is of a universal nature. It should manifest itself for any particles
having a momentum $p$. All particles with the finite momentum
$p$ possess wave properties and their motion is accompanied
by a certain wave process.


*  Also  called  wave  mechanics  in  its  early  stages  of  develop¬ 
ment. 

** Information on relativistic quantum mechanics is beyond
the scope of the present handbook. Wherever applied, the term
quantum mechanics refers to nonrelativistic quantum mechanics.

38.1.4 The de Broglie formula establishes the dependence of
the wavelength $\lambda$ associated with a moving particle of matter
on the momentum $p$ of the particle. Thus

$$\lambda = \frac{h}{p} = \frac{h}{mv},$$

where $m$ is the mass of the particle, $v$ is its velocity, and $h$ is
Planck's constant (Appendix II). The waves mentioned here
are called de Broglie, or matter, waves.

Another form of the de Broglie formula is

$$\mathbf{p} = \hbar\mathbf{k},$$

where $\mathbf{k} = 2\pi\mathbf{n}/\lambda$ is the wave vector, whose magnitude
$k = 2\pi/\lambda$ is the wavenumber, i.e. the number of wavelengths
per $2\pi$ units of length, $\mathbf{n}$ is a unit vector in the direction of wave
propagation, and $\hbar = h/2\pi = 1.05 \times 10^{-34}$ J·s.

38.1.5 The de Broglie wavelength for a particle of mass $m$,
having the kinetic energy $E_k$ (3.2.1), is

$$\lambda = \frac{h}{\sqrt{2mE_k}}.$$

In particular, for an electron accelerated in an electric field
with a potential difference of $\Delta V$ volts (16.2.7),

$$\lambda = \frac{12.25}{\sqrt{\Delta V}}\ \text{Å}.$$

38.1.6 The de Broglie formula is confirmed by experiments in
the scattering of electrons and other particles by crystals and
in the passage of particles through matter. An indication of
the wave process in all such experiments is the diffraction pattern
observed, showing the distribution of the electrons (or other
particles) in the particle detecting devices.

No  wave  properties  are  manifested  by  macroscopic  bodies.  The 
de  Broglie  wavelengths  for  such  bodies  are  so  short  that  it 
proves  impossible  to  detect  any  wave  properties. 

38.1.7 The phase velocity (30.2.10) of de Broglie waves is

$$v_{ph} = \frac{\omega}{k} = \frac{c^2}{v} = \frac{mc^2}{h}\lambda,$$

where $m$ is the mass of the particle, $v$ is its velocity, and $\lambda$ is
the de Broglie wavelength. Since $c > v$, the phase velocity
of de Broglie waves is greater than the velocity of light in free
space. The dependence of the phase velocity of de Broglie waves
on the wavelength indicates that these waves undergo dispersion
(30.3.8). The group velocity (30.4.3) of de Broglie waves
is equal to the particle velocity $v$. Thus

$$u = \frac{d\omega}{dk} = v.$$

Table 38.1 compares the particle and wave properties of particles
of mass $m$, travelling at the velocity $v$.

TABLE 38.1

Particlelike properties    Wavelike properties

Velocity $v$               De Broglie wavelength $\lambda = h/p = h/(mv)$
Momentum $p = mv$          De Broglie wave frequency $\nu = E/h = mc^2/h$
Energy $E = mc^2$          Group velocity of de Broglie waves $u = v$
                           Phase velocity of de Broglie waves $v_{ph} = c^2/v$
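The phase and group velocities listed in the table can be checked numerically. The sketch below is an illustration only: the function names are hypothetical, and it assumes the relativistic relation $E^2 = (m_0 c^2)^2 + (pc)^2$ together with $E = \hbar\omega$ and $p = \hbar k$ as the dispersion law of the de Broglie wave. The group velocity is estimated by a central finite difference.

```python
import math

H = 6.62607015e-34
HBAR = H / (2.0 * math.pi)
C = 2.99792458e8       # velocity of light in free space, m/s
M0 = 9.1093837015e-31  # rest mass of the electron, kg


def omega(k):
    """Assumed dispersion law: hbar*omega = sqrt((m0*c**2)**2 + (hbar*k*c)**2)."""
    return math.sqrt((M0 * C ** 2) ** 2 + (HBAR * k * C) ** 2) / HBAR


def phase_and_group_velocity(v, rel_step=1e-5):
    """Return (omega/k, d omega/dk) for a particle travelling at velocity v."""
    gamma = 1.0 / math.sqrt(1.0 - (v / C) ** 2)
    k = gamma * M0 * v / HBAR          # from p = m*v = hbar*k
    dk = rel_step * k
    v_phase = omega(k) / k
    v_group = (omega(k + dk) - omega(k - dk)) / (2.0 * dk)
    return v_phase, v_group


# Usage: an electron travelling at one tenth of the velocity of light.
v = 0.1 * C
v_phase, v_group = phase_and_group_velocity(v)
```

The product of the two velocities equals $c^2$, so the phase velocity exceeds $c$ whenever the particle velocity is less than $c$.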


38.1.8 Besides the de Broglie formula, it is assumed in quantum
mechanics that the following relation exists between the
total energy $E$ of a particle and the frequency $\nu$ of its de Broglie
wave:

$$E = h\nu = \hbar\omega,$$

where $\omega = 2\pi\nu$ is the angular frequency (28.1.2) and
$\hbar = h/2\pi$ (38.1.4).

38.1.9 De Broglie waves are of a specific nature having no analogies
among the waves dealt with in classical physics. The square
of the magnitude of the de Broglie wave's amplitude at a
given point is a measure of the probability that the particle
is to be found at this point (this is the probability, or statistical,
meaning of de Broglie waves). The diffraction patterns, observed
in the experiments mentioned in Sect. 38.1.6, are an indication
of statistical laws. According to these laws the particles reach
definite places in the receivers, these being the places where
the intensity (30.3.6) of the de Broglie wave is found to be
highest. No particles are observed at the places where, according
to the statistical interpretation, the square of the magnitude
of the "probability wave" amplitude becomes zero.


38.2 Schrödinger Wave Equation

38.2.1 The position of a particle in space at a given instant
of time is determined in quantum mechanics by specifying its
Schrödinger wave function (also called the psi function or the
probability amplitude) $\psi(x, y, z, t)$. The probability $dw$ that
the particle is within the element $dV$ of volume is proportional
to $|\psi|^2$ and to the element $dV$ of volume. Thus

$$dw = |\psi|^2\, dV,$$

where $|\psi|^2$ is the square of the magnitude of the psi function:
$|\psi|^2 = \psi\psi^*$. Here $\psi^*$ is a function that is the complex
conjugate of $\psi$. The quantity $|\psi|^2$ is the probability density,
$|\psi|^2 = dw/dV = \rho$, and specifies the probability of the particle
occupying the given point in space. The intensity of the de Broglie
wave is determined by the quantity $|\psi|^2$.

38.2.2 Following from the definition of the psi function is the
normalization condition for probabilities:

$$\iiint\limits_{-\infty}^{+\infty} |\psi|^2\, dV = 1,$$

in which the triple integral with respect to volume is calculated
along the coordinates $x$, $y$ and $z$, from $-\infty$ to $+\infty$, i.e.
throughout the whole infinite space. The normalization condition
indicates that it is a certain event that the particle is somewhere
in space and the probability of this event should be unity.

38.2.3 The wave function $\psi(x, y, z, t)$ is the main characteristic
of the state of microobjects (atoms, molecules and elementary
particles). This function is made use of to calculate the
average value of some physical quantity $L$ that characterizes
the object which is in the state described by the wave function
$\psi$. Thus

$$\langle L \rangle = \iiint\limits_{-\infty}^{+\infty} L\, |\psi|^2\, dx\, dy\, dz,$$

where $\langle L \rangle$ is the average value of $L$ and integration is carried
out as indicated in Sect. 38.2.2.
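The normalization condition and the rule for average values can be illustrated numerically. The sketch below makes several assumptions that are not in the handbook: it is restricted to one dimension, it uses a real Gaussian wave function centred at $x_0 = 1$ with $\sigma = 0.5$ as an example state, and it integrates by the trapezoidal rule over a finite interval wide enough for the tails to be negligible.

```python
import math


def gaussian_psi(x, x0=1.0, sigma=0.5):
    """Example 1D wave function whose |psi|**2 is a normal density (assumed)."""
    norm = (2.0 * math.pi * sigma ** 2) ** -0.25
    return norm * math.exp(-(x - x0) ** 2 / (4.0 * sigma ** 2))


def integrate(f, lo, hi, steps=20000):
    """Composite trapezoidal rule on [lo, hi]."""
    h = (hi - lo) / steps
    total = 0.5 * (f(lo) + f(hi))
    for j in range(1, steps):
        total += f(lo + j * h)
    return total * h


def density(x):
    """Probability density |psi|**2."""
    return abs(gaussian_psi(x)) ** 2


total_probability = integrate(density, -10.0, 12.0)           # should be 1
mean_x = integrate(lambda x: x * density(x), -10.0, 12.0)      # <x>
```

Here the quantity being averaged is the coordinate itself, so the result is simply the centre of the wave packet.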

38.2.4 The fundamental differential equation of quantum mechanics*,
concerning the wave function $\psi(x, y, z, t)$, is called
the Schrödinger time-dependent wave equation. It determines
the psi function for microparticles travelling in a field of force with
the potential energy $U(x, y, z, t)$ (3.3.1) at a velocity $v \ll c$,
where $c$ is the velocity of light in free space. The Schrödinger
equation is of the form

$$i\hbar \frac{\partial\psi}{\partial t} = -\frac{\hbar^2}{2m} \Delta\psi + U(x, y, z, t)\, \psi,$$

where $\Delta$ is the Laplace operator, $m$ is the mass of the particle,
$\hbar = h/2\pi$, $h$ is Planck's constant, and $i = \sqrt{-1}$.

The Schrödinger equation is supplemented by the following
conditions that are imposed on the psi function:

(a) the psi function should be finite, single-valued and continuous;

(b) the partial derivatives $\partial\psi/\partial x$, $\partial\psi/\partial y$ and $\partial\psi/\partial z$ should be
continuous;

(c) the function $|\psi|^2$ should be integrable, i.e. the integral
$\iiint_{-\infty}^{+\infty} |\psi|^2\, dV$ should be finite. In the simplest cases, this
condition is reduced to the normalization of probabilities (38.2.2).

* See the second footnote to Sect. 38.1.1.

38.2.5 In the case when the psi function is time-independent,
i.e. $\psi = \psi(x, y, z)$, it satisfies the Schrödinger steady-state
equation:

$$\Delta\psi + \frac{2m}{\hbar^2}(E - U)\,\psi = 0,$$

where E is the total energy of the particle. The remaining notation
is given in Sect. 38.2.4. The psi functions satisfying the
Schrödinger equation for the given form of U = U(x, y, z)
are called eigenfunctions. They exist only at definite values of E,
called the eigenvalues of the energy. The whole set of eigenvalues
of E forms the energy spectrum of the particle. If U is a monotonic
function and $U \to 0$ at infinity, then in the region E < 0, the
eigenvalues of the energy form a discrete spectrum. The most
important problem of quantum mechanics is to find the eigenvalues
and eigenfunctions of particles (or systems of particles).

38.2.6 Schrödinger's time-dependent equation has the solution:

$$\psi(x, y, z, t) = \psi(x, y, z)\, e^{-iEt/\hbar}.$$

The state of the particle at the given instant of time is described
by a periodic function of time with the angular frequency
$\omega = E/\hbar$, determined by the total energy E of the particle.
This corresponds to the relation between the total energy E of
the particle and the de Broglie wave frequency (38.1.8).

If a particle is in a definite energy state with the energy
E = const, the probability dw of finding it in the element dV of
volume is independent of time. Thus

$$dw = |\psi|^2 \, dV = \psi\psi^* \, dV.$$

This is said to be the steady state of the particle. An atom in the
steady state has constant energy and does not radiate
electromagnetic waves (39.1.7).


38.3  Motion  of  a  Free  Particle 

38.3.1 In free motion of a particle (U = 0), its total energy E
coincides with its kinetic energy. If the OX axis is taken along
vector v of the particle's velocity (v = const), Schrödinger's
steady-state equation (38.2.5) has the following solution:

$$\psi = A e^{\frac{i}{\hbar}\sqrt{2mE}\,x} + B e^{-\frac{i}{\hbar}\sqrt{2mE}\,x},$$

where m is the mass of the particle, $\hbar = h/2\pi$, h is Planck's
constant, and A and B are certain constants. The Schrödinger
time-dependent equation (38.2.4) has, in this case, the solution

$$\psi(x, y, z, t) = A e^{-i\left(\frac{E}{\hbar}t - \frac{\sqrt{2mE}}{\hbar}x\right)} + B e^{-i\left(\frac{E}{\hbar}t + \frac{\sqrt{2mE}}{\hbar}x\right)},$$

which is the superposition of two plane monochromatic waves
(31.1.6) of equal frequency $\omega = E/\hbar$, propagating, one in
the positive direction of the OX axis with the amplitude A, and
the other in the opposite direction with the amplitude B.

38.3.2 In quantum mechanics a free particle is described by
a plane monochromatic de Broglie wave with the wavenumber
k (38.1.4):

$$k = \frac{1}{\hbar}\sqrt{2mE}.$$

The probability of finding the particle at any point in space is
constant. For waves propagating along the positive direction
of the OX axis,

$$|\psi|^2 = \psi\psi^* = |A|^2.$$
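
As a numerical illustration of the wavenumber formula, the sketch below evaluates k and the corresponding de Broglie wavelength $\lambda = 2\pi/k$ for an electron. The kinetic energy of 100 eV is an assumed example value, and the function names are hypothetical helpers introduced only for this illustration:

```python
import math

HBAR = 1.054571817e-34      # reduced Planck constant, J*s
M_ELECTRON = 9.1093837e-31  # electron rest mass, kg
EV = 1.602176634e-19        # one electronvolt in joules

def wavenumber(energy_j, mass_kg):
    """k = sqrt(2 m E) / hbar for a free nonrelativistic particle."""
    return math.sqrt(2.0 * mass_kg * energy_j) / HBAR

def de_broglie_wavelength(energy_j, mass_kg):
    """lambda = 2 pi / k."""
    return 2.0 * math.pi / wavenumber(energy_j, mass_kg)

energy = 100.0 * EV  # assumed illustrative kinetic energy
k = wavenumber(energy, M_ELECTRON)
lam = de_broglie_wavelength(energy, M_ELECTRON)
print(f"k = {k:.3e} 1/m, wavelength = {lam:.3e} m")
```

The wavelength obtained is of the order of atomic dimensions, which is why the wavelike properties of such electrons are observable.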


38.4 A Particle in a One-Dimensional Infinitely
Deep Potential Well

38.4.1 A potential well is a region of space in which the potential
energy U of a particle is less than a certain value $U_{\max}$.
In particular, at U = U(x) and $U_{\max} = \infty$, there is a
one-dimensional potential well of infinite depth. If the potential
energy of the particle outside and inside the potential well
has the following values (Fig. 38.1):

$U = 0$ at $0 < x < L$,

$U = \infty$ at $x \leq 0$ and $x \geq L$,

the well has a “flat bottom”.

The motion of collectivized electrons in a metal is dealt with
in classical electronic theory as motion in a potential well,
the potential energy of the electron outside the metal equalling
zero, and that inside the metal being negative and numerically
equal to the work function of the metal (energy required to remove
an electron from the metal) (41.11.1). The potential well
illustrated in Fig. 38.1 is simpler than the real potential well
of electrons in a metal.

38.4.2 The Schrödinger steady-state equation (38.2.5) for particles
in a potential well, considered in the preceding subsection, is of
the form

$$\frac{d^2\psi}{dx^2} + \frac{2m}{\hbar^2} E\,\psi = 0$$

at the boundary conditions $\psi(0) = \psi(L) = 0$, which indicate
that $\psi = 0$ and $|\psi|^2 = 0$ outside the region $0 < x < L$, i.e.
that the probability of finding the particle outside the potential
well is zero.

[Fig. 38.1: potential energy U versus x for the infinitely deep well]

The solution of the Schrödinger equation is

$$\psi(x) = A \cos kx + B \sin kx,$$

where A and B are constants, and $k = \sqrt{2mE}/\hbar$ is the
wavenumber (38.1.4). It follows from the boundary conditions that
$A = 0$, $B \neq 0$ and $\sin kL = 0$, i.e. the wavenumber can assume
a set of discrete values that comply with the requirement:
$k_n L = n\pi$, where n = 1, 2, 3, ... .

The last equation means that

$$\frac{2\pi}{\lambda_n} L = n\pi, \quad \text{or} \quad L = n\,\frac{\lambda_n}{2}.$$

A whole number of de Broglie half-waves should fit into the
length of the potential well.

38.4.3 Physical quantities that can have only definite discrete
values are said to be quantized (quantized physical quantities).
The eigenvalues of the energy $E_n$ of a particle in a
one-dimensional infinitely deep potential well,

$$E_n = \frac{n^2 \pi^2 \hbar^2}{2mL^2} \quad (n = 1, 2, \ldots),$$

comprise a discrete set of energy values that are quantized (see
also Sect. 38.2.5).

The quantized values of $E_n$ are called energy levels, and the
numbers n, determining the energy levels of a particle in a
potential well, are called quantum numbers.
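
To give a feeling for the magnitudes involved, the sketch below evaluates the first few energy levels from the formula above. The choice of an electron in a well of width 1 nm is an assumption made purely for illustration, and `well_energy` is a hypothetical helper name:

```python
import math

HBAR = 1.054571817e-34      # reduced Planck constant, J*s
M_ELECTRON = 9.1093837e-31  # electron rest mass, kg
EV = 1.602176634e-19        # one electronvolt in joules

def well_energy(n, mass_kg, width_m):
    """E_n = n^2 pi^2 hbar^2 / (2 m L^2) for the infinitely deep well."""
    if n < 1:
        raise ValueError("the quantum number n must be 1, 2, 3, ...")
    return (n * math.pi * HBAR) ** 2 / (2.0 * mass_kg * width_m**2)

WIDTH = 1.0e-9  # assumed well width, m

levels_ev = [well_energy(n, M_ELECTRON, WIDTH) / EV for n in range(1, 5)]
for n, level in enumerate(levels_ev, start=1):
    print(f"n = {n}: E_n = {level:.3f} eV")
```

The levels grow as $n^2$, so the second level lies four times higher than the first.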

38.4.4 At large quantum numbers ($n \gg 1$), the energy levels
of a particle in a potential well become relatively closer to one
another: $\Delta E / E_n \approx 2/n \ll 1$, where
$\Delta E = E_{n+1} - E_n = (2n + 1)\pi^2\hbar^2 / 2mL^2$.
The inequality $\Delta E \ll E_n$ at $n \gg 1$ indicates that the
quantization of energy at large quantum numbers yields results
close to those obtained in classical physics: the energy levels
become quasi-continuous (quasi-continuous energy levels at
$n \gg 1$).
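
The approach to the quasi-continuous limit can be seen directly from the ratio $\Delta E / E_n = (2n + 1)/n^2$, which follows from $E_n \propto n^2$. A minimal sketch (the function name is a hypothetical helper):

```python
def relative_spacing(n):
    """(E_{n+1} - E_n) / E_n for levels E_n proportional to n^2."""
    return (2 * n + 1) / n**2

for n in (1, 10, 100, 1000):
    # Compare the exact ratio with the large-n estimate 2/n.
    print(f"n = {n:5d}: exact = {relative_spacing(n):.6f}, 2/n = {2 / n:.6f}")
```

The exact ratio and the estimate 2/n agree increasingly well as n grows.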

Bohr's correspondence principle states that at high quantum
numbers the conclusions and results obtained in quantum mechanics
should correspond to those of classical mechanics.

A more general statement of the correspondence principle is
that between any theory developed from a classical one and the
initial classical theory there must be a regular relationship:
in definite limiting cases, the new theory must go over into the
old one. For example, the formulas of kinematics and dynamics
of the special theory of relativity become the formulas of
Newton's mechanics at velocities where $(v/c)^2 \ll 1$ (5.3.4).
Geometrical optics is a limiting case of wave optics if the
wavelength can be neglected ($\lambda \to 0$).


38.5  Linear  Harmonic  Oscillator 


38.5.1 A linear (one-dimensional) harmonic oscillator is a particle
of mass m, vibrating with the natural angular frequency $\omega_0$
(28.1.2) along a certain axis OX due to the action of quasi-elastic
force F, which is proportional to the displacement x of
the particle from its equilibrium position. Thus $F = -kx$.
Here k is a factor of the quasi-elastic force, related to m and
$\omega_0$ by the equation: $k = m\omega_0^2$ (28.2.3). The potential
energy U of the harmonic oscillator (28.2.3) is

$$U = \frac{kx^2}{2} = \frac{m\omega_0^2 x^2}{2}.$$

38.5.2 In classical physics the amplitude (28.1.3) of small
vibration of a harmonic oscillator is determined by its store of
total energy E (Fig. 38.2). At points B and A, with the coordinates
$\pm a$, the total energy E is equal to the potential energy:
$E = U(+a) = U(-a)$, where a is the amplitude of vibration of the
classical harmonic oscillator. Such an oscillator cannot get outside
the limits of the region $(-a, +a)$.

[Fig. 38.2]

[Fig. 38.3]

38.5.3 The probability $w_{cl}(x)\,dx$ of finding the oscillator on
the line segment from x to x + dx in accordance with classical
mechanics,

$$w_{cl}(x)\,dx = \frac{1}{\pi a}\,\frac{dx}{\left(1 - \dfrac{x^2}{a^2}\right)^{1/2}},$$

is represented by the curve in Fig. 38.3.
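
The classical density can be recovered by a short argument. The sketch below is not a derivation given in the handbook; it assumes the motion $x = a \sin \omega_0 t$ and takes the probability of finding the particle in dx to be the fraction of a half-period T/2 spent there:

```latex
\begin{aligned}
x &= a\sin\omega_0 t, \qquad
dx = a\omega_0\cos\omega_0 t\,dt = \omega_0\sqrt{a^2 - x^2}\,dt,\\
w_{cl}(x)\,dx &= \frac{dt}{T/2} = \frac{\omega_0\,dt}{\pi}
  = \frac{dx}{\pi\sqrt{a^2 - x^2}}
  = \frac{1}{\pi a}\,\frac{dx}{\left(1 - x^2/a^2\right)^{1/2}},\\
\int_{-a}^{+a} w_{cl}(x)\,dx &= \frac{1}{\pi}\left[\arcsin\frac{x}{a}\right]_{-a}^{+a} = 1 .
\end{aligned}
```

The density is greatest near the turning points $x = \pm a$, where the classical particle moves most slowly.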

38.5.4 The vibration of a linear harmonic oscillator is investigated
in quantum mechanics by means of the Schrödinger
steady-state equation (38.2.5):

$$\frac{d^2\psi}{dx^2} + \frac{2m}{\hbar^2}\left(E - \frac{m\omega_0^2 x^2}{2}\right)\psi = 0.$$

The solutions of this equation, complying with the conditions
in Sect. 38.2.4, are the eigenfunctions (wave functions) of a linear
harmonic oscillator:

$$\psi_n(x) = \frac{1}{\sqrt{x_0}}\, e^{-x^2/2x_0^2}\, H_n\!\left(\frac{x}{x_0}\right),$$

where $x_0 = \sqrt{h/2\pi m\omega_0}$, and $H_n(\xi)$ is a
Chebyshev-Hermite polynomial of the nth order:

$$H_n(\xi) = \frac{(-1)^n}{\sqrt{2^n\, n!\, \sqrt{\pi}}}\; e^{\xi^2}\, \frac{d^n e^{-\xi^2}}{d\xi^n}.$$
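The eigenfunctions for n = 0, 1, 2 are:

$$\psi_0(x) = \frac{1}{\sqrt{x_0\sqrt{\pi}}}\; e^{-x^2/2x_0^2};$$

$$\psi_1(x) = \frac{1}{\sqrt{2x_0\sqrt{\pi}}}\; \frac{2x}{x_0}\; e^{-x^2/2x_0^2};$$

$$\psi_2(x) = \frac{1}{\sqrt{8x_0\sqrt{\pi}}} \left(\frac{4x^2}{x_0^2} - 2\right) e^{-x^2/2x_0^2}.$$

A node of a wave function is a point at which its value is equal to
zero. The number of nodes of the function $\psi_n$ is equal to the
quantum number n (38.4.3).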
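
The normalization of the oscillator eigenfunctions and the rule for the number of nodes can be verified numerically. The sketch below is an illustration under stated assumptions: it uses the standard (unnormalized) Hermite polynomials generated by a recurrence and places the normalizing factor in `psi`, whereas the handbook absorbs that factor into $H_n$; the length scale is set to $x_0 = 1$ for convenience.

```python
import math

def hermite(n, xi):
    """Standard Hermite polynomial H_n(xi) from the recurrence
    H_{k+1} = 2 xi H_k - 2 k H_{k-1}, with H_0 = 1 and H_1 = 2 xi."""
    if n == 0:
        return 1.0
    h_prev, h_curr = 1.0, 2.0 * xi
    for k in range(1, n):
        h_prev, h_curr = h_curr, 2.0 * xi * h_curr - 2.0 * k * h_prev
    return h_curr

def psi(n, x, x0=1.0):
    """Normalized oscillator eigenfunction psi_n(x); x0 is the length scale."""
    xi = x / x0
    norm = math.sqrt(2.0**n * math.factorial(n) * math.sqrt(math.pi) * x0)
    return hermite(n, xi) * math.exp(-0.5 * xi * xi) / norm

def norm_integral(n, x_max=12.0, steps=4800):
    """Trapezoidal estimate of the integral of |psi_n|^2 over [-x_max, x_max]."""
    step = 2.0 * x_max / steps
    total = 0.0
    for i in range(steps + 1):
        weight = 0.5 if i in (0, steps) else 1.0
        total += weight * psi(n, -x_max + i * step) ** 2
    return total * step

def count_nodes(n, x_max=8.0, steps=1600):
    """Count sign changes of psi_n on a midpoint grid (the grid never lands on x = 0)."""
    step = 2.0 * x_max / steps
    values = [psi(n, -x_max + (i + 0.5) * step) for i in range(steps)]
    return sum(1 for a, b in zip(values, values[1:]) if a * b < 0)

for n in range(5):
    print(f"n = {n}: norm = {norm_integral(n):.6f}, nodes = {count_nodes(n)}")
```

For each n tried, the norm should be close to unity and the number of sign changes should equal n.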

38.5.5 The eigenvalues of the energy $E_n$ of a linear harmonic
oscillator,

$$E_n = \left(n + \frac{1}{2}\right) h\nu_0 = \left(n + \frac{1}{2}\right) \hbar\omega_0 \quad (n = 0, 1, 2, \ldots),$$

where $\nu_0 = \omega_0/2\pi$, and $\omega_0$ is the natural angular
frequency (38.5.1), are the set of equally spaced energy levels
shown in Fig. 38.4. At $n \gg 1$ (where $n + 1/2 \approx n$), the
energy levels of the oscillator coincide with the values of
quantized energy of the oscillator $E'_n = n\hbar\omega_0$,
postulated by Planck in the theory of blackbody radiation (36.1.5).

38.5.6 The minimum energy that can be had by a harmonic
oscillator is called the zero-point energy $E_0$:

$$E_0 = \frac{h\nu_0}{2} = \frac{\hbar\omega_0}{2} \quad (\text{at } n = 0).$$

In classical physics and in Planck's theory it was assumed that
$E_0 = 0$ (at n = 0). This means that the oscillator does not
vibrate and is in a state of equilibrium. Atoms, as oscillators,
should not, according to classical physics, execute any vibrations
at the temperature of absolute zero (T = 0). It has been
proved in modern quantum mechanics that the zero-point energy
of a harmonic oscillator cannot be reduced however much it
is cooled, even down to absolute zero (11.8.4). Zero-point energy
of the oscillator corresponds to its zero vibration. In quantum
mechanics, zero-point energy is a typical feature of any system
of particles. At temperatures close to absolute zero, a substance
is in a condensed state and its atoms (or molecules or ions) are
dealt with as vibrating oscillators. Zero-point energy is the
least amount of energy that a quantum oscillator should have
in its lowest energy state (at n = 0) in order to comply
with the indeterminacy relation (38.6.2).

[Fig. 38.4]

[Fig. 38.5]

[Fig. 38.6]

38.5.7 The probability of finding a quantum linear harmonic
oscillator on the OX axis in the region from x to x + dx is

$$w_{qu}(x)\,dx = |\psi_n(x)|^2\,dx.$$

Compared in Fig. 38.5 are the quantum probability density at
n = 1 and the classical probability density $w_{cl}(x)$. The
existence of nonzero values of $w_{qu}(x)$ beyond the limits of the
classically allowed region, i.e. at $|x| > a$, is explained by the
feasibility of the passing of particles possessing wavelike
properties through a potential barrier (38.7.2).

[Fig. 38.7]

38.5.8 As the number n is increased, the probability distribution
curve $|\psi_n|^2$, illustrated in Fig. 38.6 for n = 10, becomes
more and more like the classical probability distribution curve
(Fig. 38.3). This agrees with Bohr's correspondence principle
(38.4.4).

38.5.9 Making use of the eigenfunctions of the energy of a linear
harmonic oscillator, the average energy $\langle E \rangle$ of such
an oscillator can be calculated:

$$\langle E \rangle = \frac{\hbar\omega_0}{2} + \frac{\hbar\omega_0}{\exp(\hbar\omega_0/kT) - 1} = \frac{\hbar\omega_0}{2} + \langle E_1 \rangle.$$

Accurate to within zero-point energy, $\langle E \rangle$ is
expressed by the term $\langle E_1 \rangle$, which was obtained by
Planck in developing his theory of the thermal radiation of a
blackbody. Graphically, this term is represented in Fig. 38.7,
at T = const, as a function of the frequency. It is evident that
the greatest contribution to $\langle E_1 \rangle$ is made by
vibrations with low frequencies, corresponding to long wavelengths.

At a high temperature ($kT \gg \hbar\omega_0$)

$$\langle E_1 \rangle = \frac{\hbar\omega_0}{\exp(\hbar\omega_0/kT) - 1} \approx kT.$$

This result coincides with that obtained from the principle of
equipartition of energy (10.6.4).
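
The high-temperature limit can be illustrated numerically. In the sketch below the oscillator frequency `OMEGA0` and the temperatures are assumed example values, and the function names are hypothetical helpers; the code evaluates $\langle E_1 \rangle$ and compares it with kT:

```python
import math

HBAR = 1.054571817e-34  # reduced Planck constant, J*s
K_B = 1.380649e-23      # Boltzmann constant, J/K

def planck_term(omega0, temperature):
    """<E_1> = hbar*omega0 / (exp(hbar*omega0 / kT) - 1)."""
    quantum = HBAR * omega0
    return quantum / math.expm1(quantum / (K_B * temperature))

def mean_energy(omega0, temperature):
    """<E> = zero-point energy + <E_1>."""
    return 0.5 * HBAR * omega0 + planck_term(omega0, temperature)

OMEGA0 = 1.0e13  # assumed natural angular frequency, rad/s

for temperature in (10.0, 300.0, 1.0e4):
    ratio = planck_term(OMEGA0, temperature) / (K_B * temperature)
    print(f"T = {temperature:8.1f} K: <E_1>/kT = {ratio:.4f}")
```

At low temperatures the ratio is far below unity (the vibration is "frozen out"), and it tends to unity as kT becomes much greater than $\hbar\omega_0$.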


38.6  Heisenberg  Indeterminacy  Principle 

38.6.1 The wavelike properties of microparticles (38.1.4)
introduce limitations into the possibility of applying the
concepts of coordinates and momentum in their classical form to
such particles.

Limitations also exist in classical physics in the application of
certain concepts to definite items. It is of no meaning, for
instance, to apply the idea of temperature to a single molecule;
the concept of point localization (location at a certain point)
is inapplicable in determining the position of a wave in space,
etc. But in classical mechanics, a definite value of the particle's
coordinate corresponds to exact values of its velocity and
momentum. In quantum mechanics there exist fundamental
restrictions in the possibility of simultaneously determining the
exact coordinate of a particle and its exact momentum. These
restrictions are associated with the wave-particle dualism of
the properties of microparticles (38.1.3).

38.6.2 The Heisenberg indeterminacy relations are the inequalities

$$\Delta x \cdot \Delta p_x \geq \frac{\hbar}{2}, \quad \Delta y \cdot \Delta p_y \geq \frac{\hbar}{2}, \quad \Delta z \cdot \Delta p_z \geq \frac{\hbar}{2}.$$

Here $\Delta x$, $\Delta y$ and $\Delta z$ are the ranges of the
coordinates within which a particle, described by a de Broglie wave
(38.1.4), may be localized if the projections of its momentum along
the coordinate axes are within the ranges $\Delta p_x$, $\Delta p_y$
and $\Delta p_z$. The Heisenberg relations indicate that the
coordinates x, y, and z of the particle and the projections $p_x$,
$p_y$ and $p_z$ of its momentum on the corresponding axes cannot
simultaneously have values exactly equal to x and $p_x$, y and
$p_y$, and z and $p_z$. These physical quantities can have values
given to an accuracy determined by the Heisenberg relations. The
more accurately the position of the particle is determined, i.e.
the smaller the values of $\Delta x$, $\Delta y$ and $\Delta z$, the
less accurately the values are determined for the projections of
its momentum (i.e. the greater are $\Delta p_x$, $\Delta p_y$
and $\Delta p_z$). If the position of the particle along the OX
axis is exactly determined and $\Delta x = 0$, then
$\Delta p_x = \infty$ and the value of the projection $p_x$ of the
momentum on this axis becomes entirely indeterminate.

38.6.3 The indeterminacy relations put definite restrictions in
quantum mechanics on the feasibility of describing the motion
of a particle along a certain path.

In classical theory each point of the path of a particle has
definite coordinates x, y, and z and the particle has a definite
momentum p with projections $p_x$, $p_y$ and $p_z$ along the
coordinate axes. This is realized in quantum mechanics only in
cases when the particle travels in a macroscopic region of space
(for instance, the particle may leave a trace on a photographic
plate or on the screen of an oscillograph). If, for example, the
position of an electron is fixed with an accuracy determined by the
linear dimensions of a grain of photographic emulsion, subject to
the effect of the electron, then $\Delta x \approx 10^{-6}$ m. This
corresponds to an indeterminacy in the momentum equal to
$\Delta p_x \geq \hbar/2\Delta x \sim 10^{-28}$ kg·m/s and in the
velocity equal to $\Delta v_x = \Delta p_x/m \approx 10^2$ m/s.
This indeterminacy, at electron velocities of the order
of $10^6$ to $10^7$ m/s, enables us to assume that the electron is
travelling along a definite path with an exact velocity specified
at each point of this path.

38.6.4 If the particle is travelling in a microscopic region of
space, the indeterminacy relations substantially affect the nature
of its motion. For example, the position of an electron travelling
in an atom can be determined with an accuracy to the
dimensions of the atom, i.e. $\Delta x \approx 10^{-10}$ m. But the
indeterminacy of the electron's velocity $\Delta v_x$, in this
case, is found to be of the same order of magnitude as the velocity
itself: $\Delta v_x \approx 10^6$ m/s $\approx v$. The path of an
electron in an atom with the exact velocity specified at each point
does not have the meaning it has in classical mechanics (1.2.3).
This by no means indicates that the indeterminacy principle places
fundamental limitations on our knowledge of the microcosm. It only
indicates the limited applicability of the concepts of classical
physics to the region of the microcosm.

38.6.5 No restrictions are made by the indeterminacy principle
with respect to the feasibility of applying the conception of the
coordinate and momentum of a macroscopic body in the classical
sense. The wave properties of such bodies are not manifested
in any way (38.1.6) and, consequently, the indeterminacy
relations are not applied to them.
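
The order-of-magnitude estimates of Sects. 38.6.3 to 38.6.5 can be reproduced with a few lines of code. The sketch below evaluates the lower bounds $\Delta p_x = \hbar/2\Delta x$ and $\Delta v_x = \Delta p_x/m$; the macroscopic body (a grain of mass $10^{-6}$ kg) is an assumed example, and the function names are hypothetical helpers:

```python
HBAR = 1.054571817e-34      # reduced Planck constant, J*s
M_ELECTRON = 9.1093837e-31  # electron rest mass, kg

def momentum_indeterminacy(delta_x):
    """Lower bound on delta p_x from delta x * delta p_x >= hbar / 2."""
    return HBAR / (2.0 * delta_x)

def velocity_indeterminacy(delta_x, mass_kg):
    """Corresponding lower bound on delta v_x = delta p_x / m."""
    return momentum_indeterminacy(delta_x) / mass_kg

dv_grain = velocity_indeterminacy(1.0e-6, M_ELECTRON)  # electron, emulsion grain
dv_atom = velocity_indeterminacy(1.0e-10, M_ELECTRON)  # electron inside an atom
dv_macro = velocity_indeterminacy(1.0e-6, 1.0e-6)      # assumed 1 mg body

print(f"electron, dx = 1e-6 m : dv >= {dv_grain:.1e} m/s")
print(f"electron, dx = 1e-10 m: dv >= {dv_atom:.1e} m/s")
print(f"1 mg body, dx = 1e-6 m: dv >= {dv_macro:.1e} m/s")
```

The bounds agree in order of magnitude with the values quoted in the text; for the macroscopic body the bound is immeasurably small.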

38.6.6 The indeterminacy relations for energy E and time t are

$$\Delta E \cdot \Delta t \geq h,$$

where $\Delta E$ is the indeterminacy of the energy of a particle,
which is in a state with the energy E during the length of time
$\Delta t$. The longer a particle is in the given state, the more
accurately its energy in this state can be determined.

38.6.7 Measurement is the process of interaction of an instrument
with the item being investigated as a result of which certain
information is obtained on the properties of the item. This
process proceeds in space and time and is an objective process.
The interaction between the measuring instrument and the item
being measured essentially differs for microscopic and macroscopic
items. In the latter case, the measuring process is described
with more or less accuracy by the laws of classical physics, and
the instrument has no effect on the item being measured that
cannot be accurately taken into account in the terms and concepts
of classical physics or reduced to as small a value as desired.
In quantum mechanics, owing to the objectively existing
duality of the properties of microscopic items (38.1.3), the
measuring process is inevitably associated with substantial effects
of the instrument on the course of the phenomenon being
investigated. For example, to determine the position of the
electron it must be “illuminated” by light of the highest possible
frequency. As a result of the collision between the photon and the
electron, the momentum $p_x$ of the latter is changed by an amount
determined by the indeterminacy relation:
$\Delta p_x \geq \hbar/2\Delta x$ (38.6.2). This effect on the item
being measured cannot be considered small or inessential because
the state of the item is changed. This change is such that as a
result of measurement, definite classical characteristics of a
particle, for instance, its momentum, are found to be specified
only within the limits restricted by the indeterminacy relations.


38.7 Tunnel Effect

38.7.1 Potential wells in which particles are located may be of
much more complicated shape than those dealt with in Sect. 38.4.
If the potential energy of the particle is of the form U = U(r),
as shown in Fig. 38.8, the particle moves in a field of forces that
can be represented in the form of a potential barrier. This means
that the forces act on the particle in a certain limited region of
space. Outside of this region the particles move as free ones.
The height H of the barrier and its width a for a given particle
depend upon the energy E of the particle. In order for a particle
to get out of a potential well or to enter it from outside, it is
necessary, according to classical mechanics, to impart energy
to the particle equal to or greater than $|U_{\max} - E| = H$.

[Fig. 38.8]

38.7.2 The tunnel effect is the passing (“leaking”) of particles
through potential barriers. The tunnel effect is a
quantum-mechanical one and is due to the wavelike properties of the
particles (38.1.3).

The penetrability (penetrability factor) D of a potential barrier is
the quantity

$$D = \frac{I_{tran}}{I_{inc}},$$

where $I_{tran}$ is the intensity (30.3.6) of de Broglie waves
passing through the potential barrier, and $I_{inc}$ is the
intensity of the waves incident on the barrier.




The penetrability of a rectangular potential barrier of the height
$U_0$ and width L (Fig. 38.9) is

$$D = D_0 \exp\left[-\frac{2}{\hbar}\sqrt{2m(U_0 - E)}\;L\right],$$

where m is the mass of the particle, and E is its energy.

For a potential barrier of complex shape

$$D = D_0 \exp\left\{-\frac{2}{\hbar}\int_{x_1}^{x_2} \sqrt{2m[U(x) - E]}\;dx\right\},$$

where $x_1$ and $x_2$ are the coordinates of the points at the
beginning and end of the potential barrier U(x) for a given value E
of the total energy (Fig. 38.10). In these formulas $D_0$ is a
constant factor close to unity.

[Fig. 38.9]

[Fig. 38.10]

38.7.3 The tunnel effect plays an important role when the
penetrability is not too low. This occurs in the cases when the
linear dimensions of the potential barrier are commensurable with
atomic dimensions. For example, at $U_0 - E = 10$ eV for an
electron ($m \approx 10^{-30}$ kg) and $L = 10^{-10}$ m, we obtain
$D \approx 1/e$. Under the same conditions, excepting that
$L = 10^{-2}$ m, $D \approx \exp(-10^8)$. The penetrability of the
barrier decreases with an increase in the mass of the particle and
in the difference $U_0 - E$.
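
The figures quoted here are order-of-magnitude estimates. The sketch below evaluates the exponent of the rectangular-barrier formula of Sect. 38.7.2 for the same two widths, assuming $D_0 = 1$ and the rounded mass $10^{-30}$ kg used in the text; the function names are hypothetical helpers:

```python
import math

HBAR = 1.054571817e-34  # reduced Planck constant, J*s
EV = 1.602176634e-19    # one electronvolt in joules

def barrier_exponent(mass_kg, deficit_j, width_m):
    """(2 / hbar) * sqrt(2 m (U0 - E)) * L for a rectangular barrier."""
    return 2.0 * math.sqrt(2.0 * mass_kg * deficit_j) * width_m / HBAR

def penetrability(mass_kg, deficit_j, width_m, d0=1.0):
    """D = D0 * exp(-exponent); underflows to 0.0 for very wide barriers."""
    return d0 * math.exp(-barrier_exponent(mass_kg, deficit_j, width_m))

MASS = 1.0e-30       # rounded electron mass used in the text, kg
DEFICIT = 10.0 * EV  # U0 - E

exp_thin = barrier_exponent(MASS, DEFICIT, 1.0e-10)
exp_thick = barrier_exponent(MASS, DEFICIT, 1.0e-2)
print(f"L = 1e-10 m: exponent = {exp_thin:.2f}, D = {penetrability(MASS, DEFICIT, 1.0e-10):.3f}")
print(f"L = 1e-2 m : exponent = {exp_thick:.2e}")
```

With these assumptions the exponent is of the order of unity for the barrier of atomic width and of the order of $10^8$ for the barrier of macroscopic width, so that the penetrability in the second case is vanishingly small.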

38.7.4 The paradox of the tunnel effect is that the passing of
a particle through the potential barrier enables it to be found
in a region that is forbidden from the classical point of view.
In this region the potential energy of the particle exceeds its
total energy; its kinetic energy is negative and, consequently,
the velocity (momentum) of the particle becomes an imaginary
quantity.

Actually no such paradox exists. The tunnel effect is a purely
quantum-mechanical phenomenon and here an unexpected
difficulty arises, from the classical point of view, in
representing the total energy E in the form of the sum of the
kinetic and potential energies: $E = p^2/2m + U(x)$. Owing to the
fact that the indeterminacy relation (38.6.2) excludes the
possibility of simultaneously specifying definite values, to any
degree of accuracy, for the coordinate x and momentum p of a
particle, it is found to be inadmissible in quantum mechanics to
represent the total energy as the sum of two exactly definite
parts: the kinetic $p^2/2m$ and potential U(x) energies. If the
coordinate x of the particle is fixed in the region $\Delta x$ and
its potential energy U(x) is determined with sufficient accuracy,
then the indeterminacy $\Delta p$ introduced into the value of the
particle's momentum ($\Delta p \approx \hbar/\Delta x$) excludes
the possibility of specifying the exact value of the kinetic energy
$p^2/2m$ of the particle. Consequently, the previous value of the
particle's total energy E cannot be specified either. It is found,
in this case, that the change in the kinetic energy $\Delta E_k$,
caused by fixing the particle's coordinate, exceeds the difference
between the barrier height $U_0$ (Fig. 38.9) and the total energy E
of the particle:

$$\Delta E_k > U_0 - E.$$

Thus, $\Delta E_k$ exceeds the energy that the particle, inside the
potential well, lacks to surmount the barrier “in the classical
way” to get out of the well.

The role played by the tunnel effect in solid-state physics and
in the phenomenon of radioactive decay is dealt with in Sects.
41.8.3 and 42.5.2.


CHAPTER 39 STRUCTURE OF ATOMS
AND MOLECULES AND THEIR
OPTICAL PROPERTIES


39.1  The  Hydrogen  Atom  and  Hydrogen-Like  Ions 


39.1.1 An atom is the smallest particle of matter possessing all
the chemical properties of the given chemical element. An atom
consists of a positively charged nucleus and electrons moving
about in the electric field of the nucleus. The nuclear charge Ze
(42.1.2) is equal in magnitude to the total charge of all the
electrons of the atom. An ion is the electrically charged particle
formed when an atom or molecule loses or gains electrons.

39.1.2 The simplest atomic system is the hydrogen atom,
consisting of a single proton in the nucleus and a single electron
moving in the Coulomb field of the nucleus. Hydrogen-like ions
(isoelectronic with hydrogen) are such as He$^{+}$, Li$^{++}$ and
Be$^{+++}$, which also have a nucleus with the charge Ze and a
single electron.

39.1.3 Among the optical properties of atoms, the most important
is their emission spectrum. The frequency $\nu$ of the lines
in the discrete line spectrum of hydrogen is described by the
Balmer-Rydberg formula:

$$\nu = cR'\left(\frac{1}{n^2} - \frac{1}{m^2}\right) = R\left(\frac{1}{n^2} - \frac{1}{m^2}\right),$$

where

$$R = \frac{me^4}{8\varepsilon_0^2 h^3} \quad \text{(in SI units)},$$

$$R = \frac{2\pi^2 me^4}{h^3} \quad \text{(in Gaussian units)}.$$

Here c is the velocity of light in free space, m is the mass of the
electron, e is the electron charge, h is Planck's constant, and
$\varepsilon_0$ is the electric constant in SI units (14.2.7).

Quantities R and $R' = R/c$ are called the Rydberg constant in
s$^{-1}$ and in cm$^{-1}$ or m$^{-1}$, respectively:
$R = 3.2931193 \times 10^{15}$ s$^{-1}$
and $R' = 1.0973531 \times 10^{7}$ m$^{-1}$.

The whole numbers n and m are called the principal quantum
numbers, and m = n + 1, n + 2, etc. A group of lines having
the same n value is called a series. The hydrogen spectrum has
the following series: Lyman series at n = 1, Balmer series at
n = 2, Paschen series at n = 3, Brackett series for n = 4, Pfund
series for n = 5 and the Humphreys series for n = 6.

For hydrogen-like ions (39.1.1) the Balmer-Rydberg formula is

$$\nu = Z^2 R\left(\frac{1}{n^2} - \frac{1}{m^2}\right),$$

where Z is the atomic number of the element in the Mendeleev
periodic table (39.3.5).
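
The Balmer-Rydberg formula lends itself to direct computation. The sketch below evaluates R from the SI expression given above, using present-day values of the constants (an assumption: these are not the values used by the handbook), and then computes line frequencies and wavelengths; the function names are hypothetical helpers:

```python
M_ELECTRON = 9.1093837e-31  # electron rest mass, kg
E_CHARGE = 1.602176634e-19  # elementary charge, C
EPS0 = 8.8541878128e-12     # electric constant, F/m
H_PLANCK = 6.62607015e-34   # Planck constant, J*s
C_LIGHT = 2.99792458e8      # velocity of light in free space, m/s

def rydberg_frequency():
    """R = m e^4 / (8 eps0^2 h^3), in 1/s (nucleus treated as infinitely heavy)."""
    return M_ELECTRON * E_CHARGE**4 / (8.0 * EPS0**2 * H_PLANCK**3)

def line_frequency(n, m, z=1):
    """nu = Z^2 R (1/n^2 - 1/m^2) for a transition between levels m > n."""
    if m <= n:
        raise ValueError("m must exceed n")
    return z**2 * rydberg_frequency() * (1.0 / n**2 - 1.0 / m**2)

def line_wavelength(n, m, z=1):
    """Vacuum wavelength c / nu of the same line, in metres."""
    return C_LIGHT / line_frequency(n, m, z)

print(f"R = {rydberg_frequency():.4e} 1/s")
for m in (3, 4, 5):  # first lines of the Balmer series (n = 2)
    print(f"n = 2, m = {m}: wavelength = {line_wavelength(2, m) * 1e9:.1f} nm")
```

The first lines of the Balmer series fall in the visible region, and for a hydrogen-like ion all frequencies are increased by the factor $Z^2$.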

39.1.4 Each value of the quantum number n in the Balmer-Rydberg
formula corresponds to the series limit with the maximum
frequency (at $m = \infty$), called the term $T_n$. Thus

$$T_n = \frac{R}{n^2} \ \text{(for hydrogen)}, \quad T_n = \frac{Z^2 R}{n^2} \ \text{(for hydrogen-like ions)}.$$


39.1.5 The energy of a hydrogen-like ion in a state with the
principal quantum number n is:

$$E_n = -\frac{Z^2 R h}{n^2}, \quad T_n = \frac{|E_n|}{h}.$$

The magnitude $|E_n|$ is called the binding energy of the electron
in the atom. The minimum value $E_1$ (at n = 1) corresponds
to the ground, or normal, state of the atom (39.1.9). All energy
values at n > 1 characterize excited states of the atom. The most
vital distinguishing feature of excited states is the finite
lifetime $\tau$ of the electron in these states:
$\tau \approx 10^{-8}$ s. In the ground state of the atom, isolated
from all external effects, $\tau$ is unlimited. The maximum energy
value $E_{\max} = 0$ as $n \to \infty$ corresponds to ionization of
the atom or ion, i.e. to the removal of an electron from it.

In magnitude, the ionization energy is equal to the binding
energy of an electron in the atom (or ion).

The ionization potential (16.2.6) of an atom of hydrogen or of
a hydrogen-like ion in the state with the principal quantum
number n is equal to $V = Z^2Rh/en^2$, where e is the magnitude of
the charge of the electron. A diagram of the energy levels in the
hydrogen atom and the arrangement of its series of spectral
lines are given in Fig. 39.1.
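
The energy levels and ionization potentials are readily tabulated. The sketch below works in electronvolts and assumes the present-day value of the product Rh (about 13.6 eV), which is not quoted in the handbook; since V = |E_n|/e, the potential in volts is numerically equal to the binding energy in electronvolts. The function names are hypothetical helpers:

```python
RYDBERG_EV = 13.605693  # assumed value of R*h expressed in electronvolts

def level_energy_ev(n, z=1):
    """E_n = -Z^2 R h / n^2, in electronvolts."""
    if n < 1:
        raise ValueError("the principal quantum number n must be 1, 2, 3, ...")
    return -(z**2) * RYDBERG_EV / n**2

def ionization_potential_v(n, z=1):
    """V = Z^2 R h / (e n^2), in volts (numerically |E_n| in eV)."""
    return abs(level_energy_ev(n, z))

for n in (1, 2, 3):
    print(f"H, n = {n}: E_n = {level_energy_ev(n):8.3f} eV")
print(f"He+ ground state: V = {ionization_potential_v(1, z=2):.2f} V")
```

The levels crowd together as n increases and converge to $E_{\max} = 0$, the ionization limit.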


[Fig. 39.1: energy levels of the hydrogen atom and its spectral
series; wavenumber axis in cm$^{-1}$]

39.1.6 The spectrum and the energy levels of the hydrogen
atom were first interpreted by means of Bohr's postulates.
Bohr's first postulate (steady states postulate) states that there
is a set of stationary, or steady, states of the atom in which it
does not radiate electromagnetic waves. The steady states
correspond to steady orbits (39.1.8) along which electrons travel
with acceleration, but no light is radiated (cf. Sect. 31.3.4).
Bohr's first postulate was explained by quantum mechanics
(38.2.6).

Bohr's second postulate (orbit quantization rule) states that in
the steady state of the atom, the electron, travelling along a
circular orbit, has quantized values of its angular momentum
that comply with the condition

$$L_k = m v r = k\hbar \quad (k = 1, 2, 3, \ldots).$$

Here $m$ is the mass of the electron, $v$ is its velocity, $r$ is the radius
of the $k$th orbit, and $\hbar = h/2\pi$. The whole number $k$ is
equal to the number of de Broglie wavelengths for the electron
(38.1.4) per length of the circular orbit: $2\pi r/\lambda = 2\pi r m v/h = k$.
The quantization of the angular momentum in quantum mechanics
is discussed in Sect. 39.1.8.

Bohr's third postulate (the Bohr frequency condition) states that
in going from one steady state orbit to another, the atom either
emits or absorbs one photon. A photon is emitted when the atom
passes from a state with more energy to one with less energy.
Upon transition in the opposite direction, a photon is absorbed.
The energy $h\nu$ of the photon is equal to the difference in energy
in the two states. Thus

$$E_n - E_m = h\nu.$$

When $E_n > E_m$ a photon is emitted; when $E_n < E_m$ a photon
is absorbed.
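The frequency condition can be tried out numerically. The sketch below
combines it with the level formula $E_n = -Z^2Rh/n^2$ of Sect. 39.1.5
to estimate the wavelength of the photon emitted in a transition
between two levels. The values of $R$ and $c$ are assumed CODATA
figures, $R$ refers to an infinitely heavy nucleus (so the results are
slightly off the measured hydrogen lines), and the function names are
illustrative only.

```python
# Sketch: photon emitted in a transition between two levels of a
# hydrogen-like ion, from h*nu = E_upper - E_lower with E_n = -Z^2 R h / n^2.
C_LIGHT = 299792458.0          # speed of light, m/s (assumed value)
RYDBERG_HZ = 3.2898419603e15   # Rydberg constant R as a frequency, 1/s (assumed)


def photon_frequency(n_upper, n_lower, z=1):
    """Frequency (Hz) of the photon emitted in the transition n_upper -> n_lower."""
    if n_upper <= n_lower:
        raise ValueError("emission requires n_upper > n_lower")
    # Planck's constant cancels: nu = (E_upper - E_lower) / h
    return z**2 * RYDBERG_HZ * (1.0 / n_lower**2 - 1.0 / n_upper**2)


def photon_wavelength_nm(n_upper, n_lower, z=1):
    """Wavelength (nm) of the same photon."""
    return C_LIGHT / photon_frequency(n_upper, n_lower, z) * 1e9


if __name__ == "__main__":
    print(f"3 -> 2: {photon_wavelength_nm(3, 2):.1f} nm")  # visible (Balmer) line
    print(f"2 -> 1: {photon_wavelength_nm(2, 1):.1f} nm")  # ultraviolet (Lyman) line
```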

In  quantum  mechanics,  Bohr’s  frequency  condition  follows  from 
the  theory  of  quantum  transitions  from  one  energy  state  to 
another.  Information  on  this  theory  is  beyond  the  scope  of  the 
present  handbook. 

39.1.7 Schrödinger's steady-state equation (38.2.5) for the
motion of an electron in the Coulomb field of a nucleus with the
charge $Ze$ (42.1.2) is of the form

$$\Delta\psi + \frac{2m}{\hbar^2}\,[E - U(r)]\,\psi = 0,$$

where $U(r) = -Ze^2/4\pi\varepsilon_0 r$ is the potential energy of an electron
located at the distance $r$ from the nucleus, $\varepsilon_0$ is the electric constant
in SI units (14.2.7), and $E$ is the total energy of the electron
in the atom, which it is necessary to determine under the
assumption that the wave functions $\psi$ comply with the conditions
indicated in Sect. 38.2.4.

39.1.8  A  solution  of  the  Schrodinger  steady-state  equation  for 
an  electron  in  a  centrally  symmetrical  Coulomb  field  of  the 
nucleus  yields  the  following: 

(a) the angular momentum of the electron in the atom is quantized
according to the formula:

$$L_l = \sqrt{l(l+1)}\,\hbar,$$

where the orbital quantum number $l$, determining the angular
momentum, varies within the limits: $l = 0, 1, \ldots, (n - 1)$,
where $n$ is the principal quantum number (39.1.3);

(b) at $E < 0$, when the electron is “bound” in the atom, its motion
is periodic, and the value of the total energy $E$ is quantized.
The eigenvalues $E_n$ (38.2.5) are determined by the formula:

$$E_n = -\frac{Z^2 m e^4}{8 h^2 \varepsilon_0^2}\,\frac{1}{n^2} = -\frac{Z^2 R h}{n^2} \quad \text{(in SI units)},$$

$$E_n = -\frac{2\pi^2 Z^2 m e^4}{h^2}\,\frac{1}{n^2} = -\frac{Z^2 R h}{n^2} \quad \text{(in Gaussian units)}.$$

Here $R$ is the Rydberg constant (39.1.3). The solution of the
Schrödinger equation for an electron in a hydrogen-like ion
leads to energy levels of the Balmer-Rydberg type (39.1.4).
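Comparing the two forms of the SI expression for $E_n$ gives
$R = me^4/(8\varepsilon_0^2 h^3)$. The following sketch checks this
numerically. All constant values are assumed CODATA figures that are
not part of this handbook, and the function names are illustrative.

```python
# Sketch: Rydberg constant (as a frequency) from the SI energy formula
# E_n = -Z^2 m e^4 / (8 h^2 eps0^2 n^2) = -Z^2 R h / n^2.
M_E = 9.1093837015e-31       # electron mass m, kg (assumed value)
E_CHARGE = 1.602176634e-19   # electron charge magnitude e, C (assumed value)
EPS0 = 8.8541878128e-12      # electric constant eps0, F/m (assumed value)
H_PLANCK = 6.62607015e-34    # Planck constant h, J*s (assumed value)


def rydberg_frequency():
    """R = m e^4 / (8 eps0^2 h^3), in 1/s."""
    return M_E * E_CHARGE**4 / (8.0 * EPS0**2 * H_PLANCK**3)


def energy_ev(n, z=1):
    """Eigenvalue E_n = -Z^2 R h / n^2 converted to electron-volts."""
    return -z**2 * rydberg_frequency() * H_PLANCK / (n**2 * E_CHARGE)


if __name__ == "__main__":
    print(f"R   = {rydberg_frequency():.6e} 1/s")
    print(f"E_1 = {energy_ev(1):.4f} eV")
```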

39.1.9 In accordance with the orbital quantum number, the
following letters have been assigned to designate the state of
the electron in atoms:

s-state at $l = 0$, p-state at $l = 1$,
d-state at $l = 2$, f-state at $l = 3$, etc.

The s-state of the electron in the hydrogen atom at $n = 1$ is
called the ground state (see also Sect. 39.1.5). This state is
spherically symmetrical. The wave function of this state depends
only upon the distance $r$ of the electron from the nucleus
[$\psi = \psi(r)$] and is of the form

$$\psi = \psi(r) = C e^{-r/a_0},$$

where $C$ is a constant determined from the normalization condition
for probabilities (38.2.2), and $a_0$ is the first Bohr radius:

$$a_0 = \frac{4\pi\varepsilon_0 \hbar^2}{m e^2} \quad \text{(in SI units)}, \qquad a_0 = \frac{\hbar^2}{m e^2} \quad \text{(in Gaussian units)}.$$
In  quantum  mechanics,  the  electron  orbits  in  the  atom  are  con¬ 
sidered  to  be  the  locus  of  points  at  which  the  probability  of 
finding  the  electron  is  a  maximum.  In  particular,  for  the  s-state 
of  the  hydrogen  atom,  such  an  orbit  is  the  first  circular  Bohr 
orbit  of  a  radius  equal  to  a0. 
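The last statement can be checked with a short numerical sketch. It
computes $a_0$ from the SI formula and then searches a grid of radii for
the maximum of $r^2|\psi|^2$, which is proportional to the probability
of finding the electron in a thin spherical layer of radius $r$. The
constant values are assumed CODATA figures, the normalization constant
$C$ is dropped because it does not affect the position of the maximum,
and the grid search is only one simple way of locating it.

```python
# Sketch: first Bohr radius and the most probable electron distance
# in the ground state psi = C * exp(-r / a0).
import math

EPS0 = 8.8541878128e-12      # electric constant, F/m (assumed value)
H_PLANCK = 6.62607015e-34    # Planck constant h, J*s (assumed value)
M_E = 9.1093837015e-31       # electron mass, kg (assumed value)
E_CHARGE = 1.602176634e-19   # electron charge magnitude, C (assumed value)
HBAR = H_PLANCK / (2.0 * math.pi)


def bohr_radius():
    """a0 = 4*pi*eps0*hbar^2 / (m e^2), in metres."""
    return 4.0 * math.pi * EPS0 * HBAR**2 / (M_E * E_CHARGE**2)


def radial_probability(r, a0):
    """Unnormalized probability density for a thin spherical layer: r^2 |psi|^2."""
    return r**2 * math.exp(-2.0 * r / a0)


def most_probable_radius(a0, steps=5000, r_max_in_a0=5.0):
    """Radius on a uniform grid (0, r_max] where radial_probability is largest."""
    radii = [a0 * r_max_in_a0 * k / steps for k in range(1, steps + 1)]
    return max(radii, key=lambda r: radial_probability(r, a0))


if __name__ == "__main__":
    a0 = bohr_radius()
    print(f"a0 = {a0:.5e} m")
    print(f"most probable r / a0 = {most_probable_radius(a0) / a0:.3f}")
```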


39.2  Space  Quantization 

39.2.1 Space quantization is the existence of definite discrete
orientations in space of the angular momentum vector $\mathbf{L}_l$ of
the electron. This phenomenon has been proven in quantum mechanics.
The only allowed orientations of $\mathbf{L}_l$ are those in which
the projection $L_{lz}$ of vector $\mathbf{L}_l$ along the direction $Z$ of the
external magnetic field assumes values that are multiples of $\hbar$.
Thus

$$L_{lz} = m\hbar,$$

where $m$ is a whole number called the magnetic quantum number
and assumes the values $m = 0, \pm 1, \pm 2, \ldots, \pm l$, and $l$ is
the orbital quantum number (39.1.8). Given in Fig. 39.2 is the
space quantization of vectors $\mathbf{L}_l$ for electrons in the p- and
d-states (39.1.9).
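The allowed orientations can be listed explicitly. In the sketch below
the angle between $\mathbf{L}_l$ and the direction $Z$ is obtained from
$\cos\theta = L_{lz}/L_l = m/\sqrt{l(l+1)}$, which follows from the
formulas of Sects. 39.1.8 and 39.2.1. The function name and the output
format are illustrative assumptions.

```python
# Sketch: allowed projections and orientations of the orbital angular
# momentum for a given orbital quantum number l.
import math


def orientations(l):
    """Return a list of (m, angle_deg) pairs for m = -l, ..., +l.

    angle_deg is the angle between L_l and the Z direction;
    it is None for l = 0, where the angular momentum is zero.
    """
    magnitude = math.sqrt(l * (l + 1))      # |L_l| in units of hbar
    result = []
    for m in range(-l, l + 1):              # L_lz = m * hbar
        if magnitude == 0.0:
            result.append((m, None))
        else:
            result.append((m, math.degrees(math.acos(m / magnitude))))
    return result


if __name__ == "__main__":
    for l, name in ((1, "p"), (2, "d")):
        print(f"{name}-state: {2 * l + 1} orientations")
        for m, angle in orientations(l):
            print(f"  m = {m:+d}: angle to Z = {angle:.1f} deg")
```

Note that the vector never points exactly along $Z$, since $|m| \le l < \sqrt{l(l+1)}$ for $l > 0$.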

39.2.2 Since the angular momentum vector of the electron is
proportional to the orbital magnetic moment of the electron
(26.1.2), it follows from space quantization that the vector
$\mathbf{p}_m$ of the orbital magnetic moment of the electron in
the external magnetic field cannot assume arbitrary orientations.

39.2.3 It was experimentally established that space quantization
is observed in atoms having a single outer-shell (valence)
electron in the s-state (where $l = 0$) (39.1.9). In this state the
atoms lack orbital angular momentum ($L_0 = 0$; see Sect. 39.1.8).
Space quantization, detected in such experiments, pertained to
electron spin, and confirmed the existence of two possible spin
orientations in the external magnetic field (26.1.3).

The magnitude of the spin angular momentum $L_{ls}$ of
the electron is determined by the formula (39.1.8):

$$L_{ls} = \sqrt{s(s+1)}\,\hbar,$$

where $s$ is the spin quantum number, equal to $s = 1/2$. Therefore,
the numerical value of electron spin equals (26.1.3):

$$L_{ls} = \sqrt{\tfrac{1}{2}\left(\tfrac{1}{2} + 1\right)}\,\hbar = \frac{\sqrt{3}}{2}\,\hbar.$$

Space quantization of spin means that the projection $L_{lsz}$
of the spin vector $\mathbf{L}_{ls}$ on the direction of the external magnetic
field is determined by the formula


$$L_{lsz} = m_s \hbar,$$

where $m_s$ is the spin magnetic quantum number, differing from
the spin quantum number $s$ in that it can assume two values:
not only $+1/2$, but $-1/2$ as well*.

[Fig. 39.2: space quantization of the vector $\mathbf{L}_l$ for the p-state and the d-state]

* Frequently, no distinction is made between the two spin
quantum numbers $s$ and $m_s$, referring, in this case, to $m_s$ to
which two values, $\pm 1/2$, are assigned.


39.3 Pauli Exclusion Principle. Mendeleev's
Periodic Table

39.3.1 The simplest statement of the Pauli exclusion principle
is: in any atom there cannot be two electrons occupying two
identical steady states, which are specified by a set of four quantum
numbers: the principal $n$, orbital $l$, magnetic $m$ and spin
magnetic $m_s$ quantum numbers.

Also obeying the Pauli principle, in addition to electrons, are
other particles having half-integral spin (in $\hbar$ units). In any
system of fermions (41.2.4) no two particles can exist in identical
quantum states (41.1.4).

For the electrons in an atom, the Pauli principle is written as
follows:

$$Z_1(n, l, m, m_s) = 0 \ \text{or} \ 1,$$

where $Z_1(n, l, m, m_s)$ is the number of electrons in the state
specified by the given set of quantum numbers.

39.3.2 The maximum number $Z_2(n, l, m)$ of electrons occupying
states specified by a set of three quantum numbers, $n$, $l$
and $m$, and differing only in the orientations of the spins of the
electrons, is

$$Z_2(n, l, m) = 2,$$

because the spin magnetic quantum number $m_s$ can assume only
two values: $+1/2$ and $-1/2$.

39.3.3 The maximum number $Z_3(n, l)$ of electrons occupying
states specified by two quantum numbers, $n$ and $l$, is

$$Z_3(n, l) = 2(2l + 1).$$

This is based on the fact that for a given $l$ value vector $\mathbf{L}_l$ can
assume $(2l + 1)$ different orientations in space.

39.3.4 The maximum number $Z(n)$ of electrons occupying states
specified by the value $n$ of the principal quantum number is

$$Z(n) = \sum_{l=0}^{n-1} Z_3(n, l) = 2n^2.$$

Table 39.1 lists the maximum number of electrons occupying
states characterized by the given values of the principal $n$ and
orbital $l$ quantum numbers.

TABLE 39.1

            Number of electrons occupying the states      Maximum
 n  Shell   s        p        d        f        g         number of
            (l = 0)  (l = 1)  (l = 2)  (l = 3)  (l = 4)   electrons

 1  K       2        -        -        -        -          2
 2  L       2        6        -        -        -          8
 3  M       2        6        10       -        -         18
 4  N       2        6        10       14       -         32
 5  O       2        6        10       14       18        50
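The entries of Table 39.1 can be reproduced by direct enumeration. The
sketch below lists every set $(n, l, m, m_s)$ allowed by the rules of
Sects. 39.1.8, 39.2.1 and 39.2.3 and counts the sets, each of which can
hold at most one electron by the Pauli principle. The function names
are illustrative.

```python
# Sketch: counting electron states by enumerating the quantum numbers
# (n, l, m, m_s); one electron at most per distinct set (Pauli principle).
from collections import Counter

SPIN_PROJECTIONS = (+0.5, -0.5)   # allowed values of m_s


def allowed_states(n):
    """All distinct sets (n, l, m, m_s) for the principal quantum number n."""
    return [
        (n, l, m, ms)
        for l in range(n)              # l = 0, 1, ..., n - 1
        for m in range(-l, l + 1)      # m = -l, ..., +l
        for ms in SPIN_PROJECTIONS
    ]


def subshell_capacities(n):
    """Maximum number of electrons in each subshell l of the shell n."""
    return dict(Counter(l for (_, l, _, _) in allowed_states(n)))


if __name__ == "__main__":
    for n, shell in enumerate("KLMNO", start=1):
        print(shell, subshell_capacities(n), "total:", len(allowed_states(n)))
```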

39.3.5 The systematics for filling the electron states in atoms and
the periodicity with which changes occur in the properties of
chemical elements enabled them all to be arranged into Mendeleev's
periodic system of elements. Modern theory of the periodic
system is based on the following principles:

(a) The atomic number $Z$ of a chemical element is equal to the
total number of electrons in an atom of the given element.

(b)  The  state  of  the  electrons  in  an  atom  is  determined  by  a  set 
of  four  quantum  numbers:  n,  l,  m  and  ms.  The  distribution  of 
the  electrons  according  to  the  energy  states  should  comply  with 
the  principle  of  minimum  potential  energy:  as  the  number  of 
electrons  is  increased,  each  consecutive  electron  should  occupy 
the  energy  state  with  the  least  possible  energy. 

(c)  The  filling  of  the  energy  states  in  the  atom  by  electrons  should 
proceed  in  accordance  with  the  Pauli  principle  (39.3.1). 

39.3.6 An electron shell is a collection of all the electron states
in an atom, which have the same value of the principal quantum
number $n$ (39.1.3). Distinction is made between the following
electron shells (Table 39.1): K at $n = 1$, L at $n = 2$, M at $n = 3$,
N at $n = 4$, etc.

Inside  the  electron  shell,  the  electrons  are  distributed  among 
the  subshells,  each  of  which  corresponds  to  a  certain  value  of  the 
orbital  quantum  number  l  (39.1.8). 

39.3.7 The order for the filling of the electron energy states in an
atom in the shells and, within one shell, in the subshells, should
correspond to the sequence of the energy levels with the given $n$
and $l$ values, and comply with the Pauli principle (39.3.5b). For
light-weight atoms, this order consists of first filling the shell
with the lower value of $n$, and only then filling the next shell with
electrons. Inside a given shell, first the state with $l = 0$ is filled,
then the states with larger and larger $l$ values up to $l = n - 1$.
Such an ideal periodic system of elements would have a structure
and number of elements in each period (lengths of the periods)
corresponding to Table 39.1.

39.3.8 Deviations from the order indicated in Sect. 39.3.7 begin
with potassium ($Z = 19$) and are explained as follows. The interaction
between electrons in the atom leads, at sufficiently large
principal quantum numbers $n$, to a situation in which states with
larger $n$ and lesser $l$ values can have less energy, i.e. be more
favourable from the energy viewpoint than states with less $n$, but with
higher $l$ values. As a result we have elements with incompletely
filled preceding shells, but in which subsequent shells are being
filled. Chemical elements, in which the filling of preceding shells
is completed after the partial filling of subsequent shells, are called
transition elements.

39.3.9 The outer (valence) electrons of an atom are those in the
shell with the highest $n$ value that belong to the s- and p-subshells,
i.e. they have $l = 0$ or $l = 1$ (39.1.9). These electrons
determine the chemical and optical properties of the atoms (see
also Sect. 20.3.1). The total number of electrons in the s- and
p-subshells is equal to 8 (Table 39.1). Most chemical reactions are
based on giving up or attaching outer (valence) electrons. It
proves expedient, theoretically, for an atom with the (s + p)-subshell
less than half-filled to give up valence electrons, and for
one with a more than half-filled (s + p)-subshell to attach more
electrons from other atoms in the process of chemical reactions.


39.4  Chemical  Bonds  and  Molecular  Structure 

39.4.1  A  molecule  is  the  smallest  particle  of  a  given  substance, 
possessing  its  principal  chemical  properties.  Molecules  consist  of 
either  the  same  or  different  atoms  (39.1.1),  joined  together  by 
interatomic  chemical  bonds.  Chemical  bonds  are  based  on  various 
interactions  of  the  outer  (valence)  electrons  of  the  atoms  (39.3.9). 
This is indicated by the changes that occur in the optical spectrum,
which is governed by the behaviour of these electrons, when
molecules are formed from atoms.

A definite amount of work must be done to dissociate molecules
into their constituent atoms. Conversely, the formation of
molecules is accompanied by the evolution of energy. This shows
that there are forces which bind atoms into molecules. The energy
evolved in forming molecules is a measure of the forces of interaction
that join atoms into molecules.

39.4.2 At large distances $r$ between atoms, distances that considerably
exceed the linear dimensions $d$ of the atoms, forces of
mutual attraction exist between them. At distances $r$ comparable
to $d$, the forces become ones of mutual repulsion, which do not
permit the electrons of a given atom to penetrate too deeply into
the electron shells of an adjacent atom. As in the case of
interaction between molecules (12.1.4), the repulsive forces between
atoms are more short-ranged than those of attraction (Fig. 12.1).
The signs of the forces of attraction and repulsion are discussed
in Sect. 12.1.4.


39.4.3 At a certain distance $r_0$ between the atoms of a
diatomic molecule, the oppositely directed forces of attraction
and repulsion counterbalance each other and their vector sum
equals zero. The distance $r_0$ corresponds to the minimum mutual
(potential) energy $E_p(r)$ of the atoms of a diatomic molecule
(cf. Sect. 12.1.6) (Fig. 39.3). The equilibrium interatomic distance
$r_0$ in a molecule is called the bond length, or distance. The
quantity $D$ is called the dissociation, or bond, energy. It is
numerically equal to the work that must be done to dissociate
a molecule into its constituent atoms and to separate them to
infinite distances from one another. The dissociation energy is
equal to the energy evolved in forming a molecule, but is of
opposite sign. Dissociation energy is negative, whereas the energy
evolved in forming molecules is positive.

[Fig. 39.3: mutual potential energy of the atoms of a diatomic molecule as a function of the distance $r$ between them]

39.4.4 Heteropolar (ionic) molecules are ones formed as the result
of the conversion of interacting atoms into oppositely electrically
charged and mutually attracting ions (39.1.1). This kind of bond
between the atoms of a molecule is called a heteropolar (or ionic
or electrovalent) bond. The formation of heteropolar molecules is
based on the behaviour of atoms with differently filled outer
(s + p)-subshells, discussed in Sect. 39.3.9. Typical heteropolar
(ionic) molecules are those of the alkali-halide salts, formed by
ions from the elements of the I and VII groups of the periodic
system (39.3.5). These include: NaCl (Na$^+$Cl$^-$), CsI (Cs$^+$I$^-$) and
others. Metals of the first group have low ionization potentials $V$
(39.1.5), whereas atoms of group VII are characterized by a high
electron affinity, which is the amount of energy released when an
electron is attached to a metalloid atom. The transition of an
electron from a metal atom to a metalloid atom leads to the formation
of ions, each of which has a stable eight-electron (s + p)-subshell
(39.3.9). As a result, a stable heteropolar (ionic) molecule
is formed.

39.4.5 Homopolar (atomic) molecules are ones formed by the mutual
attraction of neutral atoms. The chemical bond of the atoms
of a homopolar molecule is said to be covalent. A covalent bond
has the property of being saturated (saturation of a covalent
bond), which is manifested as the definite valency of the atoms.
An atom of hydrogen can be bound to only one other atom, and
an atom of carbon to no more than four other atoms. The simplest
molecule with a covalent bond is the hydrogen molecule H2;
it consists of two electrons and two nuclei (protons).

The  basis  of  the  quantum-mechanical  explanation  of  the  covalent 
bond  in  the  H2  molecule  is  the  indistinguishability  of  identical 
particles,  e.g.  the  electrons  in  a  hydrogen  molecule  (see  also 
Sect.  41.1.2).  The  electrons  in  the  H2  molecule,  each  of  which 
“belongs”  to  a  definite  nucleus,  can  be  interchanged  and  this  still 
leads  to  the  same  system:  an  H2  molecule  consisting  of  two  elec¬ 
trons  and  two  nuclei.  The  indistinguishability  of  electrons  leads 
to  the  occurrence  of  a  special  quantum-mechanical  exchange  interac¬ 
tion  between  two  identical  electrons.  This  interaction  consists 
in  the  electron  of  each  of  the  atoms  in  the  hydrogen  molecule 
spending  a  certain  part  of  its  time  at  the  nucleus  of  the  other  atom. 
This constitutes the bond between the two atoms forming the
molecule. It follows from quantum-mechanical calculations that
when two hydrogen atoms approach each other to a distance
commensurable with the Bohr radius (39.1.9), and under the
condition that the spins of the electrons in the atoms are antiparallel
(26.1.3), the two atoms attract each other and a stable
H2 molecule is formed. With parallel spins of the electrons, the
two atoms of hydrogen repel each other, and no molecule of hydrogen
is formed.


39.5 Optical Properties of Molecules.
Molecular Spectra

39.5.1 Owing to their specific appearance, the spectra of molecules,
or molecular spectra, are called band spectra. They consist of a set
of more or less wide bands formed of closely spaced spectral lines.
Bands are observed in molecular spectra in the infrared, visible
and ultraviolet ranges of the scale of electromagnetic waves
(31.4.3). Closely spaced bands form a group of bands. Several
groups  of  bands  are  observed  for  the  simplest  diatomic  molecules. 
Continuous  broad  emission  (or  absorption)  bands  are  observed  in 
the  visible  and  ultraviolet  regions  of  the  spectra  of  polyatomic 
complex  molecules. 

39.5.2 As in the spectra of atoms (39.1.3), a separate spectral
line in a molecular spectrum appears as a result of a change in the
energy of a molecule. The total energy $E^m$ of a molecule can be
presented as the sum of the following parts, assumed independent
in a first approximation:

$$E^m = E^m_{tr} + E^m_e + E^m_v + E^m_r + E^m_n,$$

where $E^m_{tr}$ is the energy of translational motion of the centre of
mass (2.3.3) of the molecule, $E^m_e$ is the energy of motion of the
electrons in the molecule’s atoms, $E^m_v$ is the energy of vibrational
motion of the nuclei in the molecule’s atoms about their equilibrium
positions, $E^m_r$ is the energy of rotary motion of the molecule as
a whole, and $E^m_n$ is the energy of the nuclei in the molecule’s
atoms. The energy $E^m_{tr}$ is not quantized (38.4.3) and its change
cannot lead to the production of a molecular spectrum; the effect
of $E^m_n$ on the molecular spectrum can be neglected in a first
approximation. The energy $E'^m$ of the molecule, whose change
governs the molecular spectrum, constitutes the sum of three
components:

$$E'^m = E^m_e + E^m_v + E^m_r.$$

39.5.3 According to the Bohr frequency condition (39.1.6), the
frequency $\nu$ of the photon emitted by the molecule when its energy
state changes is equal to

$$\nu = \frac{\Delta E'^m}{h} = \frac{\Delta E^m_e}{h} + \frac{\Delta E^m_v}{h} + \frac{\Delta E^m_r}{h},$$

where $\Delta E^m_e$, $\Delta E^m_v$ and $\Delta E^m_r$ are the changes in the corresponding
parts of the energy $E'^m$, which assume discrete quantized values.
The production of closely spaced lines, forming bands at various
parts of the spectrum, is explained by the fact that

$$\Delta E^m_r \ll \Delta E^m_v \ll \Delta E^m_e.$$

In the far-infrared region of the spectrum (wavelengths of the
order of 0.1 to 1 mm), the transition of the molecule from one
rotational energy level to another produces spectral lines of a rotation
spectrum.

39.5.4 In the infrared region of the spectrum (wavelengths from
one to several dozen micrometres), transitions of molecules between
vibrational energy levels produce the lines of the vibrational
spectrum of the molecule. Upon changes in the vibrational energy
levels of the molecule, its rotational energy levels are also changed
at the same time. Hence, transitions of molecules between vibrational
levels are vibrational-rotational transitions, producing
a vibrational-rotational spectrum with the frequencies $\nu_{v\text{-}r}$
(Fig. 39.4). This spectrum consists of groups of closely spaced lines
determined by the fact that the given vibrational transition is
accompanied by rotational transitions.

[Fig. 39.4: vibrational and rotational energy levels with the transition frequencies $\nu_v$ and $\nu_{v\text{-}r}$]

39.5.5 The visible and ultraviolet regions of the spectrum are
produced as a result of transitions of the molecule between the
various electron energy levels. Each electron energy level corresponds
to various possible vibrations of the nuclei in the molecule,
i.e. to a set of vibrational energy levels. Transitions between
electron-vibrational levels lead to the production of an
electron-vibrational spectrum of the molecule, characterized by the
frequencies $\nu_{e\text{-}v}$ of the various lines (Fig. 39.5). Since each vibrational
energy state corresponds to a system of rotational levels (39.5.4),
each electron-vibrational transition corresponds to a certain
band. The whole electron-vibrational spectrum of the molecule in
the visible and nearby regions consists of a system of several
groups of bands.

[Fig. 39.5: electron energy levels and vibrational energy levels with the transition frequencies $\nu_{e\text{-}v}$]
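The hierarchy of the rotational, vibrational and electron energy
changes can be made tangible by converting typical wavelengths of the
three spectral regions into photon energies $h\nu = hc/\lambda$. In the
sketch below the constants are assumed CODATA values, and the three
sample wavelengths are arbitrary illustrative choices within the ranges
quoted in Sects. 39.5.3 to 39.5.5.

```python
# Sketch: photon energies typical of rotational, vibrational and
# electron transitions of molecules, from E = h*c / wavelength.
H_PLANCK = 6.62607015e-34    # Planck constant, J*s (assumed value)
C_LIGHT = 299792458.0        # speed of light, m/s (assumed value)
E_CHARGE = 1.602176634e-19   # J per electron-volt (assumed value)

# Illustrative wavelengths in metres (assumptions, one per spectral region).
SAMPLE_WAVELENGTHS = {
    "rotational (far infrared, 0.5 mm)": 0.5e-3,
    "vibrational (infrared, 10 micrometres)": 10e-6,
    "electron (visible, 500 nm)": 500e-9,
}


def photon_energy_ev(wavelength_m):
    """Photon energy in electron-volts for a wavelength given in metres."""
    return H_PLANCK * C_LIGHT / (wavelength_m * E_CHARGE)


if __name__ == "__main__":
    for label, wavelength in SAMPLE_WAVELENGTHS.items():
        print(f"{label}: {photon_energy_ev(wavelength):.4g} eV")
```

With these sample values each step up the hierarchy raises the photon
energy by roughly one to two orders of magnitude.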


39.6  Raman  Scattering  of  Light  * 

39.6.1 Raman scattering of light, or the Raman effect, refers to
the shifted frequencies $\nu_s$ and $\nu_a$ observed, along with the frequency
$\nu_0$ of the source of radiation, in the spectrum of light scattered
by a solid or liquid body. A spectral line with the frequency
$\nu_s = \nu_0 - \nu$ is called a Stokes (red) line; one with the frequency
$\nu_a = \nu_0 + \nu$ is called an anti-Stokes (violet) line. The set of lines
with the frequencies $\nu_s$ and $\nu_a$ forms the Raman spectrum of the
molecule, containing red and violet satellites of the line with the
frequency $\nu_0$. The intensity of the violet satellites is less than that
of the red ones, but increases with the temperature. The intensity
of the red satellites is practically independent of the temperature.

* Commonly called combination scattering in the Soviet literature.

39.6.2 The quantum-mechanical explanation of the Raman scattering
phenomenon consists in an analysis of the interaction between
a photon of incident light having the frequency $\nu_0$ and a
molecule at the ground vibrational energy level with the energy
$E^m_{1v}$. If, as a result of the interaction with the photon, the
molecule is moved to a higher vibrational energy level with the
energy $E^m_{2v} > E^m_{1v}$, the energy $\Delta E^m = E^m_{2v} - E^m_{1v}$ required for
this purpose is borrowed from the incident light. A photon with
the energy $h\nu_0$ is absorbed and, in its place, a photon is produced
with lower frequency $\nu$ and an energy equal to $h\nu = h\nu_0 - \Delta E^m$.
The frequency $\nu$ thereby appears in the scattered light. It equals

$$\nu = \nu_0 - \frac{\Delta E^m}{h}$$

and corresponds to the frequency of a red satellite. The whole set
of red satellites is produced in transitions of the molecule to
various excited vibrational energy states.

39.6.3 If a molecule at the vibrational energy level with the energy
$E^m_{2v}$ (39.6.2) moves, due to interaction with a photon having
the energy $h\nu_0$, to an energy state with less energy ($E^m_{1v}$), a violet
satellite of the frequency $\nu_a$ is produced in the scattered light
spectrum. This frequency

$$\nu_a = \nu_0 + \frac{\Delta E^m}{h},$$

where $\Delta E^m = E^m_{2v} - E^m_{1v}$.

The probability of Raman scattering with an increase in frequency
is less than the same scattering with a reduction in frequency.
This is due to the fact that the number of molecules at the higher
energy levels is less than the number at the lower levels. Consequently,
the intensity of violet satellites is less than that of the
red ones. When the temperature is raised, the molecule population
of the upper energy levels increases, as does the intensity of
the violet satellites. The number of molecules in the ground
state changes only slightly upon heating. Therefore, the intensity
of the red satellites remains practically unchanged as the
temperature is raised.
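The temperature behaviour described above can be estimated with
a short sketch. It places the red and violet satellites at
$\nu_0 \mp \Delta E^m/h$ and, as a simplifying assumption not made
explicitly in the text, takes the relative population of the excited
vibrational level from the Boltzmann factor $e^{-\Delta E^m/kT}$,
ignoring any frequency-dependent factors in the scattered intensity.
The constants are assumed CODATA values, and the 1000 cm$^{-1}$ shift
and 532 nm source are hypothetical examples.

```python
# Sketch: positions of Raman satellites and a Boltzmann estimate of the
# violet-to-red (anti-Stokes to Stokes) population ratio.
import math

H_PLANCK = 6.62607015e-34   # Planck constant, J*s (assumed value)
K_BOLTZ = 1.380649e-23      # Boltzmann constant, J/K (assumed value)
C_LIGHT = 299792458.0       # speed of light, m/s (assumed value)


def raman_lines(nu0, nu_shift):
    """Return (red satellite, violet satellite) frequencies in Hz."""
    return nu0 - nu_shift, nu0 + nu_shift


def upper_to_lower_population(nu_shift, temperature_k):
    """Boltzmann ratio exp(-h*nu_shift / (k*T)) of excited to ground molecules."""
    return math.exp(-H_PLANCK * nu_shift / (K_BOLTZ * temperature_k))


if __name__ == "__main__":
    nu0 = C_LIGHT / 532e-9        # hypothetical source wavelength of 532 nm
    nu_shift = C_LIGHT * 1.0e5    # hypothetical shift of 1000 cm^-1, in Hz
    red, violet = raman_lines(nu0, nu_shift)
    print(f"red: {red:.4e} Hz, violet: {violet:.4e} Hz")
    for t in (300.0, 600.0):
        print(f"T = {t:.0f} K: ratio = {upper_to_lower_population(nu_shift, t):.4f}")
```

Under these assumptions the ratio grows by about an order of magnitude
between 300 K and 600 K, while the ground-state population barely
changes.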


39.7  Luminescence.  X  rays 

39.7.1  Luminescence  is  the  radiation  of  light  by  bodies  at  a  tem¬ 
perature  T,  which  is  over  and  above  the  thermal  radiation  (36.1.1) 
at  this  temperature.  The  duration  of  luminescence  substantially 
exceeds  the  periods  of  radiation  of  atomic  systems.  Depending 
upon  the  method  applied  to  excite  the  luminescent  glow,  distinc¬ 
tion  is  made  between  cathodoluminescence,  electroluminescence, 
photoluminescence  and  chemiluminescence.  Luminescent  sub¬ 
stances  are  called  luminophors. 

39.7.2  Luminescent  radiation  is  a  nonequilibrium  one  (cf. 
Sect.  36.1.2)  and  is  caused  by  a  comparatively  small  number  of 
luminescence  centres — atoms,  molecules  or  ions — put  into  an  excit¬ 
ed  state  by  the  source  of  luminescence.  The  return  of  the  excited 
centre  to  its  ground  or  less  excited  state  is  accompanied  by  lumi¬ 
nescent  radiation.  The  duration  of  this  radiation  is  determined  by 
the  duration  of  the  excited  state,  which  depends  on  the  properties 
of  not  only  the  radiating  centres,  but  of  the  medium  surrounding 
them as well. The duration of a metastable excited state can reach
$10^{-4}$ s, thereby correspondingly increasing the duration of the
luminescence.

39.7.3 Luminescence that ceases immediately when the action of the
glow-exciting source stops is called fluorescence.
Luminescence that persists long after the action of the glow-exciting
source has ceased is called phosphorescence.

The phenomenon of fluorescence is associated with the transitions
of atoms, molecules or ions from ordinary excited states with
a duration of the order of $10^{-8}$ s to the ground state. Phosphorescence,
which provides for prolonged glow, is due to the transition
of luminescence centres from metastable states to the ground state
(39.7.2). The subdivision of luminescence into fluorescence and
phosphorescence is arbitrary; it is sometimes difficult to establish
a time-based boundary between them.

39.7.4 The excitation of luminescence by means of an electron
beam is feasible provided that the kinetic energy of a bombarding
electron satisfies the inequality

$$\frac{m v^2}{2} \ge E_e - E_g,$$

where $E_e$ and $E_g$ are the total energies of the luminescent particle
in the excited and ground states.

.‘19.7.5  Photoluminescence  is  excited  by  electromagnetic  radiation 
in  the  visible  or  ultraviolet  ranges  and  complies  with  the  Stokes 
law :  the  wavelength  of  photoluminescence  is  usually  longer  than 
that  of  the  exciting  light.  The  quantum-mechanical  basis  for  the 
Stokes  law  is  that  in  the  absorption  of  a  photon  of  exciting  light 
with  the  energy  hv,  a  photon  with  the  energy  Xvium,  less  than  hv, 
is  produced.  The  surplus  energy  hv  —  Mqum  =  Em,  where  Em  is 
energy  expended  on  various  processes  except  photoluminescence. 

As a rule, $E_m > 0$ and $\nu_{lum} < \nu$, i.e. $\lambda_{lum} > \lambda$ in accordance
with the Stokes law. Sometimes, anti-Stokes luminescent radiation
is observed. It complies with a condition that is the opposite of
the Stokes law, i.e. $\lambda_{lum} < \lambda$. This occurs when a definite part
of the energy of thermal motion of the particles of the luminescent
substance is added to the energy $h\nu$ of the photon that excites
the luminescence. Thus

$$h\nu_{lum} = h\nu + akT,$$

where $a$ is a factor depending upon the nature of the luminescent
substance, $k$ is Boltzmann's constant (8.4.5), and $T$ is the absolute
temperature.
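
The two energy balances of Sect. 39.7.5 can be tried out numerically. The following is a minimal sketch, not part of the handbook: the function names, the 400 nm exciting wavelength, the 0.5 eV loss and the value $a = 1$ are illustrative assumptions.

```python
# Physical constants in SI units.
H = 6.62607015e-34    # Planck's constant, J*s
C = 2.99792458e8      # speed of light, m/s
K_B = 1.380649e-23    # Boltzmann's constant, J/K
EV = 1.602176634e-19  # one electronvolt in joules


def stokes_wavelength(lambda_exc, e_m):
    """Wavelength of luminescence from h*nu_lum = h*nu - E_m (hypothetical helper)."""
    photon_energy = H * C / lambda_exc
    if not 0.0 <= e_m < photon_energy:
        raise ValueError("E_m must satisfy 0 <= E_m < h*nu")
    return H * C / (photon_energy - e_m)


def anti_stokes_wavelength(lambda_exc, a, temperature):
    """Wavelength of luminescence from h*nu_lum = h*nu + a*k*T (hypothetical helper)."""
    photon_energy = H * C / lambda_exc
    return H * C / (photon_energy + a * K_B * temperature)
```

With an assumed loss of 0.5 eV, `stokes_wavelength(400e-9, 0.5 * EV)` returns a wavelength longer than 400 nm, while `anti_stokes_wavelength(400e-9, 1.0, 300.0)` returns a slightly shorter one, as the two inequalities require.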

39.7.6  Photoluminescent  energy  efficiency  is  the  ratio  of  the  energy 
of  luminescent  radiation  to  the  energy  from  the  source  absorbed 
under  steady-state  conditions  by  the  luminophor. 

The quantum yield of photoluminescence is the ratio of the number
of photons of photoluminescent radiation to the number of photons
of the exciting light, the latter having constant energy. Vavilov's
law states that the photoluminescent energy efficiency increases
proportionally to the wavelength $\lambda$ of the absorbed radiation and
then, after reaching a maximum value in the wavelength range from $\lambda$
to $\lambda_{max}$, drops rapidly to zero upon a further increase in wavelength.
With an increase in the wavelength of the exciting light
of given energy, there is an increase in the number of photons
contained in the given energy of primary radiation. Each of
these photons can produce a photon of the energy $h\nu_{lum}$. Therefore,
the photoluminescent energy efficiency increases with the
wavelength $\lambda$. The sharp drop in energy efficiency at $\lambda = \lambda_{max}$
indicates that photons with a frequency less than $\nu_{min} = c/\lambda_{max}$
cannot excite the particles of the luminophor.

39.7.7 X rays are electromagnetic waves with a wavelength from
0.01 to 800 Å that are produced when high-speed electrons are braked
by a substance. There are two types of short-wave X rays.

At electron energies not exceeding a certain critical value that
depends upon the substance in which the electrons are braked,
white, or continuous, X rays (bremsstrahlung) are produced. They are
radiated by braked electrons and have a continuous spectrum, bounded on
the short-wavelength side by a certain boundary $\lambda_{min}$, called the
continuous-spectrum boundary. The boundary wavelength $\lambda_{min}$ depends
upon the kinetic energy $E_k$ of the electrons that produce the
bremsstrahlung and is reduced with an increase in the kinetic
energy $E_k$. The existence of $\lambda_{min}$ is explained by the fact that the
maximum energy $h\nu_{max}$ of a photon of X rays, produced at the
expense of the energy of the electron, cannot exceed $E_k$. Thus

$$h\nu_{max} = E_k.$$

Consequently, $\lambda_{min} = c/\nu_{max} = ch/E_k$. Making use of data on
the values of $E_k$ and $\lambda_{min}$, this formula enables Planck's constant
(Appendix II) to be determined. In its time this method was one
of the most accurate and reliable ones.
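
As a worked illustration of $\lambda_{min} = ch/E_k$, the sketch below assumes that the electrons are accelerated from rest through a potential difference $U$, so that $E_k = eU$. This assumption, the function names and the 30 kV example are not taken from the handbook.

```python
H = 6.62607015e-34         # Planck's constant, J*s
C = 2.99792458e8           # speed of light, m/s
E_CHARGE = 1.602176634e-19  # elementary charge, C


def lambda_min(voltage):
    """Continuous-spectrum boundary for electrons with E_k = e*U (assumed)."""
    e_k = E_CHARGE * voltage
    return H * C / e_k


def planck_from_boundary(e_k, lam_min):
    """Invert lambda_min = c*h/E_k to estimate Planck's constant from data."""
    return e_k * lam_min / C
```

For an assumed tube voltage of 30 kV, `lambda_min(30e3)` gives a boundary of about 0.41 Å. Feeding that pair of values back into `planck_from_boundary` recovers $h$, which is the method of determination mentioned in the text.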

39.7.8  The  second  type  of  X  rays,  characteristic  X  radiation  of 
atoms  of  matter,  has  a  line  spectrum.  It  is  an  individual  character¬ 
istic  of  a  substance  and  does  not  change  when  the  substance  enters 
a  chemical  compound.  It  follows  that,  contrary  to  optical  spectra, 
characteristic  X  rays  are  associated  with  processes  in  the  deep- 
lying  filled  inner  electron  shells  of  atoms  (39.3.6),  which  undergo 
no  changes  in  chemical  reactions  of  the  atoms.  Line  X-ray  spectra 
consist  of  lines  composing  several  series.  The  same  types  of  line 
series  are  observed  in  various  elements,  differing  only  in  that  the 
like  series  of  lines  in  the  heavier  elements  are  shifted  toward  the 
side  of  shorter  wavelengths. 

39.7.9 In the order of increasing wavelengths, the series of characteristic
X rays are called the K-, L-, M- and N-series. As an
electron is removed from one of the inner shells of an atom with
the nuclear charge $Ze$, the vacant space is occupied by an electron
from a shell farther from the nucleus and an X-ray photon is
emitted. Thus, after removing an electron from the K-shell, the
filling of the vacancy by electrons from L-, M- and other shells
produces the $K_\alpha$-, $K_\beta$- and $K_\gamma$-lines that form the K-series.
The frequency $\nu$ of the lines of characteristic X rays can be found
by Moseley's law:

$$\sqrt{\nu} = a(Z - b),$$

where $a$ is a constant for the given series of lines (in $s^{-1/2}$), and $b$
is the screening constant. This constant means that an electron
accomplishing a transition corresponding to a certain line
is subject, not to the whole charge $Ze$ of the nucleus, but to the
charge $(Z - b)e$, which has been weakened by the screening action
of other electrons.
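
Moseley's law can be illustrated for the $K_\alpha$ line. The handbook does not give numerical values of $a$ and $b$; the sketch below assumes the usual hydrogen-like estimate $a = \sqrt{3Rc/4}$ (with $Rc$ the Rydberg frequency) and $b = 1$, and its function names are hypothetical.

```python
import math

RYDBERG_FREQUENCY = 3.2898419603e15  # R*c in Hz
C = 2.99792458e8                     # speed of light, m/s


def k_alpha_frequency(z, b=1.0):
    """Frequency from sqrt(nu) = a*(Z - b), with a = sqrt(3*R*c/4) (assumed)."""
    a = math.sqrt(0.75 * RYDBERG_FREQUENCY)  # units: s**-0.5
    return (a * (z - b)) ** 2


def k_alpha_wavelength(z, b=1.0):
    """Wavelength of the K-alpha line corresponding to k_alpha_frequency."""
    return C / k_alpha_frequency(z, b)
```

For copper ($Z = 29$) this estimate gives a wavelength of roughly 1.5 Å, and the value decreases for heavier elements, in line with the shift toward shorter wavelengths described in Sect. 39.7.8.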


39.8  Stimulated  Emission  of  Radiation.  Lasers 

39.8.1  An  atom  located  in  an  electromagnetic  field  at  an  excited 
energy  level  can,  with  a  certain  probability,  be  transferred  by 
the  effect  of  the  field  to  a  lower  energy  state.  It  is  as  if  the  electro¬ 
magnetic  field  “tumbles”  the  atom  down  from  its  excited  level 
to the ground state or only to a less excited state. Such a transition
is accompanied by stimulated (induced) radiation of the substance,
produced by the action of an electromagnetic wave on it.

39.8.2  From  the  viewpoint  of  wave  optics  (32.1.1)  the  phenomenon 
of  stimulated  emission  of  radiation  means  that  as  the  electromag¬ 
netic  wave  passes  through  a  substance  its  intensity  (31.2.4)  is 
increased,  i.e.  negative  absorption  of  light  occurs.  But  the  frequency 
(30.2.5)  of  the  wave,  direction  of  its  propagation,  its  phase  (30.2.5) 
and  polarization  (31.1.7)  remain  unchanged.  Stimulated  emission 
of  radiation  is  strictly  coherent  (30.5.1)  to  the  electromagnetic 
wave  passing  through  the  substance  and  producing  the  emis¬ 
sion. 

39.8.3  From  the  quantum-mechanical  point  of  view,  the  coherence 
mentioned  in  Sect.  39.8.2  means  that  the  new  photon,  produced 
as  a  result  of  the  stimulated  emission,  in  no  way  differs  from  the 
photon  it  was  produced  by.  The  new  photon,  produced  in  stimulat¬ 
ed  emission,  strengthens  the  light  passing  through  the  medium. 
As is shown in Fig. 39.6b, the process of stimulated emission leads
to the production of two photons with the energy $h\nu$ in place of
one such photon. But, in addition to stimulated emission, absorption
of light also occurs. As a result of the absorption of a photon
by an atom at the energy level $E_1$, the photon disappears and
the atom moves to the energy level $E_2$ (Fig. 39.6a). This process
reduces the intensity of the light that passes through the substance.

39.8.4 A medium is said to be active if in it the intensity of
transmitted light increases. This indicates that in an active medium
the process of stimulated emission predominates over the
process of light absorption. Otherwise, when the chief role is played
by the absorption of light, the medium weakens the radiation
passing through it. An active medium is also called one with
negative absorption (39.8.1).

Fig. 39.6

39.8.5 The absorption of light in matter proceeds in accordance
with the Bouguer-Lambert law: $I = I_0 e^{-\alpha x}$, where $\alpha > 0$ is the
absorption factor, $x$ is the thickness of the absorbing layer, $I_0$ is
the intensity (31.2.4) of light at the entrance to the layer (at $x = 0$),
and $I$ is the intensity of the light that has passed through
the layer of thickness $x$. The Bouguer-Lambert-Fabrikant law is
valid for a medium with negative absorption. Thus

$$I = I_0 e^{|\alpha| x},$$

where $|\alpha| > 0$ is a positive quantity corresponding to the strengthening,
rather than the weakening, of light passing through an
active medium. In this case the intensity of the light increases drastically
with the thickness of the layer of the medium (Fig. 39.7).
In other words, the absorption factor $\alpha$ is a negative quantity
for an active medium.

39.8.6 Three types of optical processes, illustrated in Fig. 39.8,
can occur between two energy levels with the energies $E_1$ and $E_2$.
Under conditions in which spontaneous emission can be neglected,
i.e. radiation in which the excited atoms (or molecules or ions)
spontaneously drop to the ground state, the optical properties of
a medium are determined by competition between two processes:
absorption and stimulated emission. The number of events of
absorption is proportional to the concentration $N_1$ of particles
with the energy $E_1$ at the lower energy level. The number of events
of stimulated emission is proportional to the concentration $N_2$ of
particles at the upper energy level. The absorption factor $\alpha$ in the
Bouguer-Lambert-Fabrikant law (39.8.5) is proportional to the
difference between the numbers of events of absorption and
stimulated emission. Thus

$$\alpha = k(N_1 - N_2),$$

where $k > 0$ is the proportionality factor.

Fig. 39.8

39.8.7 When the system is in a state of thermodynamic equilibrium
(8.3.3), $N_2 < N_1$ and $\alpha > 0$. This means that the number of
events of absorption usually exceeds the number of transitions
accompanied by stimulated emission of radiation. To obtain
a medium with a negative absorption factor it is necessary to
provide for a nonequilibrium state in the system, at which $N_2 > N_1$.
Such states are said to be inverted. In such a state the
number of events of stimulated emission of radiation exceeds the
number of events of absorption.
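
The link between the level populations and the sign of the absorption factor (39.8.5 to 39.8.7) can be sketched as follows. The function names and the numerical values of the proportionality factor, the concentrations and the layer thickness are purely illustrative assumptions.

```python
import math


def absorption_factor(k, n1, n2):
    """alpha = k*(N1 - N2): positive for N1 > N2, negative for an inverted state."""
    return k * (n1 - n2)


def transmitted_intensity(i0, alpha, x):
    """Bouguer-Lambert law I = I0*exp(-alpha*x); alpha < 0 gives amplification."""
    return i0 * math.exp(-alpha * x)


# Equilibrium-like case (N2 < N1): the layer weakens the light.
weakened = transmitted_intensity(1.0, absorption_factor(1.0, 2.0, 1.0), 1.0)

# Inverted case (N2 > N1): the same layer strengthens the light.
amplified = transmitted_intensity(1.0, absorption_factor(1.0, 1.0, 2.0), 1.0)
```

A single formula with a signed $\alpha$ covers both the Bouguer-Lambert and the Bouguer-Lambert-Fabrikant cases, which is why no separate function is written for the active medium.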

39.8.8 Lasers (from the acronym for “Light Amplification by the
Stimulated Emission of Radiation”), or generators of coherent
light, are sources of light operating on the principle of stimulated
emission of radiation in an active medium with an inverted
population of energy levels (39.8.7). If these devices operate in
the optical range, they are known as lasers proper. Those that
operate in the range of ultrashort radiowaves are called masers
(Microwave Amplification by Stimulated Emission of Radiation).

39.8.9 The process of obtaining an inverted state (population
inversion) in a medium (39.8.7), required for laser operation, is
called pumping the active medium. In practice, pumping is accomplished
by the three-level procedure. One of the first lasers operating
according to a three-level pumping procedure had a ruby for the
active medium (alumina $\mathrm{Al_2O_3}$ with the addition of chromium
oxide $\mathrm{Cr_2O_3}$). In the crystal lattice of alumina, a part of the Al
atoms are replaced by $\mathrm{Cr^{3+}}$ ions, which serve as the active substance
accomplishing the transitions with stimulated emission.
A schematic diagram of the energy levels of $\mathrm{Cr^{3+}}$ is given in
Fig. 39.9. The nearest to the ground-state level C are two broad
energy bands A (41.8.2) and the double metastable level B. Intensive
irradiation of the ruby by the green light of a powerful
pumping flashtube, filled with neon and krypton, transfers the
chromium ions to the level of the A band. From here radiationless
transitions occur to the B level. The surplus energy is imparted
to the ruby crystal lattice. As a result, a population inversion
of the chromium ions at the B and C levels is obtained (39.8.7)
and the laser emits red light of the wavelengths 6927 Å and 6943 Å,
corresponding to chromium ion transitions from level B to level C.

39.8.10 The avalanche-type increase in intensity in an active
medium (39.8.5) indicates that such a medium operates as an
amplifier of electromagnetic waves. The light amplification effect
in a laser is increased by passing the light many times through
the same layer of the active medium. This takes place by means
of the arrangement shown in Fig. 39.10a. Photon A, travelling
parallel to the axis of the active medium (a ruby rod) 1, gives
rise to an avalanche of photons moving in the same direction
(Fig. 39.10b). A part of this avalanche passes through semitransparent
mirror 3 and leaves the laser, and a part is reflected and
increases again in the active medium (Fig. 39.10c). A part of the
avalanche of photons that reaches the nontransparent mirror 2
is absorbed by it, but after reflection from this mirror the rest
move in the same direction as the initial “starting” photon A
(Fig. 39.10d). The multiply amplified stream of photons, leaving
the laser through the semitransparent mirror 3, produces an extremely
intense pencil of light rays, highly directional and with
very little angular spread.

Fig. 39.10 (panels a to d; labels: mirror, semitransparent mirror)

Photons B and C (Fig. 39.10b), travelling at an angle to the axis
of  the  active  medium,  build  up  an  avalanche  of  photons  which, 
after  multiple  reflection,  disappears  from  the  medium  and  does 
not  participate  in  light  amplification.  The  high  coherency,  sharp 
directional  properties  and  great  intensity  of  laser  radiation  are 
the  basis  for  its  ever-increasing  application  in  science  and  engi¬ 
neering. 


PART  SEVEN 

BASIC  SOLID-STATE  PHYSICS 


CHAPTER  40  STRUCTURE  AND  CERTAIN 
PROPERTIES  OF  SOLIDS 

40.1 Structure of Solids

40.1.1  Solids  are  bodies  having  constant  shape  and  volume. 
Distinction  is  made  between  crystalline  and  amorphous  solids. 
Externally,  crystals  have  a  regular  geometric  shape.  They  have 
a  periodically  repeated  arrangement  of  their  constituent  particles 
throughout  the  whole  crystal.  This  arrangement  is  called  the 
crystal,  or  space,  lattice.  In  this  sense,  a  crystal  is  said  to  have 
long-range order (cf. Sect. 13.1.2). Crystals are bounded by flat
faces, orderly arranged with respect to one another, which converge
at the edges and vertices. Large separate crystals having the
shape of regular polyhedrons are called monocrystals. Their shape
is determined by their chemical composition. Polycrystals have
a fine crystalline structure made up of a great number of small
randomly arranged crystals (crystal grains or crystallites) that have
grown  together. 

Amorphous  solids  (pitch,  glass,  etc.)  are  supercooled  liquids  and 
have  no  clearly  defined  properties  of  crystals. 

In  the  following,  discussion  is  confined  to  crystalline  bodies. 

40.1.2  Each  particle  in  a  crystal  lattice  is  subject  to  the  forces 
of intermolecular interaction (12.1.4). An equilibrium arrangement
of all the particles of a solid at the points of the crystal lattice
(8.1.4)  corresponds  to  minimum  free  energy  of  the  crystal  (11.4.5) 
and  its  most  stable  state.  The  particles  at  the  points  of  the  crystal 
lattice  are  located  at  certain  equilibrium  distances  from  one  anoth¬ 
er  called  the  lattice  constants. 

40.1.3  The  main  types  of  crystalline  solids,  distinguished  from 
one  another  by  the  nature  of  the  forces  of  interaction  between 
their  particles  and  the  kinds  of  particles  located  at  the  crystal 
lattice points, are the following.

(a) Ionic crystals (NaCl, calcium carbonate and other salts).
Located at the lattice points are regularly alternating positive
and negative ions, between which are heteropolar bonds (39.4.4).

(b) Valence (covalent) crystals (C, Ge, Te, etc.). The neutral atoms
at the lattice points have homopolar bonds (39.4.5) between them.
Such crystals are found in semiconductors (41.10.1) and in many
organic solids.

(c) Molecular crystals (Ar, $\mathrm{CH_4}$, paraffin, etc.). The points of the
crystal lattice are occupied by molecules that are bound by Van
der Waals forces, mainly of the dispersion type (12.1.8).

(d) Metals (Na, Cu, Al, etc.). At the lattice points are positive
ions that are formed after the outer (valence) electrons (39.3.9)
have been stripped from the atoms. These electrons form an
electron gas (16.4.1) of collectivized free particles.

The  metallic  bond  is  a  specific  type  of  chemical  bond  (39.4.1) 
that  is  developed  between  the  ions  in  the  crystal  lattice  and  the 
electron  gas.  The  electrons  “constrict”  the  positive  ions  (mainly 
by  electrostatic  forces)  and  thereby  compensate  for  repulsion 
between  the  ions.  At  distances  between  the  ions  equal  to  the 
lattice  constant  (40.1.2),  the  metallic  crystal  acquires  a  stable 
state. 

40.1.4  The  anharmonic  nature  of  the  thermal  vibration  of  the 
particles  at  the  crystal  lattice  points  (8.1.4)  is  manifested  in  the 
fact  that  the  dependence  of  the  potential  energy  of  interaction  of  the 
particles  on  their  displacement  from  the  equilibrium  position  is  not 
a  parabolic  one.  The  force  acting  on  a  particle  is  not  quasi-elastic 
(40.3.5).  These  facts  are  of  prime  importance  in  understanding 
certain  thermal  (40.2.4)  and  electrical  (41.5.4)  properties  of  solids. 

40.1.5 A typical property of monocrystals (40.1.1) is their anisotropy
(crystalline anisotropy), which is the dependence of the physical
properties of solids (thermal, elastic, electrical and optical
properties) on the direction in the crystal.


40.2  Thermal  Expansion  of  Solids 

40.2.1 A solid is subject to thermal expansion, of either linear or
volume type, when its temperature is raised. Both kinds of thermal
expansion are characterized by the average coefficients of linear
$\alpha_l$ and cubic, or volume, $\alpha_V$ expansion for the given temperature
range.

40.2.2 If $l_0$ is the length of a body at the temperature 0 °C, its
elongation $\Delta l$ upon being heated to the temperature $t$ °C is

$$\Delta l = \alpha_l l_0 t,$$

from which $\alpha_l = \Delta l / (l_0 t)$.

The linear expansion coefficient characterizes the elongation per
unit length $\Delta l / l_0$ as the body is heated by one degree. For most solids
$\alpha_l \approx 10^{-5}$ to $10^{-6}$ K$^{-1}$ and depends only slightly on the temperature.

40.2.3 When a body is heated from 0 °C to $t$ °C, its volume
increases from $V_0$ to $V$ according to the law

$$V = V_0 (1 + \alpha_V t),$$

from which $\alpha_V = \Delta V / (V_0 t)$. The cubic expansion coefficient characterizes
the volume increase per unit volume $\Delta V / V_0$ as the body
is heated by one degree.

In a first approximation, the relation between the coefficients $\alpha_V$
and $\alpha_l$ is of the form

$$\alpha_V \approx 3\alpha_l.$$
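
The first-approximation relation between the two coefficients can be checked on an isotropic cube whose edges all follow $l = l_0 (1 + \alpha_l t)$. This is a minimal sketch; the function name and the sample value $\alpha_l = 1.2 \times 10^{-5}$ K$^{-1}$ are assumptions chosen for illustration.

```python
def exact_volume_coefficient(alpha_l, t):
    """alpha_V = (V - V0)/(V0*t) for a cube with edges l0*(1 + alpha_l*t)."""
    return ((1.0 + alpha_l * t) ** 3 - 1.0) / t


alpha_l = 1.2e-5  # assumed linear coefficient, 1/K
alpha_v = exact_volume_coefficient(alpha_l, 100.0)
ratio = alpha_v / (3.0 * alpha_l)  # close to 1 when alpha_l*t is small
```

Expanding the cube gives $\alpha_V = 3\alpha_l + 3\alpha_l^2 t + \alpha_l^3 t^2$, so the neglected terms are small as long as $\alpha_l t \ll 1$.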

40.2.4 The thermal expansion of solids is due to the anharmonicity
of the thermal vibration of the particles in the lattice (40.1.4).

If $r_0$ is the equilibrium distance between adjacent particles, then,
at some arbitrary instant, the distance between them is $r = r_0 + x$,
where $x$ is the mutual displacement of the particles from
their equilibrium positions due to thermal vibration. The forces
between the particles in the lattice are not quasi-elastic, but
depend on the displacement $x$ according to the relation

$$F_x = -\kappa x + b x^2,$$

where $\kappa$ is the coefficient of quasi-elastic force (40.3.5), and $b$
is the coefficient of anharmonicity of the vibration. The term $bx^2$
characterizes the deviation of the vibration from its harmonic
type due to the different dependences of the forces of attraction
and repulsion on the distance.

When a solid is in an equilibrium state the positions of the crystal
lattice points should not vary with time and the average force
acting on each particle should equal zero:

$$\langle F_x \rangle = 0.$$



If the vibration of the particles were strictly harmonic ($F_x = -\kappa x$),
the average displacement of the particles would be
$\langle x \rangle = -\langle F_x \rangle / \kappa = 0$, i.e. there would be no thermal expansion. For
real anharmonic vibration, it follows from the condition
$\langle F_x \rangle = 0$ that

$$-\kappa \langle x \rangle + b \langle x^2 \rangle = 0 \quad \text{or} \quad \langle x \rangle = \frac{b}{\kappa} \langle x^2 \rangle.$$

For thermal vibration with small amplitudes the potential energy
$E_p$ of the vibrating particles is equal (28.2.3) to:

$$E_p = \frac{\kappa \langle x^2 \rangle}{2}.$$

But, according to the law of equipartition of energy (10.4.6),
$E_p = kT/2$, where $k$ is Boltzmann's constant (8.4.5) and $T$ is
the absolute temperature. Thus

$$\frac{\kappa \langle x^2 \rangle}{2} = \frac{kT}{2} \quad \text{or} \quad \langle x^2 \rangle = \frac{kT}{\kappa}.$$

Finally, $\langle x \rangle = bkT/\kappa^2$. Hence, the average distance between
the particles of a solid increases when it is heated, leading to
thermal expansion.
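
The result $\langle x \rangle = bkT/\kappa^2$ can be cross-checked numerically. The sketch below takes a different route from the handbook's argument: it assumes classical Boltzmann statistics and averages $x$ with the weight $e^{-U/kT}$ for the potential $U(x) = \kappa x^2/2 - b x^3/3$, which corresponds to $F_x = -\kappa x + b x^2$. The dimensionless parameter values and the function name are illustrative assumptions.

```python
import math


def mean_displacement(kappa, b, kt, steps=20001):
    """Boltzmann-weighted <x> for U(x) = kappa*x**2/2 - b*x**3/3.

    The integration window spans ten thermal amplitudes on each side, so it
    stays well inside the barrier of the cubic potential at x = kappa/b for
    the weakly anharmonic parameters used below.
    """
    half_width = 10.0 * math.sqrt(kt / kappa)
    dx = 2.0 * half_width / (steps - 1)
    numerator = 0.0
    denominator = 0.0
    for i in range(steps):
        x = -half_width + i * dx
        potential = 0.5 * kappa * x * x - b * x ** 3 / 3.0
        weight = math.exp(-potential / kt)
        numerator += x * weight
        denominator += weight
    return numerator / denominator


kappa, b, kt = 1.0, 0.1, 0.01       # assumed dimensionless parameters
numerical = mean_displacement(kappa, b, kt)
analytical = b * kt / kappa ** 2    # handbook result <x> = b*k*T/kappa**2
```

For weak anharmonicity the two values agree closely, and setting $b = 0$ makes the numerical average vanish, which corresponds to the absence of thermal expansion for strictly harmonic vibration.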


40.3  Brief  Information  on  the  Elastic 
Properties  of  Solids 

40.3.1  Deformation  of  a  solid  is  the  change  in  its  dimensions  and 
volume.  Deformation  is  usually  accompanied  by  a  change  in  the 
shape  of  the  body.  Sometimes  (in  triaxial  tension  or  compression) 
no  change  in  shape  occurs.  Deformation  may  be  caused  by  exter¬ 
nal  forces  acting  on  the  body,  a  change  in  its  temperature  (40.2.1) 
and  other  reasons. 

Deformation  of  a  body  leads  to  displacement  of  its  particles  from 
their  initial  equilibrium  positions  at  the  crystal  lattice  points 
(8.1.1)  to  new  positions.  This  displacement  is  opposed  by  the 
forces  of  interaction  between  the  particles.  Internal  elastic  forces 
are  developed  inside  the  body  to  counterbalance  the  external 
forces causing the deformation.

40.3.2 Deformation is said to be elastic (elastic deformation) if it
disappears as soon as the external forces causing it cease to act.
The particles of the solid then return to their initial positions
of equilibrium (40.3.1). Inelastic deformation of a solid leads to
irreversible rearrangement of the crystal lattice (40.1.1) and the
shape of the body is not restored when the external forces are removed.
Such deformation is said to be plastic (40.3.8). The transformation
of elastic deformation into plastic deformation may occur
upon prolonged action of even small external forces applied to the
body. The reverse transformation cannot occur.

The  degree  of  elasticity  is  the  ratio  of  the  work  that  can  be  done  by 
a  deformed  body  when  the  deforming  forces  are  slowly  removed 
to  the  work  done  by  external  forces  in  deforming  the  body. 

40.3.3 The physical quantity equal to the elastic force $dF_{el}$ per
unit area $dS$ of the cross section of the body being deformed is
called the stress $\sigma$:

$$\sigma = \frac{dF_{el}}{dS}.$$

The stress is said to be normal if force $dF_{el}$ is normal to the surface
$dS$ and tangential, or shearing, if the force is tangential to this
surface.

40.3.4 A measure of the deformation is the unit strain, or relative
deformation. It is equal to the ratio of the absolute deformation
$\Delta x$ to the initial value of the quantity $x$, which characterizes the
shape or dimensions of a deformed body: $\Delta x / x$.

Hooke's law states that the stress in an elastically deformed body
is proportional to its unit strain. Thus

$$\sigma = K_x \frac{\Delta x}{x},$$

where $K_x$ is the modulus of elasticity, equal in magnitude to the
stress developed in relative deformation equal to unity. The
quantity $\alpha_x = 1/K_x$ is called the coefficient of elasticity. Hooke's
law is valid only for sufficiently small relative deformation. The
stress $\sigma_p$ at which the deformation is no longer proportional to the
stress is called the proportional elastic limit (point A in Fig. 40.1).

40.3.5 In addition to elastic forces, there are forces of a different
origin that nevertheless satisfy the equation

$$F_x = -\kappa x,$$

where $F_x$ is the projection of the force on the direction in which
the absolute linear deformation $x$ takes place. They are called
quasi-elastic forces. The quantity $\kappa$ is called the coefficient of the
quasi-elastic force.

40.3.6 The simplest type of deformation is longitudinal, or uniaxial,
extension (or compression), which consists in elongating
(shortening) a body by the action of the tensile (compressive)
force $F$. Elastic extension (compression) ceases when $F = F_{el}$,
where $F_{el}$ is the elastic force (40.3.1). The unit strain (relative
deformation) is $\Delta x / x = \Delta l / l$, where $\Delta l$ is the change in length due
to the action of force $F$, and $l$ is the initial length of the body.
According to Hooke's law

$$\sigma = E \frac{\Delta l}{l},$$

where the modulus of elasticity $K_x = E$ is called Young's modulus.
Young's modulus is equal to the normal stress $\sigma$ at which the
linear dimension of the body is doubled: $\Delta l = l$*.

Fig. 40.1

40.3.7 The dependence of the normal stress $\sigma$ on the unit strain
$\Delta l / l$ in uniaxial extension is called the tensile stress-strain diagram
(Fig. 40.1). Beyond the proportional elastic limit (40.3.4), an
increase in $\sigma$ leads to a considerable increase in $\Delta l / l$. At the stress
$\sigma_y$, corresponding to point B in the stress-strain diagram, the unit
strain continues to increase without increasing the stress (horizontal
portion BB' in the diagram). The stress $\sigma_y$ is called the
yield point. The maximum stress $\sigma_t$ that the body can withstand
before rupture, point C in the diagram, is called the ultimate
strength or, if the body is in tension, the tensile strength. Point D
corresponds to the rupture of the body.
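
Hooke's law for uniaxial extension (40.3.6) applies only below the proportional elastic limit (40.3.4). The sketch below makes that restriction explicit. The function name and all material values (a Young's modulus of $2.0 \times 10^{11}$ Pa and a proportional limit of $2.0 \times 10^{8}$ Pa, roughly of the order found for steels) are assumptions for illustration, not data from the handbook.

```python
def elongation(force, area, length, youngs_modulus, proportional_limit):
    """Change in length from sigma = E * (delta_l / l), valid for sigma <= sigma_p."""
    stress = force / area  # normal stress sigma = F/S
    if stress > proportional_limit:
        raise ValueError("stress exceeds the proportional elastic limit")
    return stress / youngs_modulus * length


# A 2 m wire of 1 mm^2 cross section loaded with 100 N (assumed values).
delta_l = elongation(100.0, 1.0e-6, 2.0, 2.0e11, 2.0e8)
```

With these assumed values the stress is $10^8$ Pa and the wire lengthens by about 1 mm. A load of 300 N would exceed the assumed proportional limit, and the function then refuses to apply the linear law.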

40.3.8 If the load on a body, deformed to the stress $\sigma_a$ (point a
in the diagram in Fig. 40.1), is slowly removed, the curve
$\sigma = \varphi(\Delta l / l)$ is the straight line aR, parallel to the straight portion
OA of the diagram. The line segment OR determines the permanent
set, typical of plastic deformation.


* Assuming that Hooke’s law is valid for such large deformation.

40.3.9 Shear is the deformation of a body in which all its flat
layers, parallel to a certain shear plane, are displaced parallel
to one another without distortion or any changes in dimensions
(Fig. 40.2). Shear results from the action of tangential force $F$,
applied to face BC, parallel to the shear plane. Face AD, parallel
to face BC, is fixed. For small shear deformation

$$\gamma \approx \tan \gamma = \frac{CC'}{CD},$$

where $CC' = \Delta x$ is the absolute shear, $\gamma$ is the shear angle, or
unit shear strain, expressed in radians. According to Hooke’s law
(40.3.4), the unit shear strain is proportional to the tangential
(shearing) stress $\sigma_\tau = F/S$, where $S$ is the area of face BC, i.e.

$$\sigma_\tau = G\gamma.$$

Fig. 40.2

The  quantity  G  is  called  the  shear  modulus.  Its  magnitude  is  equal 
to  the  tangential,  or  shear,  stress  that  produces  a  unit  shear  strain 
equal  to  unity  in  the  specimen. 


40.4  Basic  Concepts  of  Phase  Transitions  in  Solids 

40.4.1  The  heating  of  a  solid  crystalline  body  leads  to  an  increase 
in  the  amplitude  of  the  anharmonic  thermal  vibration  at  the  points 
of  the  crystal  lattice  (40.2.4)  and  to  an  increase  in  the  average 
interpoint distances in the lattice (thermal expansion, Sect.
40.2.1).  Intense  heating  destroys  the  crystal  lattice  and  leads 
to  the  transition  of  the  substance  from  its  solid  to  its  liquid  or 
vapour  phase  (first-order  phase  transition,  Sect.  12.4.2). 

40.4.2  The  transition  of  a  substance  from  the  solid  to  the  gaseous 
state  is  called  sublimation.  Transition  from  the  solid  to  the  liquid 
state  is  called  melting,  or  fusion.  Melting  begins  at  a  definite 
temperature  Tm  for  the  given  pressure;  it  is  called  the  melting 
point. This temperature remains constant during the melting
process. Shown in Fig. 40.3 is the dependence $T(Q)$, where $Q$ is
the  quantity  of  heat  (9.2.1)  delivered  to  the  solid  being  heated. 
The  isothermal  portion  BC  corresponds  to  a  two-phase  system: 
solid-liquid  (cf.  Sect.  12.3.2,  where  the  two-phase  system  consists 
of  a  liquid  and  saturated  vapour).  In  the  melting  process,  the 
substance is converted from a more ordered crystalline state to a
less ordered liquid state. According to the second law of thermodynamics
(11.3.2), melting is associated with an increase in the
entropy (11.4.3) of the system.

40.4.3 The quantity of heat $r_m$ required to melt unit mass of
a solid at the temperature $T_m$ is called the heat of fusion, or of
melting. It is also called the latent heat. It follows from the first
law of thermodynamics (9.3.1) that

$$r_m = u_l - u_s + p (v_l - v_s),$$

where $u_l$ and $u_s$ are the specific internal energies (9.1.2) of the
substance (internal energies per unit mass) in the liquid and
solid phases, $v_l$ and $v_s$ are the specific volumes of the liquid and
solid phases, and $p$ is the constant pressure at which the first-order
transition proceeds (12.4.2).

40.4.4 The melting point $T_m$ depends upon the pressure $p$.
A change of $dp$ in the pressure leads to a change in the melting
point by $dT_m$. The dependence $T_m = T_m(p)$ is expressed by the
Clapeyron-Clausius equation (see also Sect. 13.6.5):

$$ \frac{dT_m}{dp} = \frac{T_m\,(v_l - v_s)}{r_m}. $$

The notation is defined in Sect. 40.4.3. The Clapeyron-Clausius
equation is valid for any first-order phase transition.

In the overwhelming majority of cases the melting of a substance
leads to an increase in its specific volume, i.e. $v_l > v_s$ and
$dT_m/dp > 0$ (since $r_m > 0$), i.e. the melting point increases with
the pressure. Such substances can be in the solid state at high
pressures in the temperature region above the critical temperature
(12.3.1).

The specific volume of certain substances (water, bismuth,
gallium) decreases in melting, so that $v_l < v_s$. Ice, for instance,
at 0 °C has a lower density than water. For such substances
$dT_m/dp < 0$, i.e. the melting point decreases as the pressure
is increased.
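The sign rule above can be checked numerically. The following is a minimal sketch that evaluates the Clapeyron-Clausius slope for ice; the heat of fusion and the densities of water and ice are assumed round handbook-type values, not data taken from this text, and the function name is illustrative.

```python
def melting_point_slope(t_m, r_m, v_liquid, v_solid):
    """Return dT_m/dp in K/Pa from the Clapeyron-Clausius equation."""
    return t_m * (v_liquid - v_solid) / r_m


# Assumed illustrative data for water/ice at 0 degrees C (not from the text).
T_M = 273.15              # melting point, K
R_M = 3.34e5              # heat of fusion, J/kg
V_LIQUID = 1.0 / 999.8    # specific volume of water, m^3/kg
V_SOLID = 1.0 / 916.7     # specific volume of ice, m^3/kg

slope = melting_point_slope(T_M, R_M, V_LIQUID, V_SOLID)
slope_per_atm = slope * 101325.0  # convert K/Pa to K/atm

print(f"dT_m/dp = {slope:.3e} K/Pa = {slope_per_atm:.4f} K/atm")
```

With these assumed inputs the slope comes out negative and small, a few thousandths of a kelvin per atmosphere, in line with the statement that the melting point of ice falls as the pressure rises.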

40.4.5 In a pT diagram, the dependence of the pressure on
the temperature, $p = f(T)$, in a first-order phase transition is
represented by a curve on which each point corresponds to
equilibrium of two coexisting phases. The vapour generation curve,
for example, shows the equilibrium of the two-phase system:
liquid-vapour.

Since $v_v - v_l > 0$ (where $v_v$ is the specific volume of the vapour),
it follows from the Clapeyron-Clausius equation (40.4.4) that
$dp/dT > 0$. The curve $p = f(T)$, separating the regions of the
liquid and gaseous states of a substance, ends at critical point C
(12.3.1) (Fig. 40.4). Shown in Figs. 40.5 and 40.6 are the
equilibrium curves of the two-phase solid-liquid states for the cases
when $v_l > v_s$ and $v_l < v_s$, respectively.

40.4.6 When a liquid is cooled to a certain temperature, called the
freezing point $T_f$ (or temperature of solidification), the substance
solidifies (crystallizes, or freezes). This is the transition from
a liquid to a solid crystalline state. In this process a quantity of
heat is evolved that is equal to the heat of melting (40.4.3). The
freezing point is equal to the melting point and depends upon the
presence of impurities in the substance. Impurities lower $T_f$ if
they do not form mixed crystals, called solid solutions, with the
substance. Sea water, for instance, containing dissolved
salts, freezes at a lower temperature than distilled water. This
serves as the basis for producing cooling mixtures.

[Fig. 40.4: pT diagram showing the liquid-vapour equilibrium curve.]
[Fig. 40.5: pT diagram showing the solid-liquid equilibrium curve for $v_l > v_s$.]
[Fig. 40.6: pT diagram showing the solid-liquid equilibrium curve for $v_l < v_s$.]

40.4.7  The  freezing  (crystallization)  of  liquids  is  associated  with 
a  change  in  the  nature  of  the  thermal  motion  of  their  particles. 
The  time  the  particles  remain  in  a  temporary  settled  state  (relaxa¬ 
tion  time,  see  Sect.  13.2.4)  is  increased.  The  forces  of  mutual 
attraction  between  the  particles  change  the  thermal  motion  into 
random  thermal  vibration  about  the  points  of  the  crystal  lattice. 
The  transition  of  the  substance  to  a  more  ordered  phase  is  asso¬ 
ciated  with  a  reduction  in  the  entropy  of  the  system  (11.4.3). 


CHAPTER  41  AN  OUTLINE  OF  THE  QUANTUM 
PHYSICS  OF  SOLIDS 

41.1  Basic  Concepts  of  Quantum  Statistics 

41.1.1  Quantum  statistics  is  a  statistical  method  of  investigation 
(8.2.2)  that  is  applied  to  systems  consisting  of  a  large  number  of 
particles and obeying the laws of quantum mechanics (38.1.1).

41.1.2 In contrast to the initial propositions of classical
statistical physics (10.1.4 and 10.1.5), quantum statistics is based on
the principle of indistinguishability of identical particles. All
identical particles (for example, all the electrons in metals or all
the protons in atomic nuclei) are considered to be indistinguishable,
in principle, from one another.

41.1.3 Of prime importance in quantum statistics is the problem
of distributing the particles with respect to their coordinates and
velocities*. It can be stated as the problem of distributing
particles amongst the cells of “phase volume”, an element of which is
equal to $\Delta\Gamma = \Delta x\,\Delta y\,\Delta z\,\Delta p_x\,\Delta p_y\,\Delta p_z$ (10.1.5). The wave-particle
dualism of the properties of particles of matter (38.1.3) and the
Heisenberg indeterminacy relations (38.0.2) lead to a situation
in which the element of volume $\Delta\Gamma$ cannot be less than $h^3$, where $h$
is Planck’s constant (Appendix II). In arranging the particles
in cells of volume $h^3$ each, in accordance with the principle of
indistinguishability of identical particles (41.1.2), the question
posed concerns the number of particles in the given cell, but not
which, specifically, of the given sort of particles are in the cell.
The state of a system of particles is not changed when identical
particles are rearranged within the given cell of the phase space,
or interchanged between the cells.

41.1.4 If an element of volume $\Delta\Gamma_i$ is divided into cells with
the volume $h^3$, then the ratio $\Delta\Gamma_i/h^3 = \Delta g_i$ determines the number
of quantum states contained in the volume $\Delta\Gamma_i$, having energy
within the range from $E_i$ to $E_i + \Delta E_i$. Particles, of a number
equal to $\Delta N_i$, contained in the volume $\Delta\Gamma_i$, can be distributed
in all possible ways between the $\Delta g_i$ states having the energy $E_i$.
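How many "possible ways" there are depends on whether the particles are distinguishable. The sketch below uses the standard combinatorial counts, which are brought in here for illustration and are not written out in the text: C(N + g - 1, N) for indistinguishable particles with unlimited occupancy, C(g, N) for indistinguishable particles with at most one per state, and g^N for distinguishable (classical) particles. The function names are illustrative.

```python
from math import comb


def ways_bosons(n_particles, n_states):
    """Indistinguishable particles, any number allowed per state."""
    return comb(n_particles + n_states - 1, n_particles)


def ways_fermions(n_particles, n_states):
    """Indistinguishable particles, at most one per state."""
    return comb(n_states, n_particles)


def ways_classical(n_particles, n_states):
    """Distinguishable particles, any number allowed per state."""
    return n_states ** n_particles


# Two particles distributed over three states of the same energy.
for label, count in (
    ("bosons", ways_bosons(2, 3)),
    ("fermions", ways_fermions(2, 3)),
    ("classical", ways_classical(2, 3)),
):
    print(label, count)
```

For two particles in three states the counts are 6, 3 and 9 respectively, which shows how indistinguishability alone reduces the number of distinct states of the system.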

41.2 Bose-Einstein and Fermi-Dirac Distribution
Functions

41.2.1 Particles with integer or zero spin (in units of $\hbar$) (26.1.3)
are called bosons (for instance, photons, phonons and certain
nuclei). Systems of such particles are described by Bose-Einstein
quantum statistics. Bosons do not comply with the Pauli exclusion
principle (39.3.1) and they are not subject to restrictions as to the
number of particles that may be in a certain quantum state.

* In classical statistics, the laws of Maxwell (10.3.1) and
Boltzmann (10.4.2) enable a similar problem to be solved.

41.2.2 The Bose-Einstein distribution function $f_B$ is the average
population of bosons in a state with the given energy, i.e. the
average number of particles in one state. Thus

$$ f_B = \frac{\Delta N(E_i)}{\Delta g_i}, $$

where $\Delta N(E_i)$ is the number of particles having energies in
the range from $E_i$ to $E_i + \Delta E_i$, and $\Delta g_i$ is the number of
quantum states within this energy range (41.1.4). Function $f_B$ is
found by considering the thermodynamic probability $P$ (11.5.2)
of the distribution of the particles of the system amongst the
quantum states. The most probable distribution is found under
the condition that the total number $N$ of particles and the total
energy $E$ of the system are conserved. Thus

$$ \sum_i \Delta N_i = N \quad \text{and} \quad \sum_i \Delta N_i E_i = E. $$

Summation is carried out over all the quantum states of the
system.

41.2.3 An application of the method of Lagrangian
(undetermined) multipliers to find the conditional extremum enables the
following expression to be obtained for the Bose-Einstein
distribution function:

$$ f_B = \frac{1}{\exp\left(\dfrac{E_i - \mu}{kT}\right) - 1}. $$

Here $k$ is Boltzmann’s constant (8.4.5), and $T$ is the absolute
temperature. The quantity $\mu = (U - TS + pV)/N$ is called
the chemical potential per particle, where $U$ is the internal energy
of the system (9.1.2), $S$ is its entropy (11.4.2), $V$ is the volume
of the system, and $p$ is the pressure. The chemical potential is
the work done under isobaric-isothermic conditions in
increasing the number of particles in the system by one.
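The Lagrangian-multiplier step can be sketched as follows. This is an outline under stated assumptions rather than the handbook's own derivation: the boson counting formula for $P$, the use of Stirling's approximation for large $\Delta N_i$ and $\Delta g_i$, and the multipliers $\alpha$ and $\beta$ are standard ingredients introduced here for illustration.

```latex
\begin{align*}
P &= \prod_i \frac{(\Delta N_i + \Delta g_i - 1)!}{\Delta N_i!\,(\Delta g_i - 1)!}, \\
\ln P &\approx \sum_i \bigl[(\Delta N_i + \Delta g_i)\ln(\Delta N_i + \Delta g_i)
        - \Delta N_i \ln \Delta N_i - \Delta g_i \ln \Delta g_i\bigr], \\
0 &= \frac{\partial}{\partial\,\Delta N_i}
     \Bigl[\ln P - \alpha \sum_j \Delta N_j - \beta \sum_j \Delta N_j E_j\Bigr]
   = \ln\frac{\Delta N_i + \Delta g_i}{\Delta N_i} - \alpha - \beta E_i, \\
f_B &= \frac{\Delta N_i}{\Delta g_i} = \frac{1}{e^{\alpha + \beta E_i} - 1}.
\end{align*}
```

Identifying $\beta = 1/kT$ and $\alpha = -\mu/kT$ gives the Bose-Einstein function in the form quoted in this section.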

41.2.4 Particles with half-integer spin (in units of $\hbar$) (26.1.3) are
called fermions (electrons, protons, neutrons, etc.). Systems of
fermions are described by Fermi-Dirac quantum statistics.
Fermions comply with the Pauli exclusion principle (39.3.1)
and in a given quantum state of a system of fermions there
can be no more than one particle.

41.2.5 The Fermi-Dirac distribution function $f_F$ is determined
in a way similar to that used in Sect. 41.2.2. Thus

$$ f_F = \frac{\Delta N(E_i)}{\Delta g_i}. $$

The notation is defined in Sect. 41.2.2. The solution of the
problem of the most probable distribution of the fermions
amongst the states, under the condition of conserving total
energy $E$ in the system and the total number $N$ of its particles,
leads to the following form of the function $f_F$:

$$ f_F = \frac{1}{\exp\left(\dfrac{E_i - \mu}{kT}\right) + 1}, $$

where $\mu$ is the chemical potential per particle (41.2.3).

41.2.6 The distribution functions in classical and quantum
statistics, introduced as the average number of particles in one
state (41.2.2), can be expressed by a single equation:

$$ f = \frac{1}{\exp\left(\dfrac{E_i - \mu}{kT}\right) + \delta}. $$

For the Maxwell-Boltzmann distribution (cf. Sect. 10.4.2)
$\delta = 0$ and $\mu = 0$; for the Bose-Einstein distribution
$\delta = -1$; and for the Fermi-Dirac distribution $\delta = +1$.
Plots of these three distribution functions are shown
in Fig. 41.1.

[Fig. 41.1: the Maxwell-Boltzmann, Bose-Einstein and Fermi-Dirac distribution functions.]
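The single equation lends itself to a one-function implementation. The sketch below is illustrative only; the function name, the dictionary of delta values and the choice of energy units (kT = 1) are assumptions made for the example.

```python
import math


def occupancy(energy, mu, kT, delta):
    """Average number of particles per state: 1 / (exp((E - mu)/kT) + delta)."""
    return 1.0 / (math.exp((energy - mu) / kT) + delta)


# delta selects the statistics, as in the single equation of Sect. 41.2.6.
DELTA = {"Maxwell-Boltzmann": 0, "Bose-Einstein": -1, "Fermi-Dirac": +1}

# Compare the three functions at the same reduced energy (E - mu)/kT = 1.
values = {name: occupancy(1.0, 0.0, 1.0, delta) for name, delta in DELTA.items()}
for name, value in values.items():
    print(f"{name}: {value:.4f}")
```

At equal (E - mu)/kT the Bose-Einstein value lies above the classical one and the Fermi-Dirac value below it; for (E - mu)/kT much greater than 1 the three nearly coincide.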

Example. Blackbody radiation (36.1.8) is enclosed in a cavity
of volume $V$ at $T$ = const, the radiation being in a state of
thermodynamic equilibrium with the walls of the cavity. This
radiation can be dealt with as a photon gas complying with
Bose-Einstein statistics, because the spin of a photon equals $\hbar$
(Table 43.1). The number of quantum states of radiation in
volume $V$ (41.1.4) is

$$ dg = 2\,\frac{4\pi p^2\,dp\,V}{h^3} = \frac{8\pi E^2\,dE}{h^3 c^3}\,V, $$

where $p$ is the momentum of the photon (37.2.2), related to its
energy by the equation $p = E/c$, where $c$ is the velocity of light
in free space. The factor 2 appears because there are two possible
directions in which light can be polarized (31.1.8). The energy
of the photon is $E = h\nu$ (37.1.4), where $\nu$ is the frequency.
According to Sect. 41.2.2, the number of photons with
frequencies in the range from $\nu$ to $\nu + d\nu$ in the volume $V$ is

$$ dN = \frac{dg}{\exp(h\nu/kT) - 1} = \frac{8\pi\nu^2\,d\nu\,V}{c^3\,[\exp(h\nu/kT) - 1]}. $$

Taken into account here is the fact that the chemical potential $\mu$
(41.2.3) is equal to zero for a photon gas, in which the condition
for conserving the total number of particles is not complied
with.

The volume density of radiant energy in the frequency range
from $\nu$ to $\nu + d\nu$ is

$$ \omega(\nu, T)\,d\nu = \frac{h\nu\,dN}{V} = \frac{8\pi\nu^2}{c^3}\,\frac{h\nu\,d\nu}{\exp(h\nu/kT) - 1}. $$

This result is Planck’s radiation formula for the volume energy
density of blackbody radiation (36.1.5).
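As a quick numerical check of Planck's formula, the sketch below evaluates the spectral energy density and compares it with the classical Rayleigh-Jeans expression $8\pi\nu^2 kT/c^3$, which is the well-known low-frequency limit and is introduced here only for comparison. The constants are standard SI values; the function names are illustrative.

```python
import math

H = 6.62607015e-34   # Planck's constant, J*s
K_B = 1.380649e-23   # Boltzmann's constant, J/K
C = 2.99792458e8     # velocity of light in free space, m/s


def planck_energy_density(nu, temperature):
    """Spectral energy density per unit frequency, J*s/m^3 (Planck)."""
    x = H * nu / (K_B * temperature)
    return 8.0 * math.pi * nu**2 / C**3 * H * nu / math.expm1(x)


def rayleigh_jeans_energy_density(nu, temperature):
    """Classical low-frequency limit, used here only as a cross-check."""
    return 8.0 * math.pi * nu**2 / C**3 * K_B * temperature


T = 300.0
for nu in (1e9, 1e12, 1e14):
    ratio = planck_energy_density(nu, T) / rayleigh_jeans_energy_density(nu, T)
    print(f"nu = {nu:.0e} Hz: Planck / Rayleigh-Jeans = {ratio:.3e}")
```

The ratio is close to unity while $h\nu \ll kT$ and falls off steeply once $h\nu$ exceeds $kT$, where the quantum cutoff takes over.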


41.3  Degeneracy  of  Systems  of  Particles  Described 
by  Quantum  Statistics 

41.3.1  A  system  of  particles  (in  particular,  an  ideal  gas)  is 
said  to  be  degenerate  if  its  properties,  described  by  quantum 
laws,  differ  from  those  of  ordinary  systems  that  obey  classical 
laws.  Deviation  in  the  behaviour  of  Bose  and  Fermi  gases  from 
that  of  classical  Maxwellian-Boltzmann  gas  is  called  gas  degener¬ 
acy.  This  becomes  appreciable  at  extremely  low  temperatures 
and high densities (41.3.3).

41.3.2 The degeneracy parameter $A$ is the quantity

$$ A = e^{\mu/kT}, $$

where $\mu$ is the chemical potential (41.2.3). Under the condition
of slight degeneracy ($A \ll 1$), the ones in the denominators of
the quantum distribution functions $f_B$ and $f_F$ can be neglected,
and these functions become the classical Maxwellian-Boltzmann
distribution function $f_{M\text{-}B}$. Thus

$$ f_{M\text{-}B} = A\,e^{-E_i/kT}. $$

The degeneracy parameter is found from the condition of
normalization of the distribution function, $\sum_i \Delta N_i = N$ (41.2.2),
which indicates the conservation of the total number of particles
in the system. The condition for slight degeneracy is of the form

$$ A = \frac{n_0 h^3}{(2\pi m k T)^{3/2}} \ll 1, $$

where $n_0$ is the concentration of particles, $m$ is the mass of the
particles, $k$ is Boltzmann’s constant, $h$ is Planck’s constant,
and $T$ is the absolute temperature.
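The passage to the classical limit can be written out in one line. The sketch below assumes non-negative energies $E_i \ge 0$, so that $A\,e^{-E_i/kT} \le A$; the upper sign refers to the Fermi-Dirac function and the lower sign to the Bose-Einstein function.

```latex
\[
f_{F,B} = \frac{1}{A^{-1} e^{E_i/kT} \pm 1}
        = \frac{A\,e^{-E_i/kT}}{1 \pm A\,e^{-E_i/kT}}
        \approx A\,e^{-E_i/kT}
        \qquad (A \ll 1).
\]
```

The correction term in the denominator is of order $A$, so both quantum functions differ from the classical one only by a relative amount of order $A$.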

41.3.3 The degeneracy temperature $T_d$ is the temperature at
which degeneracy becomes appreciable (41.3.1). It is determined
from the condition

$$ A = \frac{n_0 h^3}{(2\pi m k T_d)^{3/2}} = 1, \quad \text{from which} \quad T_d = \frac{h^2 n_0^{2/3}}{2\pi m k}. $$

The temperature criterion of degeneracy is as follows: when
$T < T_d$ the system of particles is degenerate; when $T \gg T_d$
the system of particles is not degenerate, and its behaviour is
described by classical laws.

For example, the degeneracy parameter $A \approx 3 \times 10^{-5} \ll 1$
for hydrogen under standard conditions ($T = 300$ K and
$n_0 \approx 3 \times 10^{25}$ m$^{-3}$). The degeneracy temperature for hydrogen
is $T_d \approx 1$ K. For all other gases, heavier than hydrogen, $A$
is even smaller. Under standard conditions gases are not
degenerate. Degeneracy, associated with the quantum properties of
gases, is manifested to a considerably lesser degree than the
deviations of gases from an ideal gas, which are due to intermolecular
interaction. Photon gas is always degenerate ($T_d = \infty$) because
the rest mass of a photon equals zero ($m_0 = 0$, Table 43.1).
Photon gas is described by Bose-Einstein quantum statistics
(41.2.6).
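The hydrogen estimate can be reproduced with a few lines of code. This is a sketch under one explicit assumption: the particle mass is taken as that of a single hydrogen atom (about 1.67 × 10^-27 kg), a value not stated in the text. Function and constant names are illustrative.

```python
import math

H = 6.62607015e-34   # Planck's constant, J*s
K_B = 1.380649e-23   # Boltzmann's constant, J/K


def degeneracy_parameter(n0, mass, temperature):
    """A = n0 h^3 / (2 pi m k T)^(3/2)."""
    return n0 * H**3 / (2.0 * math.pi * mass * K_B * temperature) ** 1.5


def degeneracy_temperature(n0, mass):
    """T_d = h^2 n0^(2/3) / (2 pi m k), the temperature at which A = 1."""
    return H**2 * n0 ** (2.0 / 3.0) / (2.0 * math.pi * mass * K_B)


M_H = 1.6735e-27   # assumed: mass of one hydrogen atom, kg
N0 = 3e25          # concentration quoted in the text, m^-3

a_h = degeneracy_parameter(N0, M_H, 300.0)
t_d_h = degeneracy_temperature(N0, M_H)
print(f"A = {a_h:.2e}, T_d = {t_d_h:.2f} K")
```

With the atomic mass the parameter A comes out at a few times 10^-5, in line with the quoted value, and T_d is a fraction of a kelvin, the same order of magnitude as the 1 K given in the text.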

41.3.4 Electrons in metals are an example of a degenerate gas.
Under ordinary conditions $n_0 \approx 10^{28}$ to $10^{29}$ m$^{-3}$. Since the mass
of the electron is small ($m \approx 10^{-30}$ kg), $T_d \approx (16 \text{ to } 20) \times 10^3$ K.
Electron gas does not comply with the classical
Maxwell-Boltzmann statistics.

41.3.5  The  energy  of  degenerate  electron  gas  (41.4.5),  and 
other  gases  described  by  quantum  statistics,  is  not  a  linear 
function  of  the  temperature.  Therefore,  the  simple  physical 
interpretation  of  the  absolute  temperature  (8.3.4)  is  not  suitable 
in  the  region  of  degenerate  gases  (41.3.3). 


41.4  Degenerate  Electron  Fermi  Gas  in  Metals 

41.4.1 The energy distribution of conduction electrons in
metals is described by the Fermi-Dirac distribution function
(41.2.5):

$$ f_F = \frac{1}{\exp[(E - \mu)/kT] + 1}. $$

The number of quantum states of electrons per unit volume of
metal, within the energy range from $E$ to $E + dE$ (41.1.4), is

$$ dg(E) = \frac{d\Gamma}{h^3} = \frac{4\pi p^2\,dp}{h^3} = \frac{2\pi (2m)^{3/2} E^{1/2}\,dE}{h^3}. $$

Use is made here of the relation between the momentum $p$ of
the electron and its energy:

$$ p^2 = 2mE, \qquad dp = \left(\frac{m}{2E}\right)^{1/2} dE. $$

The number of electrons $dn_0(E)$ in unit volume of the metal,
having energies within the range from $E$ to $E + dE$, equals,
according to Sect. 41.2.2:

$$ dn_0(E) = 2 f_F\,dg(E) = \frac{4\pi (2m)^{3/2}}{h^3}\,\frac{E^{1/2}\,dE}{\exp[(E - \mu)/kT] + 1}. $$

Factor 2 takes into account the fact that, in accordance with
the Pauli exclusion principle (39.3.1), each such state can be
occupied by two electrons with opposite spins. This equation
expresses the energy distribution law for conduction electrons
in a metal.

41.4.2 The unusual properties of electron gas have an effect
on its behaviour at the absolute zero temperature ($T = 0$ K).
If $\mu_0$ is the chemical potential of the gas at $T = 0$ K (41.2.3),
then

$$ f_F = \frac{1}{e^{(E - \mu_0)/kT} + 1}. $$

At $E < \mu_0$ the difference $E - \mu_0 < 0$ and

$$ f_F = \frac{1}{\dfrac{1}{e^{(\mu_0 - E)/kT}} + 1}; $$

as $T \to 0$, $\exp[(\mu_0 - E)/kT] \to \infty$ and $f_F \to 1$.
At $E > \mu_0$ the difference $E - \mu_0 > 0$ and, as $T \to 0$,
$\exp[(E - \mu_0)/kT] \to \infty$ and $f_F \to 0$.

The curve of the Fermi distribution function $f_F$ at $T = 0$ K
is given in Fig. 41.2. In the energy range from 0 to $\mu_0$, function
$f_F$ is equal to unity. In passing through the value $E = \mu_0$,
the function drops abruptly to zero. At $T = 0$ K, the electrons
of the metal occupy all allowed energy levels with a population
equal to unity (41.2.2) up to the level with the energy $E = \mu_0$.
All levels with energy exceeding $\mu_0$ are empty; their population
equals zero. Thus, $\mu_0$ is the maximum energy that conduction
electrons can have at absolute zero temperature. This quantity
is called the Fermi energy: $\mu_0 = E_F$. If the consecutive numbers
of the energy levels are plotted along the axis of abscissas in
Fig. 41.2, and the corresponding numbers of conduction electrons
per unit volume are the ordinates, the maximum ordinate
corresponds to the energy level number $n_0/2$, where $n_0$ is the
electron concentration. The highest level occupied by electrons
is called the Fermi level. It corresponds to the Fermi energy $E_F$,
which an electron has at this level.

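41.4.3 The energy distribution law for conduction electrons at
$T = 0$ K is of the form

$$ dn_0(E) = \frac{4\pi (2m)^{3/2} E^{1/2}\,dE}{h^3}. $$

It is represented graphically by the curve in Fig. 41.3.

The total number of conduction electrons in unit volume of
a metal is

$$ n_0 = \int_0^{E_F} dn_0(E) = \frac{4\pi (2m)^{3/2}}{h^3} \int_0^{E_F} E^{1/2}\,dE = \frac{8\pi (2m)^{3/2}}{3h^3}\,E_F^{3/2}. $$

Hence

$$ E_F = \frac{h^2}{2m} \left(\frac{3n_0}{8\pi}\right)^{2/3}. $$

The average energy $\langle E \rangle$ of the electron at $T = 0$ K is

$$ \langle E \rangle = \frac{3}{5}\,E_F = \frac{3}{5}\,\frac{h^2}{2m} \left(\frac{3n_0}{8\pi}\right)^{2/3}. $$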

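At $n_0 = 6 \times 10^{28}$ m$^{-3}$ we obtain, after substituting the values
of $h$ and $m$:

$$ E_F \approx 9 \times 10^{-19}\ \text{J} \approx 5.6\ \text{eV}, \qquad \langle E \rangle = \tfrac{3}{5} E_F \approx 5.4 \times 10^{-19}\ \text{J} \approx 3.4\ \text{eV}. $$

If we compare the Fermi energy with the average energy of a molecule
of nondegenerate monatomic gas, $\langle E \rangle = 3kT/2$ (10.2.5), we
find that the molecule could have the energy $E_F$ only at a
temperature $T \sim 10^4$ K. In other words, under ordinary conditions,
$kT/E_F \ll 1$.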
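The substitution can be repeated numerically. The sketch below evaluates $E_F = (h^2/2m)(3n_0/8\pi)^{2/3}$ and $\langle E \rangle = 3E_F/5$ for the concentration quoted above; the constants are standard SI values and the function name is illustrative.

```python
import math

H = 6.62607015e-34      # Planck's constant, J*s
M_E = 9.1093837e-31     # electron mass, kg
EV = 1.602176634e-19    # joules per electron-volt


def fermi_energy(n0, mass=M_E):
    """E_F = (h^2 / 2m) (3 n0 / 8 pi)^(2/3), in joules."""
    return H**2 / (2.0 * mass) * (3.0 * n0 / (8.0 * math.pi)) ** (2.0 / 3.0)


e_f = fermi_energy(6e28)
mean_e = 0.6 * e_f      # average energy at T = 0 K

print(f"E_F = {e_f:.2e} J = {e_f / EV:.2f} eV")
print(f"<E> = {mean_e:.2e} J = {mean_e / EV:.2f} eV")
```

The Fermi energy obtained this way is close to 9 × 10^-19 J, i.e. several electron-volts, which is hundreds of times the room-temperature value of kT.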

41.4.4 At a temperature different from zero the Fermi-Dirac
distribution function $f_F$ is of the form shown in Fig. 41.4 by
the full-line curve. The same figure shows the function $f_F$ at
$T = 0$ K in dash lines. The curves differ in the kind of drop
the function undergoes near the value $E = \mu$. An abrupt drop
along a vertical line at $E = \mu_0 = E_F$ for the case when
$T = 0$ K (Fig. 41.2) changes to the smooth curve ABC at
$T \ne 0$ K. Of importance is the fact that the states of the electrons,
at the energy levels that satisfy the condition $E \ll \mu$, do not
change when heated from 0 K to $T$. Distortion of the function
$f_F$ occurs only at its “tail-end”, in an energy range of the width
$2kT$, near the value $E = \mu$, at which $f_F = 1/2$.

41.4.5 The chemical potential $\mu$ (41.2.3) of an electron gas at
the temperature $T$ is

$$ \mu = E_F \left[1 - \frac{\pi^2}{12} \left(\frac{kT}{E_F}\right)^2\right]. $$

The internal energy $U$ (9.1.2) of a mole of electron gas is

$$ U = \frac{3}{5}\,N_A E_F \left[1 + \frac{5\pi^2}{12} \left(\frac{kT}{E_F}\right)^2\right]. $$

Here $N_A$ is Avogadro’s number (Appendix II), and $k$ is
Boltzmann’s constant (8.4.5). Quantities $\mu$ and $U$ remain practically
constant when the temperature is raised.

The molar heat capacity $C_{V\mu}$ (9.5.2) of an electron gas is

$$ C_{V\mu} = \left(\frac{\partial U}{\partial T}\right)_V = \frac{\pi^2}{2}\,N_A k\,\frac{kT}{E_F}. $$

A comparison with the heat capacity of a nondegenerate
monatomic gas (10.7.1), $C_{V\mu}^{\text{class}} = 3N_A k/2$, indicates that

$$ \frac{C_{V\mu}}{C_{V\mu}^{\text{class}}} = \frac{\pi^2}{3}\,\frac{kT}{E_F}. $$

At room temperature $kT/E_F \approx 0.01$ (41.4.3), and therefore
$C_{V\mu}/C_{V\mu}^{\text{class}} \approx 0.03$. The heat capacity of degenerate electron
gas is negligibly small. This is so because only a small number
of electrons, within the “drop region” of the Fermi-Dirac
distribution function $f_F$ (hatched regions 1 and 2 in Fig. 41.4),
participate in the process of changing the internal energy of
the electron gas when it is heated. This removes one of the chief
shortcomings of the classical electron theory of the electrical
conduction of metals (20.3.7).
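To get a feel for the size of this ratio, the sketch below evaluates $(\pi^2/3)(kT/E_F)$ at room temperature. The Fermi energy of 5.6 eV is an assumed illustrative value of the order obtained in Sect. 41.4.3; the function name is illustrative.

```python
import math

K_B_EV = 8.617333e-5   # Boltzmann's constant, eV/K


def heat_capacity_ratio(temperature, fermi_energy_ev):
    """Electron-gas to classical molar heat capacity: (pi^2 / 3) kT / E_F."""
    return math.pi**2 / 3.0 * K_B_EV * temperature / fermi_energy_ev


E_F_EV = 5.6   # assumed illustrative Fermi energy, eV
ratio = heat_capacity_ratio(300.0, E_F_EV)
print(f"kT/E_F = {K_B_EV * 300.0 / E_F_EV:.4f}, C_V / C_V(class) = {ratio:.3f}")
```

The result is of the order of 10^-2, the same order as the rounded estimate in the text, and it grows linearly with the temperature.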


41.5  Quantum  Theory  of  Electrical  Conduction 
in  Metals 

41.5.1 The theory of the electrical conduction of metals, based
on quantum mechanics (38.1.1) and Fermi-Dirac quantum
statistics (41.2.5), is called the quantum theory of electrical
conduction in metals. Using the Fermi-Dirac distribution function
(41.2.5) in this theory, Ohm’s law for current density (20.3.4)
was derived. Thus

$$ j = \kappa E. $$

41.5.2 In the quantum theory of electrical conduction, the
electrical conductivity $\kappa$ is calculated by the formula

$$ \kappa = \frac{n_0 e^2 \langle\lambda_F\rangle}{m u_F}, $$

where $n_0$ is the number of conduction electrons per unit volume
of the metal, $\langle\lambda_F\rangle$ is the mean free path of the electron (20.3.1)
having Fermi energy (41.4.2), and $u_F$ is the velocity of thermal
motion of such an electron. Notwithstanding the external
resemblance between this formula and the formula for $\kappa$ in the
classical electron theory (20.3.4), it has an entirely different
physical content and, in contrast to the classical formula, it
completely agrees with the experimental data.

41.5.3 Also explained in the quantum theory of electrical
conduction of metals is the dependence of electrical conductivity
on temperature ($\kappa \propto 1/T$), as well as the anomalously large
mean free path of the electron in a metal (20.3.7). The ordered
motion of electrons in a metal (electric current) is dealt with
in quantum theory as the process of propagation of electron
de Broglie waves (38.1.4) scattered by anharmonic thermal
vibrations of the lattice points in the metal (40.2.4). An
analogous phenomenon occurs in optics in the propagation of a light
wave through a turbid medium, containing scattering centres
(suspended particles in a liquid, colloidal solutions, etc.)
(34.3.1). When the distances between the scattering centres are
of an order comparable to the wavelength, scattering of the
light occurs and the intensity of the light decreases as it
propagates in the medium. At distances between the scattering centres
substantially less than the wavelength, the medium is optically
homogeneous and no light scattering occurs.

41.5.4  An  ideal  crystal  lattice,  free  of  all  violations  of  periodi¬ 
city  and  with  stationary  particles  at  the  lattice  points,  behaves 
like  an  optically  homogeneous  medium;  it  does  not  scatter 
electron  waves  and  conduction  electrons  pass  through  such 
a  lattice  without  resistance.  The  scattering  of  electron  waves 
occurs  only  when  the  periodicity  of  the  lattice  is  violated. 
These  are  inhomogeneities  that  act  as  scattering  centres.  Such 
centres  are,  for  example,  density  fluctuations  in  the  lattice 
(11.6.3),  resulting  from  anharmonic  thermal  vibrations  of  the 
positive ions of the metal. With the exception of superconductors
(41.6.1), this scattering is what leads to the existence of
electrical resistance in pure metals.

41.5.5 The scattering of the electron waves by the thermal
vibrations of the lattice points increases with the temperature,
whereas the mean free path of the electrons is reduced.

The quantity $\langle\lambda_F\rangle$ is calculated by the formula

$$ \langle\lambda_F\rangle = \frac{E d}{\pi n_0 k T}, $$

where $E$ is Young’s modulus (40.3.6), $d$ is the crystal lattice
constant, $n_0$ is the number of atoms in unit volume, $k$ is
Boltzmann’s constant (8.4.5), and $T$ is the absolute temperature.
Here the electrical conductivity is found to be inversely
proportional to the absolute temperature and independent of $n_0$
for monovalent metals*. Thus

$$ \kappa = \frac{e^2 E d}{\pi m u_F k T}. $$

This formula agrees with experimental data in the region of
room temperatures. The conductivity, for instance, obtained
for silver ($E = 10^{11}$ N/m$^2$, $d = 3 \times 10^{-10}$ m, $m = 9 \times 10^{-31}$ kg,
$kT \approx 4.2 \times 10^{-21}$ J, and $u_F = 1.4 \times 10^{6}$ m/s) is
$\kappa_{theor} \approx 5 \times 10^{7}$ 1/(Ω·m), whereas $\kappa_{exper} \approx 6.3 \times 10^{7}$ 1/(Ω·m).
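The silver estimate can be checked by direct substitution into $\kappa = e^2 E d/(\pi m u_F kT)$. In the sketch below the exponents of Young's modulus and of the Fermi velocity are assumptions: they are read as $10^{11}$ N/m$^2$ and $1.4 \times 10^6$ m/s because these orders of magnitude reproduce the theoretical conductivity quoted in the text. The function name is illustrative.

```python
import math

E_CHARGE = 1.602e-19   # elementary charge, C


def conductivity(young_modulus, lattice_constant, mass, fermi_velocity, kT):
    """kappa = e^2 E d / (pi m u_F k T), in 1/(ohm*m)."""
    return (E_CHARGE**2 * young_modulus * lattice_constant
            / (math.pi * mass * fermi_velocity * kT))


kappa = conductivity(
    young_modulus=1e11,       # N/m^2, assumed order of magnitude
    lattice_constant=3e-10,   # m
    mass=9e-31,               # kg
    fermi_velocity=1.4e6,     # m/s, assumed order of magnitude
    kT=4.2e-21,               # J
)
print(f"kappa = {kappa:.2e} 1/(ohm*m)")
```

With these inputs the result is a few times 10^7 1/(Ω·m), the same order as both the theoretical and the experimental values given for silver.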

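[Fig. 41.5: resistivity ρ versus temperature T.]
[Fig. 41.6: temperature range AB of the transition to a superconductive state.]

At very low temperatures the formulas given above are not
valid. The mean free path under these conditions is inversely
proportional to $T^5$, which agrees with experimental data.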

* The concentrations of conduction electrons and of ions in the
lattice coincide in such metals.

41.5.6 Besides density fluctuations, the cause of electron wave
scattering and the electrical resistance of metals is violation
in the periodicity of the crystal lattice due to the inclusions of
impurity atoms in the lattice. The resistivity $\rho$ of a metal
($\rho = 1/\kappa$) consists of two parts:

$$ \rho = \rho_T + \rho_{im}, $$

where $\rho_T$ is the resistivity due to the scattering of electron waves
by density fluctuations, and $\rho_{im}$ is the resistivity due to
scattering by impurities. Of importance is the fact that $\rho_{im}$ is
independent of the temperature. Since $\rho_T \to 0$ as $T \to 0$,
$\rho \to \rho_{im}$ as the temperature is lowered. The quantity
$\rho_{im}$ is called the residual resistivity. This resistivity remains a
property of the metal when it is cooled even down to absolute
zero. The dependence of the electrical resistivity on the
temperature is shown in Fig. 41.5. The point of intersection of the
extension of the curve (dash line) with the axis of ordinates
(to $T = 0$ K) is the residual resistivity.


41.6 Superconductivity

41.6.1 The phenomenon of superconductivity consists in the
practically complete disappearance of electrical resistivity
in certain metals and alloys at a certain temperature $T_c$, called
the transition temperature to a superconductive state. Substances
having this property are called superconductors. Today over
500 pure elements and alloys are known that display
superconductivity. The temperature range AB (Fig. 41.6) of the
transition to a superconductive state does not exceed thousandths of
a degree for pure specimens. It therefore proves expedient to
assign a definite value to $T_c$. The width of the range AB
depends on nonhomogeneities in the metal, primarily on the
presence of impurities and internal stresses in the specimen. The
temperatures $T_c$ known at the present time range from 0.155 K
(BiPt) to 23.2 K (Nb$_3$Ge).

The  isotope  effect  in  superconductors  is  the  fact  that  the  transi¬ 
tion  temperatures  Tc  of  various  isotopes  (42.1.3)  of  the  same 
superconducting  metal  are  inversely  proportional  to  the  square 
roots  of  their  atomic  masses. 
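In symbols, the isotope effect stated above can be written as follows; the labels $M_1$, $M_2$ for the atomic masses of two isotopes and $T_{c1}$, $T_{c2}$ for their transition temperatures are introduced here for illustration.

```latex
\[
T_c \sqrt{M} = \text{const}, \qquad
\frac{T_{c1}}{T_{c2}} = \sqrt{\frac{M_2}{M_1}} .
\]
```

A heavier isotope therefore has a slightly lower transition temperature, which points to the part played by the crystal lattice in superconductivity.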

41.6.2  A  sufficiently  strong  magnetic  field  at  a  given  tempera¬ 
ture  destroys  the  superconductive  state  of  a  substance.  When 
a  magnetic  field  is  applied  to  a  superconductor,  the  transition 
temperature  Tc  (41.6.1)  is  lowered. 

A magnetic field of the strength $H_c$ which, at the given
temperature, causes the transition of a substance from the
superconductive to the normal state is called the critical field. As the
temperature of the superconductor is lowered, the field strength $H_c$
increases. As a first approximation: $H_c = H_0 [1 - (T/T_c)^2]$,
where $H_0$ is the critical field at $T = 0$ K.
The curve in Fig. 41.7 separates the superconductive and normal
states of a substance.
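The parabolic approximation for the critical field is easy to tabulate. The sketch below is illustrative: the values of $H_0$ and $T_c$ are hypothetical, and the helper `is_superconducting` simply tests on which side of the curve a point (T, H) lies.

```python
def critical_field(temperature, h0, t_c):
    """First-approximation critical field H_c = H_0 [1 - (T/T_c)^2]."""
    if temperature >= t_c:
        return 0.0  # no superconductive state at or above T_c
    return h0 * (1.0 - (temperature / t_c) ** 2)


def is_superconducting(field, temperature, h0, t_c):
    """True if the point (T, H) lies below the critical-field curve."""
    return field < critical_field(temperature, h0, t_c)


H0 = 1.0    # hypothetical critical field at T = 0 K, arbitrary units
T_C = 4.0   # hypothetical transition temperature, K

for t in (0.0, 2.0, 3.0, 4.0):
    print(f"T = {t:.1f} K: H_c = {critical_field(t, H0, T_C):.4f}")
```

The field falls from $H_0$ at absolute zero to zero at $T_c$, so a given applied field destroys superconductivity at a temperature below $T_c$, as described in Sect. 41.6.2.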

The superconductive properties of a conductor disappear when
a sufficiently heavy current is passed through it. This is
due to the action of the magnetic field, set up by the current,
which destroys the superconductive state.

41.6.3 An external magnetic field, weaker than the critical
field (41.6.2), does not penetrate into the interior of a
superconductor. Hence, the magnetic induction $B$ (23.1.2) is always
zero in the bulk of the superconductor. Shown in Fig. 41.8 is
a uniform magnetic field (23.1.3) directed along the axis of
a cylindrical superconductor. The superconductor seems to
“push” the magnetic field out of the space it itself occupies and
is an ideal diamagnetic material with a magnetic susceptibility
of $\chi_m = -1$ (26.4.5). Here the relative magnetic permeability
$\mu_r = 1 + \chi_m = 0$ and $B = \mu_0 \mu_r H = 0$.

[Fig. 41.8: uniform magnetic field directed along the axis of a cylindrical superconductor.]

41.6.4  The  transition  of  a  substance  to  the  superconductive 
state  is  accompanied  by  changes  in  its  thermal  properties. 
In  the  absence  of  a  magnetic  field  and  at  the  transition  temper¬ 
ature  Tc  (41.6.1),  there  is  an  abrupt  change  in  the  heat  capacity 
(9.5.1).  In  the  presence  of  a  magnetic  field  an  isothermal  transi¬ 
tion  from  the  superconductive  to  the  normal  state  is  associated 
with  a  jump  in  the  heat  conductivity  and  heat  capacity.  These 
phenomena  are  typical  features  of  a  second-order  phase  transi¬ 
tion  (12.4.2). 

41.6.5 In quantum-mechanical theory, the phenomenon of
superconductivity is considered to be the superfluidity (12.4.3)
of electrons in a metal, with the absence of friction inherent
in superfluidity. The conduction electrons move along a
superconductor unimpeded, i.e. without any “friction” between them
and the crystal lattice points. The main feature of
superconductors is that the mutual attraction of the electrons in them leads
to the pairing of electrons. The cause of this attraction is
interaction between electrons, supplementary to Coulomb repulsion,
accomplished by the action of the crystal lattice and leading
to the pairing. A model of this attraction is illustrated in
Fig. 41.9. Conduction electron $e_1$ attracts ion I of the crystal
lattice to itself, displacing the ion from its equilibrium
position. This changes the electric field in the crystal; ion I sets
up an electric field that acts on the conduction electrons,
including electron $e_2$. Thus the interaction between electrons $e_1$ and
$e_2$ is accomplished by means of the crystal lattice. The
displacement of the ion by an electron leads to a situation in which
the electron is surrounded by a “cloud” of positive charge, which
exceeds the intrinsic negative charge of the electron. Together
with this “cloud” the electron has a net positive charge and is
thereby attracted to another electron.

41.6.6  In  the  quantum  theory  of  metals  attraction  between 
electrons  (phonon  exchange)  is  associated  with  the  setting  up 
of  elementary  excitations  (quasi-particles)  of  the  crystal  lattice. 
An  electron,  moving  in  a  crystal  and  interacting  with  another 
electron  by  means  of  the  lattice,  brings  the  lattice  into  an  excit¬ 
ed  state.  As  the  lattice  returns  to  the  ground  state  it  emits  a 
quantum  of  energy  of  acoustic  frequency,  called  a  phonon 
(41.7.5),  which  is  absorbed  by  the  other  electron.  The  attraction 
between  electrons  can  be  conceived  of  as  their  exchange  of 
phonons. This attraction is found to be most effective if the
momenta  of  the  interacting  electrons  are  anti-parallel. 

Fig. 41.9

41.6.7 The initiation of a superconductive state in a substance is associated with the possibility of forming bound pairs of electrons in the metal. The distance $\delta$ between the electrons of a pair equals

$$\delta \approx \frac{\hbar v_F}{k T_c},$$

where $\hbar$ is Planck's constant, $v_F$ is the velocity of an electron at the Fermi level (41.5.2), k is Boltzmann's constant (8.4.5), and $T_c$ is the transition temperature to a superconductive state (41.6.1). An estimation indicates that $\delta \approx 10^{-4}$ cm, i.e. the electrons forming a pair are at a distance from each other of the order of $10^4$ crystal lattice constants. The whole electronic system of a superconductor is a bound aggregate extending over huge distances on the atomic scale.
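
The order of magnitude quoted for the pair size can be checked numerically. The sketch below assumes that the estimate has the form $\delta \approx \hbar v_F/(kT_c)$; the Fermi velocity and the transition temperature are illustrative assumptions, not values given in this handbook, and the function name is hypothetical.

```python
# Physical constants (SI units)
HBAR = 1.054571817e-34   # reduced Planck constant, J*s
K_B = 1.380649e-23       # Boltzmann constant, J/K


def pair_size(v_fermi, t_c):
    """Order-of-magnitude distance between paired electrons, in metres.

    Assumes the estimate delta ~ hbar * v_F / (k * T_c).
    """
    return HBAR * v_fermi / (K_B * t_c)


# Illustrative inputs (assumptions, typical orders of magnitude for a metal)
V_FERMI = 1.0e6   # Fermi velocity, m/s
T_C = 5.0         # transition temperature, K

delta_m = pair_size(V_FERMI, T_C)
print(f"delta ~ {delta_m * 100:.1e} cm")
```

With these inputs the result is of the order of $10^{-4}$ cm, i.e. thousands of lattice constants.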

If  at  arbitrarily  low  temperatures  Coulomb  repulsion  between 
electrons  predominates  over  the  attraction  that  forms  pairs, 
the  substance  (metal  or  alloy)  remains  normal  with  respect 
to  its  electrical  properties.  But  if  the  forces  of  attraction  pre¬ 
dominate  over  those  of  repulsion  at  the  temperature  Tc  (41.6.1), 
the  substance  goes  over  to  the  superconductive  state. 

41.6.8  The  most  important  feature  associated  with  the  pair 
in  the  aggregate  of  electrons  in  a  superconductor  is  the  impossi¬ 
bility  of  energy  exchange  between  the  electrons  and  the  lattice 
in  small  portions,  smaller  than  the  binding  energy  of  paired 
electrons.  This  means  that  in  collisions  between  the  electrons 
and  the  crystal  lattice  points,  the  energy  of  the  electrons  does 
not  change,  and  the  substance  behaves  as  a  superconductor  with 
zero  resistivity.  A  quantum-mechanical  analysis  shows  that 
there  is  no  scattering  of  electron  waves  by  the  thermal  vibra¬ 
tions  of  the  lattice  or  impurities.  This  is  what  indicates  the 
absence  of  electrical  resistance. 

41.6.9 A definite amount of energy is required to destroy the state of superconductivity. At $T \geq T_c$ (41.6.1), the bound states of the paired electrons are broken up, attraction between the electrons ceases and superconductivity no longer exists.

41.6.10  Superconductivity  is  being  used  to  produce  strong 
magnetic  fields  because  there  are  no  thermal  losses  when  heavy 
currents,  required  to  set  up  strong  magnetic  fields,  are  passed 
through  superconductors.  But,  since  a  magnetic  field  destroys 
the  state  of  superconductivity  (41.6.2),  special  type-2  super¬ 
conductors  have  been  developed  for  obtaining  strong  magnetic 
fields.  These  are  certain  alloys  and  superconducting  thin  films. 
In such superconductors magnetic fields with a field strength greater than $H_c$ (41.6.2) penetrate the substance in the form of threads piercing the specimen. The substance between the threads is found to be superconductive, and heavy currents can set up ultrastrong magnetic fields. Magnets, based on the application of superconducting solenoids, are being extensively used.

41.7 Heat Capacity of Solids

41.7.1 No distinction is made for solids between the heat capacities $C_V$ and $C_p$ (9.5.4 and 9.5.7). In nonmetallic solids, the largest contribution to heat capacity is made by the energy of thermal vibration of the particles at the crystal lattice points. For metals, a slight contribution is made by the degenerate electron gas (41.4.5).

41.7.2 The classical theory of the heat capacity of solids is based on the principle of equipartition of energy (10.6.4). A homogeneous solid is dealt with as a system of particles, independent of one another, having three degrees of freedom and executing thermal vibrations with the same frequency. The average energy $\langle E \rangle$ per degree of freedom is $\langle E \rangle = kT$ (10.6.5). The internal energy U (9.1.2) per mole of a solid is

$$U = 3N_A \langle E \rangle = 3N_A kT = 3RT,$$

where $N_A$ is Avogadro's number (Appendix II), k is Boltzmann's constant (8.4.5), and $R = kN_A$ is the universal gas constant (8.4.4). The molar heat capacity* (9.5.4) of a solid with an atomic crystal lattice is

$$C_\mu = \frac{dU}{dT} = 3R = 25\ \frac{\text{J}}{\text{mol}\cdot\text{K}} = 5.97\ \frac{\text{cal}}{\text{mol}\cdot\text{K}}.$$


The Dulong-Petit law states that the molar heat capacity of all chemically simple crystalline solids equals approximately 6 cal/(mol·K). According to this law the molar heat capacity does not depend on either the temperature or any characteristics
of  the  crystals.  Experiments  disprove  this  statement  and  indicate 
that  the  heat  capacity  depends  on  the  temperature,  especially 
in  the  region  of  low  temperatures  (Fig.  41.10).  The  reason  why 
the  classical  theory  of  the  heat  capacity  of  solids  disagrees  with 
experimental  data  is  in  the  limited  nature  of  the  principle  of 
equipartition  of  energy  and  its  inadequacy  in  the  region  of  low 
temperatures. At these temperatures the average energy of the vibrating particles must be calculated in accordance with the laws of quantum mechanics (38.5.9).

* Frequently, the heat capacity is referred to one gram-atom of the substance.
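
As a quick arithmetic check of the classical value $3R$ quoted above, the following minimal sketch evaluates it in both unit systems. The calorie conversion factor (the thermochemical calorie, 4.184 J) is an assumption; the handbook does not state which convention it uses, so the last digit may differ slightly.

```python
R = 8.314462618             # universal gas constant, J/(mol*K)
JOULES_PER_CALORIE = 4.184  # thermochemical calorie (assumed convention)


def dulong_petit_heat_capacity():
    """Classical molar heat capacity of a simple crystalline solid, J/(mol*K)."""
    return 3.0 * R


c_joule = dulong_petit_heat_capacity()
c_cal = c_joule / JOULES_PER_CALORIE
print(f"3R = {c_joule:.2f} J/(mol*K) = {c_cal:.2f} cal/(mol*K)")
```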

41.7.3 In the initial quantum theory of heat capacity of solids, a crystal was treated as a system of N atoms, each of which is a quantum harmonic oscillator. All the atoms vibrate independently of one another with the same frequency $\omega$. The average energy $\langle E \rangle$ per degree of freedom of the atom (as a quantum harmonic oscillator) is equal (38.5.9) to

$$\langle E \rangle = \frac{\hbar\omega}{\exp(\hbar\omega/kT) - 1} = \frac{h\nu}{\exp(h\nu/kT) - 1}.$$

The internal energy U (9.1.2) of a mole of a solid is expressed as follows:

$$U = 3N_A \langle E \rangle = 3N_A \frac{h\nu}{\exp(h\nu/kT) - 1},$$

from which the molar heat capacity of a solid can be found:

$$C_\mu = 3R \left(\frac{h\nu}{kT}\right)^2 \frac{\exp(h\nu/kT)}{[\exp(h\nu/kT) - 1]^2}.$$

If we introduce the characteristic temperature $T_E = h\nu/k$, then

$$C_\mu = 3R \left(\frac{T_E}{T}\right)^2 \frac{\exp(T_E/T)}{[\exp(T_E/T) - 1]^2}.$$

This result qualitatively describes the dependence of the heat capacity of solids on the temperature. Curves of this dependence are plotted in Fig. 41.10.

At high temperatures ($h\nu \ll kT$), as shown in Sect. 38.5.9, $\langle E \rangle = kT$ in accordance with the law of equipartition of energy (10.6.4) and $C_\mu = 3R$.

At low temperatures ($h\nu \gg kT$)

$$e^{h\nu/kT} - 1 \approx e^{h\nu/kT} \quad \text{and} \quad C_\mu = 3R \left(\frac{T_E}{T}\right)^2 e^{-T_E/T}.$$

As $T \to 0$, $T_E/T \to \infty$ and $\exp(-T_E/T) \to 0$ more rapidly than the increase in $(T_E/T)^2$. Therefore, as $T \to 0$, the heat capacity $C_\mu \to 0$. This qualitatively agrees with experimental data. But simple quantum theory does not explain the quantitative behaviour of solids in the vicinity of absolute zero.
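
The two limiting cases of the single-frequency model can be illustrated numerically. The sketch below evaluates the molar heat capacity $3R\,(T_E/T)^2 e^{T_E/T}/(e^{T_E/T}-1)^2$ as a function of temperature; the characteristic temperature of 300 K is an arbitrary illustrative assumption and the function name is hypothetical.

```python
import math

R = 8.314462618  # universal gas constant, J/(mol*K)


def einstein_heat_capacity(t, t_e):
    """Molar heat capacity of the single-frequency model, J/(mol*K).

    t   -- absolute temperature, K
    t_e -- characteristic temperature h*nu/k, K
    """
    x = t_e / t
    # exp(x)/(exp(x)-1)^2 is rewritten as exp(-x)/(exp(-x)-1)^2
    # so that large x (low temperature) does not overflow.
    em1 = math.expm1(-x)  # exp(-x) - 1
    return 3.0 * R * x * x * math.exp(-x) / (em1 * em1)


T_E = 300.0  # illustrative characteristic temperature, K (assumption)
for t in (10.0, 100.0, 300.0, 3000.0):
    print(f"T = {t:6.0f} K   C = {einstein_heat_capacity(t, T_E):8.4f} J/(mol*K)")
```

At temperatures well above the characteristic temperature the value approaches $3R$, and well below it the value falls off exponentially, as stated in the text.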

41.7.4 The assumption that all the atoms of a solid execute
thermal  vibrations  independently  of  one  another  at  the  same 
frequency  oversimplifies  the  true  picture  of  particle  vibration 
in  a  crystal  lattice.  The  interaction  between  the  atoms  (or  other 
particles)  of  a  solid  is  so  strong  that  all  N  particles  of  the  body 
form  a  bound  system  having  3 N  degrees  of  freedom,  and  the 
vibration  of  all  the  atoms  may  proceed  at  various  frequencies. 
The  extremely  complex  problem  of  the  frequency  distribution 
of  atomic  vibration  in  a  solid  was,  in  its  time,  the  basis  for  the 
refined  theory  of  heat  capacity  of  solids.  A  solid  has  a  wide 
spectrum  of  vibrational  frequencies.  There  are  vibrations  with 
sufficiently  low  and  with  higher  frequencies.  The  low  frequencies 
correspond  to  elastic  vibration  of  the  crystal  in  the  acoustic  (or 
ultrasonic)  range.  Owing  to  the  bonds  between  the  particles  of 
a  crystal  lattice,  elastic  acoustic  waves  propagate  in  the  crystal. 

The  physical  idea  of  the  theory  of  the  heat  capacity  of  solids, 
refining  the  theory  discussed  in  Sect.  41.7.3,  is  that  the  main 
contribution  to  the  energy  of  thermal  vibration  of  the  crystal 
is  made  by  low-frequency  vibration,  corresponding  to  elastic 
waves  with  wavelengths,  exceeding  the  crystal  lattice  constant. 
This  follows,  in  particular,  from  Fig.  38.7,  which  shows  that 
the maximum contribution to the average energy of a quantum oscillator is made by vibrations with low frequencies (38.5.9). The propagating elastic waves behave as if they were propagating in a continuous medium. The atomic structure of the crystal has no effect on the propagation in it of elastic waves of wavelengths exceeding $\lambda_{\min} = v/\nu_{\max}$, where v is the velocity of the corresponding elastic wave, and $\nu_{\max}$ is its frequency. The wavelength $\lambda_{\min}$ should be commensurable with the lattice constant, i.e. $\lambda_{\min} \approx (V/N)^{1/3}$, where N is the number of particles (crystal lattice points) in a crystal of volume V.
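
The order of magnitude of the shortest wavelength and of the corresponding highest frequency can be estimated from the relations above. In this sketch the number density of atoms (roughly that of copper) and the velocity of sound are illustrative assumptions, not data from this handbook.

```python
NUMBER_DENSITY = 8.5e28  # atoms per cubic metre, N/V (assumed, roughly copper)
SOUND_SPEED = 4.0e3      # velocity of the elastic wave, m/s (assumed)


def min_wavelength(number_density):
    """lambda_min ~ (V/N)^(1/3), in metres."""
    return number_density ** (-1.0 / 3.0)


def max_frequency(sound_speed, number_density):
    """nu_max = v / lambda_min, in hertz."""
    return sound_speed / min_wavelength(number_density)


lam = min_wavelength(NUMBER_DENSITY)
nu = max_frequency(SOUND_SPEED, NUMBER_DENSITY)
print(f"lambda_min ~ {lam:.1e} m, nu_max ~ {nu:.1e} Hz")
```

The wavelength comes out at a few ångströms and the frequency at about $10^{13}$ Hz, far above the audible range even though the waves are of acoustic origin.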

41.7.5 Elastic waves in a crystal have quantum properties consisting in the existence of a minimum portion of energy, a quantum of energy of the wave with the given frequency $\nu$. This enables a wave with the frequency $\nu$ to be correlated to a quasi-particle, the phonon, whose propagation at the velocity of sound v corresponds to an acoustic wave.

A phonon has the energy $E = h\nu$, where h is Planck's constant (Appendix II), and the quasi-momentum $p = h\nu/v$. The quasi-momentum p of the phonon is in the direction coinciding with that of the propagation of the acoustic wave. The most essential difference between the quasi-momentum and the momentum is that in the collision of phonons in crystals the quasi-momentum can be transferred in discrete portions to the crystal lattice. In this case it is not conserved*.

* A detailed discussion of the properties of quasi-particles, associated with the role of periodicity of crystal structure, is beyond the scope of this handbook.

In short, the quantization of the acoustic field leads to phonons in the same way as the quantization of the electromagnetic field leads to photons (37.1.4).

41.7.6 Phonon spin equals zero and, therefore, phonons comply with Bose-Einstein statistics (41.2.2). Phonons can be emitted and absorbed, so their number is not maintained constant; hence the chemical potential for phonon gas (41.2.3) equals zero (cf. Sect. 41.2.6).

The energy of a crystal can be dealt with as the energy of a phonon gas and can be calculated as in Sect. 41.2.6. The number dN of phonons with frequencies in the range from $\nu$ to $\nu + d\nu$ is

$$dN = \frac{dg}{e^{h\nu/kT} - 1},$$

where dg is the number of quantum states (number of cells) in volume V of the crystal:

$$dg = 3\,\frac{4\pi p^2\,dp\,V}{h^3} = 3\,\frac{4\pi \nu^2\,d\nu}{v^3}\,V,$$

where $p = h\nu/v$ is the quasi-momentum of the phonon, and v is the velocity of sound in the crystal. The factor 3 takes into account the fact that longitudinal and transverse waves, with two mutually perpendicular polarizations, can propagate in a crystal* (31.1.8). Thus

$$dN = \frac{12\pi \nu^2\,d\nu\,V}{v^3\,(e^{h\nu/kT} - 1)}.$$


The internal energy U (9.1.2) of a crystal (accurate to zero point energy) is

$$U = \int_0^{\nu_{\max}} h\nu\,dN = \frac{12\pi V h}{v^3} \int_0^{\nu_{\max}} \frac{\nu^3\,d\nu}{e^{h\nu/kT} - 1},$$

where $\nu_{\max} = v\,(3N/4\pi V)^{1/3}$ is the upper frequency boundary of the phonons that contribute to the energy of thermal vibration of the crystal. An assessment of the order of magnitude of $\nu_{\max}$ is given in Sect. 41.7.4.

Introduced in calculating U is the Debye characteristic temperature $T_D = h\nu_{\max}/k$. The following two limiting cases are considered.

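(a) High temperatures $T \gg T_D$.

Here $e^{h\nu/kT} - 1 \approx h\nu/kT$ and

$$U = \frac{12\pi V h}{v^3} \int_0^{\nu_{\max}} \frac{kT}{h\nu}\,\nu^3\,d\nu = \frac{12\pi V kT}{v^3} \int_0^{\nu_{\max}} \nu^2\,d\nu = \frac{12\pi V kT}{v^3} \cdot \frac{\nu_{\max}^3}{3} = 3NkT.$$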


*  The  difference  in  the  velocities  of  the  longitudinal  and  trans¬ 
verse  waves  is  ignored,  the  number  of  quantum  states  of  the 
phonons  being  simply  tripled. 
For one mole of the crystal $N = N_A$, where $N_A$ is Avogadro's number (Appendix II), and the molar heat capacity $C_\mu$ complies with the Dulong-Petit law (41.7.2):

$$C_\mu = \frac{dU}{dT} = 3N_A k = 3R.$$


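(b) Low temperatures $T \ll T_D$.

In the integral $\int_0^{\nu_{\max}} \frac{\nu^3\,d\nu}{\exp(h\nu/kT) - 1}$ a new variable, $\xi = h\nu/kT$, is introduced and the upper limit is replaced by $\infty$. Thus

$$U = \frac{12\pi V h}{v^3} \int_0^{\nu_{\max}} \frac{\nu^3\,d\nu}{\exp(h\nu/kT) - 1} \approx \frac{12\pi V h}{v^3} \left(\frac{kT}{h}\right)^4 \int_0^{\infty} \frac{\xi^3\,d\xi}{e^{\xi} - 1} = \frac{4\pi^5 V k^4 T^4}{5 h^3 v^3}.$$

The molar heat capacity is proportional to the cube of the absolute temperature and obeys the Debye $T^3$ law. Thus

$$C_\mu = \frac{dU}{dT} = \frac{16\pi^5 k^4 V T^3}{5 h^3 v^3} = \text{const} \cdot T^3.$$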
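
Between the two limiting cases the integral has to be evaluated numerically. The sketch below is one way to do this, under the following assumptions: the expression for U is written for one mole with $\nu_{\max}$ expressed through $T_D$ and differentiated with respect to T under the integral sign, which gives $C_\mu = 9R\,(T/T_D)^3 \int_0^{T_D/T} x^4 e^x/(e^x - 1)^2\,dx$; the integral is taken by Simpson's rule. The function names and the number of integration steps are illustrative choices.

```python
import math

R = 8.314462618  # universal gas constant, J/(mol*K)


def _integrand(x):
    """x^4 e^x / (e^x - 1)^2, written in a form that does not overflow."""
    if x == 0.0:
        return 0.0  # the integrand behaves as x^2 near zero
    em1 = math.expm1(-x)  # exp(-x) - 1
    return x ** 4 * math.exp(-x) / (em1 * em1)


def debye_heat_capacity(t, t_d, steps=2000):
    """Molar heat capacity of the phonon gas, J/(mol*K).

    t     -- absolute temperature, K
    t_d   -- Debye characteristic temperature, K
    steps -- even number of Simpson intervals
    """
    upper = t_d / t
    h = upper / steps
    total = _integrand(0.0) + _integrand(upper)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * _integrand(i * h)
    integral = total * h / 3.0
    return 9.0 * R * (t / t_d) ** 3 * integral


def debye_t3_law(t, t_d):
    """Low-temperature limit, (12 pi^4 / 5) R (T/T_D)^3."""
    return 12.0 * math.pi ** 4 / 5.0 * R * (t / t_d) ** 3
```

For one mole the low-temperature function is the same expression as $16\pi^5 k^4 V T^3/(5h^3v^3)$ with V and v eliminated in favour of $T_D$. Calling `debye_heat_capacity(t, t_d)` with `t` much greater than `t_d` gives a value close to $3R$, and with `t` much smaller than `t_d` a value close to `debye_t3_law(t, t_d)`.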


41.8  Band  Theory  of  Solids 

41.8.1  Long-range  order  in  crystals  (40.1.1)  leads  to  a  situation 
in  which  an  electric  field,  which  is  a  periodic  function  of  the 
coordinates,  exists  in  solids.  In  a  metal,  for  example,  where  the 
positive  ions  are  located  in  strict  order  at  the  lattice  points, 
the  potential  energy  of  an  electron  varies  along  some  direction 
OX  as  shown  in  Fig.  41.11.  Energy  minima  correspond  to  the 
places  where  the  positive  ions  are  located. 

41.8.2  A  periodic  electric  field  in  a  crystal  of  any  type  (40.1.3) 
essentially  changes  the  energy  states  of  the  electrons  in  a  solid 
as compared to their state in single atoms. In single atoms the
electrons  are  in  discrete  energy  states  (38.2.5).  In  a  solid,  the 
energy  states  of  the  electrons  are  determined  both  by  their 
interaction  with  the  nucleus  of  their  atom  and  by  the  electric 
field  of  the  crystal  lattice,  i.e.  by  their  interaction  with  other 
atoms.  As  a  result  of  this  interaction,  the  energy  levels  of  the 


electrons are split. Thus, in a solid containing N interacting atoms, N closely spaced energy levels are formed in place of the discrete energy level typical of a single atom. These N energy levels make up an energy band. A band energy spectrum of the electrons is formed in crystals.

41.8.3 The formation of a band energy spectrum in a crystal follows from the indeterminacy principle (38.6.6). Owing to the finite lifetime $\tau$ of the electron in an excited state ($\tau \approx 10^{-8}$ s) (39.1.5) the width $\Delta E$ of the energy level in a single atom is

$$\Delta E \approx \frac{\hbar}{\tau} \approx 10^{-7}\ \text{eV}$$

(natural energy level width).

The  valence  electrons  (39.3.9)  of  the  atoms  in  a  crystal  are  more 
weakly  bound  to  the  nuclei  than  the  inner-shell  electrons  and 
can  pass  from  one  atom  to  another  by  means  of  the  tunnel  effect 
(38.7.2).  In  this  way  electrons  penetrate  the  potential  barrier 
(38.7.1) separating the atoms of the crystal.

To assess the average lifetime $\tau$ of a valence electron in a given atom it is assumed that the penetrability D of the barrier is expressed by the simple formula (38.7.2)

$$D \approx \exp\left[-\frac{2}{\hbar}\sqrt{2m(U_0 - E)}\,L\right],$$

where $U_0 - E$ is the barrier height, which is of the order of the ionization energy and amounts to approximately 10 eV, and L is the barrier width, which is commensurate with the crystal lattice constant: $L \approx d \approx 10^{-10}$ m. The frequency $\nu$ of penetration of an electron through the barrier is

$$\nu = \frac{v}{d}\,D \approx \frac{v}{d}\exp\left[-\frac{2}{\hbar}\sqrt{2m(U_0 - E)}\,L\right],$$

where v is the velocity of the electron in the atom (potential well) and can be taken equal to $v \approx 10^6$ m/s. The average lifetime $\tau$ of an electron in the given atom is

$$\tau = \frac{1}{\nu} \approx \frac{d}{v}\exp\left[\frac{2}{\hbar}\sqrt{2m(U_0 - E)}\,L\right].$$

After substituting the values of all the quantities, $\tau \approx 10^{-15}$ s is obtained. From the indeterminacy relation

$$\Delta E \approx \frac{\hbar}{\tau} \approx 1\ \text{eV}.$$

Instead of the natural width $\Delta E \approx 10^{-7}$ eV of the electron energy level in a single atom, a band of allowed values of energy is formed in a crystal.

41.8.4  The  frequency  of  the  penetration  of  inner-shell  electrons 
in  atoms  through  a  potential  barrier  and  their  transitions  to 
other  atoms  is  negligible.  This  is  due  to  the  increase  in  barrier 
height: $U_0 - E \approx 10^3$ eV and the increase in barrier width: $L \approx 3 \times 10^{-10}$ m. According to calculations, $\tau \approx 10^{20}$ years.
The  broadening  of  the  energy  levels  of  the  inner-shell  electrons 
is  unessential,  and  these  electrons  behave  in  crystals  practically 
in  the  same  way  as  in  single  atoms. 
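
The estimates of Sects. 41.8.3 and 41.8.4 can be reproduced with a few lines of arithmetic. The sketch below assumes the penetrability in the form $D \approx \exp[-(2/\hbar)\sqrt{2m(U_0 - E)}\,L]$ with m the electron mass. For the inner-shell case it reuses the same d and v as for the valence electron, which is an assumption: the handbook does not state these values, and because of the exponential the result is extremely sensitive to them, so only the conclusion that the lifetime is astronomically long should be read from it.

```python
import math

HBAR = 1.054571817e-34   # reduced Planck constant, J*s
M_E = 9.1093837015e-31   # electron mass, kg
EV = 1.602176634e-19     # joules per electronvolt
SECONDS_PER_YEAR = 3.156e7


def barrier_exponent(height_ev, width_m):
    """(2/hbar) * sqrt(2 m (U0 - E)) * L, dimensionless."""
    return 2.0 / HBAR * math.sqrt(2.0 * M_E * height_ev * EV) * width_m


def lifetime(height_ev, width_m, d=1.0e-10, v=1.0e6):
    """Average time an electron stays in a given atom, tau = (d/v) / D, seconds."""
    return d / v * math.exp(barrier_exponent(height_ev, width_m))


def level_width_ev(tau):
    """Level broadening from the indeterminacy relation, hbar / tau, in eV."""
    return HBAR / tau / EV


tau_valence = lifetime(10.0, 1.0e-10)        # valence electron
width_valence = level_width_ev(tau_valence)
tau_inner = lifetime(1.0e3, 3.0e-10)         # inner-shell electron (d, v assumed unchanged)

print(f"valence: tau ~ {tau_valence:.1e} s, width ~ {width_valence:.2f} eV")
print(f"inner shell: tau ~ {tau_inner / SECONDS_PER_YEAR:.1e} years")
```

The valence-electron lifetime comes out at about $10^{-15}$ s and the level width at a fraction of an electronvolt, in line with the order-of-magnitude values given in the text.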

41.8.5  If  N  is  the  total  number  of  atoms  in  a  solid,  then  the 
energy  band,  formed  from  the  electronic  energy  level  of  the 
valence  electron  of  the  atom  consists  of  N  closely  spaced  levels. 
Adjacent energy levels in a band are at about $10^{-22}$ eV from each
other. 

The allowed energy bands are separated by regions that are forbidden bands of electron energy values. The width of the forbidden bands is commensurable with that of the allowed bands. With an increase in energy the width of the allowed energy bands increases and that of the forbidden bands decreases.
A  schematic  diagram  of  the  energy  bands  of  a  solid  is  shown 
in  Fig.  41.12. 

41.8.6  The  allowed  energy  bands  in  a  solid  can  be  filled  in 
different  ways  by  electrons.  In  limiting  cases  they  can  be  com¬ 
pletely  filled  or  entirely  empty  (41.9.1).  The  electrons  in  a  solid 
can  move  from  one  allowed  band  to  another.  For  an  electron 


to  move  from  a  lower  band  to  the  next  higher  one  it  is  necessary 
to  expend  energy  equal  to  the  width  of  the  forbidden  band  be¬ 
tween  the  two  allowed  ones  (of  the  order  of  several  eV). 

Very  little  energy  is  required  for  intraband  transitions.  For 
example, an energy of $10^{-4}$ to $10^{-8}$ eV is sufficient for this purpose, and it is acquired by an electron in a metal by the action
of  an  electric  field  over  a  free  path  length  at  ordinary  potential 
differences.  But  this  energy  is  insufficient  to  move  an  electron 
from  one  band  to  another.  The  action  of  thermal  excitation  can 
impart  various  energies  to  electrons,  sufficient  both  for  intra¬ 
band  and  interband  transitions. 
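
The energy scales compared in this section can be put side by side numerically. In the sketch below the energy acquired from the field over one free path is taken as $eE\lambda$; the free path length and the two field strengths are illustrative assumptions chosen to span the range quoted above, and the thermal scale is taken simply as kT.

```python
K_B_EV = 8.617333262e-5  # Boltzmann constant, eV/K


def field_energy_gain_ev(field_v_per_m, free_path_m):
    """Energy e*E*lambda gained over one free path.

    For a charge of one electron the value in eV equals E*lambda in volts.
    """
    return field_v_per_m * free_path_m


def thermal_energy_ev(temperature_k):
    """Characteristic thermal energy kT in eV."""
    return K_B_EV * temperature_k


FREE_PATH = 1.0e-8  # free path length in a metal, m (assumed)

gain_weak = field_energy_gain_ev(1.0, FREE_PATH)      # field of 1 V/m (assumed)
gain_strong = field_energy_gain_ev(1.0e4, FREE_PATH)  # field of 10^4 V/m (assumed)
kt_room = thermal_energy_ev(300.0)

print(f"field: {gain_weak:.0e} to {gain_strong:.0e} eV, kT at 300 K: {kt_room:.3f} eV")
```

Both field values are many orders of magnitude below a forbidden band width of several eV, which is why the field produces only intraband transitions.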

41.9 Metals and Dielectrics in the Band Theory

41.9.1  Differences  in  the  electric  properties  of  solids  are  ex¬ 
plained  by  the  band  theory  (41.8.2)  by  the  fact  that  the  allowed 
energy  bands  (41.8.2)  are  differently  filled  by  electrons  and  by 
the  different  widths  of  the  forbidden  bands  (41.8.5).  These  two 
factors  determine  whether  a  given  solid  is  a  conductor  of  electric 
current  or  is  a  dielectric.  A  necessary  condition  for  a  solid  to  be 
a  conductor  is  the  availability  of  empty  energy  levels  to  which 
the  electric  field  can  raise  the  electrons.  It  should  be  taken  into 
consideration  that  the  field  can  bring  about  only  intraband 
transitions of the electrons (41.8.6).

41.9.2  Typical  examples  of  conductors  are  the  metals  of  the 
first  group  of  Mendeleev’s  periodic  table  (39.3.5),  for  instance, 
sodium.  In  a  single  atom  of  sodium  there  are  two  filled  electron 
shells,  containing  2  and  8  electrons,  respectively  (39.3.4). 
According  to  the  Pauli  principle  (39.3.1),  the  eleventh  valence 
electron  of  the  sodium  atom  only  fills  half  of  the  upper  energy 
level.  In  a  crystal  of  sodium,  the  first  two  filled  shells  of  single 
atoms  correspond  to  the  energy  bands  completely  filled  with 
electrons. 

The  valence  electrons  of  the  sodium  atoms  in  a  crystal  half 
fill  the  levels  of  the  band  of  allowed  energy  values  (horizontally 
hatched  in  Fig.  41.13).  This  is  called  the  conduction  band  because 
the electrons in it participate in producing the conduction current (20.1.2). By the action of the electric field, set up in the crystal by the source of electrical energy, the valence electrons acquire more energy and move to higher, unoccupied energy levels in the conduction band. This initiates their ordered motion and produces a current in the crystal.

Fig. 41.13 (band diagram showing the filled levels, the empty energy levels and the forbidden band)

Thus, if the band is not completely occupied by valence electrons, a solid is always a conductor of electric current.

41.9.3  Hybridization  of  allowed  energy  bands  may  occur  in 
crystals.  A  band  formed  in  splitting  an  upper  excited  level  may 
overlap  a  band  produced  by  the  splitting  of  a  lower  state  of 
valence  electrons.  This  phenomenon  is  observed  in  the  crystals 
of  elements  of  the  second  group  in  Mendeleev’s  periodic  table 
(Be,  Cd,  Mg,  Zn  and  others).  This  forms  a  wider  hybrid  band 
in  which  valence  electrons  are  located,  filling  it  only  partly. 
Hence, a hybrid band is a conduction band, and such crystals are
conductors  of  electric  current. 

41.9.4 The energy bands do not overlap in solid dielectrics, and the band that combines the energy levels of the valence electrons of atoms or ions is completely filled with electrons, whereas all higher bands are entirely empty at T = 0 K. The band completely filled with electrons is called the valence band. The empty bands are conduction bands. An example of a crystalline dielectric is common salt (NaCl). The NaCl molecule has a heteropolar bond (39.4.4), which leads to the formation of Na$^+$ and Cl$^-$ ions with completely filled electron shells. In the NaCl crystal the valence band of the Cl$^-$ ion and all lower bands are completely filled, whereas the Na$^+$ ion band, located above, is entirely empty. The Cl$^-$ and Na$^+$ bands are separated by 6 eV, and the source of electrical energy cannot excite the electrons from the completely filled Cl$^-$ band to the empty Na$^+$ conduction band. This explains the dielectric properties of NaCl.


41.10  Electrical  Conduction  of  Semiconductors 

41.10.1  Semiconductors  include  a  large  class  of  substances 
whose resistivity varies in a wide range from $10^{-6}$ to $10^{8}$ Ω·m,
and  is  reduced  very  rapidly,  according  to  an  exponential  law, 
as  the  temperature  is  raised  (41.10.2).  The  most  typical  and 
extensively  employed  semiconductor  materials  are  the  chemical 
elements  germanium,  silicon  and  tellurium.  In  the  outer  shell 
of  germanium  and  silicon  atoms  there  are  four  valence  electrons, 
which  form  covalent  bonds  (39.4.5)  with  the  valence  electrons 
of  adjacent  atoms.  There  are  no  “free”  valence  electrons  in 
chemically  pure  crystals  of  these  semiconductors.  On  the  basis 
of  the  band  theory  (41.8.2)  crystalline  semiconductors  belong 
to the type of solids in which the valence band (41.9.4) is separated from the empty conduction band (at T = 0 K) by a comparatively narrow energy region $\Delta E_0$ (Fig. 41.14), less than that in dielectric crystals (41.9.4). For silicon $\Delta E_0 = 1.1$ eV; for germanium $\Delta E_0 = 0.72$ eV.

Fig. 41.14 (band diagram showing the conduction band and the valence band)

41.10.2 The electrical conduction of chemically pure semiconductors is called the intrinsic conduction. Electron conduction (n-type conduction) occurs when electrons are transferred from the valence band to the conduction band. This requires an expenditure of energy, not less than the width $\Delta E_0$ of the forbidden band (Fig. 41.14). The quantity $\Delta E_0$ is called the activation energy of intrinsic conduction. As the temperature of the semiconductor is raised, an increased number of electrons move from the valence band to the conduction band and participate in electrical conduction. The conductivity (20.3.4) of semiconductors increases with the temperature T according to the law

$$\gamma = \gamma_0\,e^{-\Delta E_0/2kT},$$

where k is Boltzmann's constant (9.4.5). The resistivity (20.3.4) of semiconductors drastically decreases as the temperature is raised according to the relation

$$\rho = \rho_0\,e^{\Delta E_0/2kT}.$$

This feature of semiconductors essentially differentiates them from metals (20.3.7).
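
To see how steep the exponential law is, the ratio of the conductivities at two temperatures can be computed; the pre-exponential factor cancels if it is treated as independent of temperature, which is an assumption of this sketch. The forbidden band widths are those quoted for germanium and silicon; the temperatures are illustrative.

```python
import math

K_B_EV = 8.617333262e-5  # Boltzmann constant, eV/K


def conductivity_ratio(gap_ev, t1, t2):
    """Ratio of intrinsic conductivities at temperatures t2 and t1.

    Uses conductivity ~ exp(-gap / (2 k T)) with a constant prefactor,
    so the prefactor cancels in the ratio.
    """
    return math.exp(-gap_ev / (2.0 * K_B_EV) * (1.0 / t2 - 1.0 / t1))


ratio_ge = conductivity_ratio(0.72, 300.0, 400.0)  # germanium
ratio_si = conductivity_ratio(1.1, 300.0, 400.0)   # silicon
print(f"Ge: x{ratio_ge:.0f}, Si: x{ratio_si:.0f} on heating from 300 K to 400 K")
```

Heating by 100 K raises the intrinsic conductivity by a factor of tens for germanium and of hundreds for silicon, whereas the resistivity of a metal changes by only tens of percent over the same interval.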

41.10.3.  The  transition  of  an  electron  from  the  valence  band 
of  a  semiconductor  to  the  conduction  band  indicates  that  the 
covalent  bonds  (39.4.5)  of  the  atoms  are  violated  in  the  semi¬ 
conductor  crystal.  A  certain  one  of  the  valence  electrons  of  one 
of  the  atoms  in  the  lattice  leaves  its  place.  The  abandoned  place, 
or  vacancy,  displays  surplus  positive  charge.  Such  a  vacancy 
is  called  a  positive  hole,  and  is  treated  as  a  positive  charge 
carrier.  In  the  band  theory  this  means  that  a  vacant  energy 
level  appears  in  the  valence  band  of  the  crystal.  A  positive 
hole behaves like a positive charge, equal in magnitude to the
charge  of  the  electron.  Another  electron  may  move  to  the  place 
(hole)  abandoned  by  the  previous  electron.  This  is  equivalent 
to  the  movement  of  the  positive  hole;  it  appears  at  a  new  place, 
left  by  the  second  electron.  In  the  external  electric  field,  the 
electrons  throughout  the  mass  of  the  semiconductor  move  in 
the direction opposite to that of the electric field strength (15.1.2). The positive holes move in the direction of the field
strength,  i.e.  in  the  direction  that  a  positive  charge  would 
travel  when  subject  to  an  electric  field. 

The  electrical  conduction  of  a  semiconductor  that  is  due  to  the 
motion  of  holes  is  called  hole,  or  p-type,  conduction. 

41.10.4  The  intrinsic  conduction  of  semiconductors  is  due  to 
the  presence  of  two  kinds  of  current  carriers:  electrons  in  the 
conduction  band  and  holes  in  the  valence  band.  Each  electron 
excited  to  the  conduction  band  corresponds  to  one  hole  in  the 
valence band. The electron concentration $n_e$ and hole concentration $n_h$ are the same and increase rapidly with the temperature T according to the law

$$n_e = n_h = \text{const} \cdot e^{-\Delta E_0/2kT},$$

where $\Delta E_0$ is the activation energy of intrinsic conduction (41.10.2), and k is Boltzmann's constant (9.4.5). This explains
the  nature  of  the  dependence  of  the  electrical  conductivity  and 
resistivity  of  semiconductors  (41.10.2)  on  the  temperature,  which 
strongly  differs  from  the  dependence  of  these  quantities  in 
metals  (20.3.7). 

41.10.5  The  electrical  conduction  of  semiconductors  due  to 
the  presence  in  them  of  impurity  centres  is  called  extrinsic, 
or  impurity,  conduction. 

Impurity  centres  (or  simply  impurities )  may  be  atoms  or  ions 
of  foreign  elements,  and  various  defects  and  distortion  of  the 
crystal  lattice  (vacant  points,  displacements  due  to  deformation 
of  the  crystal,  etc.). 

Impurities  change  the  periodic  electric  field  in  a  solid  and  affect 
the  motion  of  the  electrons  and  their  energy  states.  The  energy 
levels  of  the  valence  electrons  of  impurity  atoms  are  not  located 
in  the  allowed  energy  bands  (41.9.2)  of  the  main,  or  host,  crys¬ 
tal,  and  impurity  energy  levels  (localized  levels)  are  formed  in  the 
forbidden  band. 

41.10.6 Impurities can be a supplementary source of electrons in a solid. For example, in the substitution of one tetravalent
atom  of  germanium  by  an  atom  of  an  impurity  having  five 
valence  electrons  (phosphorus,  antimony  or  arsenic)  one  of  the 
electrons  of  the  impurity  cannot  form  a  covalent  bond  (39.4.5) 
with  the  germanium  atoms  and  is  thereby  “superfluous”.  The 
energy  levels  of  such  electrons  are  below  the  conduction  band 
of the host crystal (Fig. 41.15). Such levels, filled with electrons, are called donor levels, and the atoms of the impurity that supply the “superfluous” electrons are called donor atoms. Only a small amount of energy $\Delta E_e$ is required to excite electrons from the donor levels to the conduction band. This energy is obtained, for instance, from thermal excitation. For silicon $\Delta E_e = 0.054$ eV if the impurity is arsenic. When electrons are transferred from the donor levels to the conduction band, electron
impurity,  or  extrinsic,  conduction  ( n-type  impurity  conduction ) 
occurs.  Such  semiconductors  are  called  electron  impurity  semi¬ 
conductors  ( n-type  semiconductors). 
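
As a rough numerical illustration of why thermal excitation is enough to empty the donor levels, the sketch below compares the donor ionization energy with the thermal energy kT. The temperature of 300 K and the forbidden-band width of 1.1 eV for silicon are assumed values that do not come from the text, and the Boltzmann factor is used only as an order-of-magnitude indicator, not as the actual fraction of ionized donors.

```python
import math

K_B_EV = 8.617333e-5  # Boltzmann constant, eV/K


def boltzmann_factor(delta_e_ev, temperature_k):
    """Return exp(-dE / kT), an order-of-magnitude measure of thermal excitation."""
    return math.exp(-delta_e_ev / (K_B_EV * temperature_k))


T = 300.0             # assumed room temperature, K
DONOR_GAP_EV = 0.054  # arsenic donor level in silicon (value quoted in the text)
BAND_GAP_EV = 1.1     # assumed width of the forbidden band of silicon, eV

print(f"kT at {T:.0f} K      = {K_B_EV * T:.4f} eV")
print(f"donor level factor = {boltzmann_factor(DONOR_GAP_EV, T):.3g}")
print(f"band gap factor    = {boltzmann_factor(BAND_GAP_EV, T):.3g}")
```

The donor factor comes out many orders of magnitude larger than the band-gap factor, which is the sense in which the energy needed to free a donor electron is "small".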

41.10.7 If a tetravalent atom in the lattice of the semiconductor
is replaced by a trivalent impurity atom (boron, aluminium or
indium) there will be a lack of one electron to form saturated
covalent bonds. The lacking electron can be borrowed from a
neighbouring atom of germanium in the lattice, in which a positive
hole is thereby formed (41.10.3). The consecutive filling
of the holes formed in the germanium atoms by electrons is
equivalent to the movement of the holes and leads to conduction
in the semiconductor. The impurity energy levels, not occupied
by electrons, are called trapping, or acceptor, levels. In this
case, the impurity atoms are said to be acceptor atoms. Acceptor
levels are located above the upper edge of the valence band
(41.9.4) of the host crystal (Fig. 41.16). When silicon crystals
are doped with boron (the addition of impurities to a semiconductor
is called doping), the acceptor levels are an amount
$\Delta E_h = 0.08$ eV above the valence band. The transfer of electrons
from the filled valence band to the acceptor level produces
positive holes in this band, and the valence band becomes
a band of hole conduction. This leads to hole impurity conduction
(p-type impurity conduction). Such semiconductors are called
hole impurity semiconductors (p-type semiconductors).

41.10.8 When a semiconductor is doped with both donor and
acceptor impurities at the same time, the nature of the conduction
(n- or p-type) depends upon which impurity produces the
higher concentration of current carriers. The concentration and
energy of the electrons (and holes) in semiconductors depend
very strongly, in contrast to those of metals, on the temperature,
and increase when the temperature is raised.

41.11 Concept of Contact Electrical Phenomena
in Metals and Semiconductors

41.11.1 The work function $\Phi$ in removing an electron from a
metal is the work it is necessary to do for the electron to emerge
from the metal into vacuum. The work $\Phi$ is done against the
attractive forces developed by the surplus positive charge in the
metal resulting from the removal of the electron. In addition, it is
necessary to overcome the repulsion forces of the previously
emerged electrons if they have not been removed and form an
electron “cloud” at the surface of the conductor. The work
function has a value of the order of several eV and depends upon
the kind of metal and the condition of its surface; it is changed
by the presence of dirt, traces of moisture, etc. In the quantum
theory of solids, the work function is reckoned from the upper
electron-filled Fermi level (41.4.2) (Fig. 41.17).

Fig. 41.17

41.11.2 Since the fastest electrons fly out of the metal, a lack
of negative charge is formed in the metallic conductor and
a surplus in the surrounding space. This is evident in a very
thin layer (of a thickness equal to several interatomic distances
in the metal) on both sides of the surface of the conductor. The
surface of the metal is therefore regarded as a double electric
layer (contact layer) similar to an extremely thin capacitor
(17.2.4).

41.11.3 The potential difference $\Delta V$ (10.2.7), characterizing
the electric field of the double layer, is called the surface potential
jump at the metal-vacuum boundary, or the contact potential
difference between a metal and a vacuum. Thus

$$\Delta V = \frac{\Phi}{e},$$

where $\Phi$ is the work function (41.11.1), and $e$ is the magnitude
of the electron charge.

There is no electric field outside the double layer, and the
potential outside the metal and beyond the double layer equals
zero. Inside the metal the potential is positive and equal to $\Delta V$.
The potential energy of the conduction electrons (20.3.1) is
negative and is equal to $-e\,\Delta V = -\Phi$. The whole volume of
the metal is a potential well (38.4.1) for the conduction electrons;
the depth of the well equals the work function $\Phi$.

41.11.4 In the contact of two dissimilar metals, 1 and 2,
having the work functions $\Phi_1$ and $\Phi_2$, electrons pass predominantly
from the metal with the lower work function to that with
the higher one. The metals acquire unlike charges and, in a state
of thermodynamic equilibrium (8.3.3), there is a potential difference
$\Delta V_{12}$ between the two contacting dissimilar metals. It is
called the internal contact potential difference.

The potential difference $\Delta V'_{12}$ between two points located close
to but outside the surfaces of the first and second contacting
metals is called the external contact potential difference.

In the contact of the two metals shown in Fig. 41.17 ($\Phi_1 < \Phi_2$),
the electrons pass predominantly in the direction from metal 1
to metal 2, and the metals become charged positively and negatively,
respectively. At the same time, a shift occurs in the energy
levels of the electrons: upward in metal 2 and downward in
metal 1. In a state of thermodynamic equilibrium, the Fermi
levels (41.4.2) in the two metals coincide (Fig. 41.18).

It is evident from Fig. 41.18 that the external contact potential
difference $\Delta V'_{12}$ depends upon the work functions $\Phi_1$ and $\Phi_2$.
Thus

$$\Delta V'_{12} = -\frac{\Phi_1 - \Phi_2}{e}.$$

The internal contact potential difference depends upon the
Fermi levels in the metals. Thus

$$\Delta V_{12} = \frac{E_{F1} - E_{F2}}{e},$$

where $E_{F1}$ and $E_{F2}$ are the Fermi energies of metals 1 and 2.

The potential changes from $V_1$ to $V_2$ within the limits of the
double electric layer (41.11.2) of thickness $l$ (Fig. 41.19) that
can be approximately assessed by the formula

$$l \approx \sqrt{\frac{2\varepsilon_0\,\Delta V_{12}}{n_0 e}} \quad \text{(in SI units)}.$$

Here $n_0$ is the concentration of charges in the double layer, $\varepsilon_0$
is the electric constant in SI units (14.2.7), and $e$ is the electron
charge.

The thickness of the double layer in contacting metals is of
the order of the lattice constants in metals, and the resistivity
of this layer is the same as that of the rest of the metal. Electric
current passes equally well in both directions through the contact
layer of two metals, and there is no rectifying (barrier-layer)
effect of the contact, or junction, i.e. no unilateral conduction.
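
The direction of electron transfer and the size of the external contact potential difference can be sketched numerically. The example assumes the standard relation $V'_1 - V'_2 = (\Phi_2 - \Phi_1)/e$ for the potentials just outside the two surfaces; the work functions 4.0 eV and 4.5 eV are hypothetical values chosen only for illustration, and the function names are not taken from the text.

```python
def external_contact_potential(phi_1_ev, phi_2_ev):
    """Return V1' - V2' in volts for work functions given in eV.

    Dividing an energy expressed in eV by the elementary charge gives
    volts with the same numerical value, so no constant is needed.
    """
    return -(phi_1_ev - phi_2_ev)


def electron_flow_direction(phi_1_ev, phi_2_ev):
    """Electrons pass predominantly from the lower work function to the higher."""
    if phi_1_ev == phi_2_ev:
        return "no net flow"
    return "1 -> 2" if phi_1_ev < phi_2_ev else "2 -> 1"


PHI_1, PHI_2 = 4.0, 4.5  # hypothetical work functions of metals 1 and 2, eV

print("electron flow:", electron_flow_direction(PHI_1, PHI_2))
print("V1' - V2' =", external_contact_potential(PHI_1, PHI_2), "V")
```

With $\Phi_1 < \Phi_2$ the result is positive, in line with metal 1 losing electrons and becoming positively charged.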

41.11.5 Contact between a metal and an n-type semiconductor
(41.10.6) leads to unilateral conduction of the junction. Assume
that $\Phi_1 > \Phi_n$, where $\Phi_1$ and $\Phi_n$ are the work functions of the
metal and the n-type semiconductor, respectively. The arrangement
of the conduction band of the metal (41.9.2), the completely
filled band of the semiconductor and its donor levels before
contact has been made is shown in Fig. 41.20. After the metal
makes contact with the semiconductor, electrons from the donor
levels of the n-type semiconductor (41.10.6) move into the
metal. The charge in the contact layer (41.11.2) is positive on the
side of the n-type semiconductor and negative on the metal side.
Owing to the low electron concentration in the semiconductor
in comparison to that of the metal ($10^{18}$ cm$^{-3}$ instead of
$10^{22}$ cm$^{-3}$), the thickness of the contact layer at the
metal-semiconductor boundary is found to be approximately $10^4$
times greater than that in the metal.

Fig. 41.20. Metal and semiconductor before making contact.

41.11.6 The small number of current carriers in the contact
layer and its large thickness indicate that the resistivity of
this layer is substantially greater than that of the rest of the
semiconductor. Here the contact layer is called the depletion,
or space-charge, layer (formerly called the barrier, or blocking,
layer). The depletion layer is the basis for rectification of
alternating current at a metal-semiconductor junction. When the
metal is connected to the positive terminal of a source of
electrical energy, and the semiconductor to the negative terminal,
the external electric field is directed from the metal toward the
semiconductor. In this case the electrons are pulled from the
bulk of the semiconductor into the double layer. The thickness
of this layer is reduced and the conduction increases. In this
conducting direction, electric current passes through the
metal-semiconductor junction. When the external electric field is
directed from the semiconductor toward the metal, the electrons
are forced out of the double layer deep into the semiconductor.
This increases the width and resistance of the depletion layer
and in this reverse (nonconducting) direction, the
metal-semiconductor junction blocks the electric current. This
unilateral conduction of the junction means that an alternating
current passing through it is rectified.
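
The rectifying action described above can be sketched with a deliberately idealised junction: a small resistance in the conducting direction and a very large one in the reverse direction. The two resistance values, the sinusoidal voltage and the function names are assumptions made for illustration only; a real junction has a smooth, nonlinear characteristic.

```python
import math


def junction_current(voltage, r_forward=1.0, r_reverse=1.0e6):
    """Current through an idealised junction.

    voltage > 0 means the metal is positive with respect to the n-type
    semiconductor (conducting direction). Both resistances are assumed,
    illustrative values in ohms.
    """
    resistance = r_forward if voltage >= 0 else r_reverse
    return voltage / resistance


def mean_current(amplitude=1.0, samples=1000, **junction):
    """Average current over one period of a sinusoidal applied voltage."""
    total = 0.0
    for k in range(samples):
        v = amplitude * math.sin(2.0 * math.pi * k / samples)
        total += junction_current(v, **junction)
    return total / samples


print("rectifying junction:", mean_current())
print("symmetric contact:  ", mean_current(r_forward=1.0, r_reverse=1.0))
```

For the rectifying junction the mean current is close to amplitude/π (half-wave rectification), whereas a contact that conducts equally in both directions, such as that between two metals, gives a mean current of practically zero.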

41.11.7 The region of contact of two semiconductors having
different, n- and p-type, conduction is called an electron-hole
junction (p-n junction). A double electric layer (41.11.2) is
formed at a p-n junction as a result of electron and hole motion
through the interface. Electrons from the n-type semiconductor
pass into the p-type semiconductor, and the holes pass in the
opposite direction. This forms surplus charges of opposite sign
in region ab of the n-type semiconductor and region bc
of the p-type semiconductor (Fig. 41.21). The double layer of
thickness $l$ sets up a contact electric field with the strength $E_{np}$
and a certain potential difference at the boundaries of the layer.
This field impedes further counter migration of electrons and
holes. At a definite thickness of the p-n junction a state of
equilibrium is set up, corresponding to the equalizing of the
Fermi levels in the two semiconductors. This forms an equilibrium
contact layer, which is a barrier layer (41.11.6), having
higher resistance than that of the remaining regions of the
semiconductors.

41.11.8 When a p-n semiconductor junction is connected into
the external circuit of a source of electrical energy in the way
shown in Fig. 41.22, the external (applied) electric field,
strengthening the field of the contact layer, causes electrons to
migrate to the n-type semiconductor and holes to the p-type
semiconductor at opposite sides of the junction. The thickness of
the barrier layer and its resistance increase. This direction of the
applied electric field is said to be the reverse one. In this direction
practically no current passes through the p-n junction. When the
polarity of the external applied voltage is reversed (Fig. 41.23),
the external electric field with the strength $E_{ext}$ opposes the
field of the contact layer (41.11.7). The counter motion of electrons
and holes, forced by the external field to move from the
bulk of the semiconductors to the region of the p-n junction,
increases the number of mobile current carriers at the junction.
This reduces the thickness and resistance of the contact layer,
and, in this forward direction, current passes through the p-n
junction. The blocking effect of a p-n junction is similar to the
rectification of a two-electrode valve, or tube, called a diode.
Accordingly, a semiconductor device containing one p-n junction
is called a semiconductor diode. Crystal triodes, or transistors,
contain two p-n junctions.

AND ELEMENTARY PARTICLES


CHAPTER 42 STRUCTURE AND BASIC
PROPERTIES OF ATOMIC NUCLEI

42.1  Main  Properties  and  Structure  of  the  Nucleus 

42.1.1  The  nucleus  is  the  central  part  of  an  atom;  practically 
all  of  the  atom’s  mass  and  its  positive  charge  are  concentrated 
in  the  nucleus.  All  atomic  nuclei  are  made  up  of  elementary 
particles  (43.1.1)  called  protons  and  neutrons,  considered  to  be 
two  charge  states  of  a  single  particle  called  the  nucleon.  The 
proton  has  a  positive  electric  charge,  equal  in  magnitude  to  that 
of  the  electron  (14.1.2).  The  neutron  has  no  electric  charge. 

42.1.2 The charge of the nucleus is the quantity $Ze$, where $e$ is
the electron charge, and $Z$ is the atomic number of the chemical
element in Mendeleev’s periodic table (39.3.5) and is equal to
the number of protons in the nucleus. To date, nuclei are
known with $Z$ ranging from $Z = 1$ to $Z = 107$. For all nuclei,
with the exception of ${}^{1}_{1}\mathrm{H}$, ${}^{3}_{2}\mathrm{He}$ and certain other
neutron-deficient nuclei, $N \geq Z$, where $N$ is the number of
neutrons in the nucleus. For light nuclei $N/Z \approx 1$; for the nuclei
of chemical elements at the end of the periodic system, $N/Z \approx 1.6$.

42.1.3 The number of nucleons (42.1.1) in a nucleus, $A = N + Z$,
is called the mass, or nucleon, number. Nucleons (proton
and neutron) are assigned a mass number equal to unity. The
electron has zero mass number.

Nuclei with the same $Z$ value, but different $A$ values are called
isotopes. Nuclei with the same $A$ value, but different $Z$ values
are called isobars. The nucleus of chemical element X is denoted
by the symbol ${}^{A}_{Z}\mathrm{X}$, where X is the symbol of the chemical
element.

About 300 stable isotopes of chemical elements are found in
nature, and there are over 1000 man-made (radioactive)
isotopes.

42.1.4 The size of a nucleus is specified by the nuclear radius $r$,
which has only an arbitrary meaning because the nucleus has
no clearly defined boundary. The empirical formula for the
nuclear radius, $r = r_0 A^{1/3}$, where $r_0 = (1.3 \text{ to } 1.7) \times 10^{-15}$ m,
can be interpreted as the proportionality of the volume of the
nucleus to the number of nucleons it contains.

The density of nuclear matter is of the order of magnitude of
$10^{17}$ kg/m$^3$ and is approximately the same for all nuclei. It
substantially exceeds the density of the most dense of ordinary
substances.
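
The order of magnitude of the nuclear density follows directly from the empirical radius formula $r = r_0 A^{1/3}$ with $r_0$ of the order of $10^{-15}$ m. The sketch below is a minimal check, assuming that each nucleon contributes about one atomic mass unit (1.6605e-27 kg, a value not quoted in this section) and treating the nucleus as a uniform sphere; the function names are illustrative.

```python
import math

R0_M = 1.3e-15            # lower end of the quoted range for r0, m
M_NUCLEON_KG = 1.6605e-27  # assumed mass per nucleon (one atomic mass unit), kg


def nuclear_radius(a, r0=R0_M):
    """Empirical nuclear radius r = r0 * A**(1/3), in metres."""
    return r0 * a ** (1.0 / 3.0)


def nuclear_density(a, r0=R0_M):
    """Mean density of a uniform sphere of A nucleons, in kg/m^3."""
    volume = 4.0 / 3.0 * math.pi * nuclear_radius(a, r0) ** 3
    return a * M_NUCLEON_KG / volume


for a in (16, 56, 238):
    print(f"A = {a:3d}: r = {nuclear_radius(a):.2e} m, "
          f"density = {nuclear_density(a):.2e} kg/m^3")
```

Because the volume is proportional to $A$, the mass number cancels and the density is the same for every nucleus in this model.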

42.1.5 Nucleons in atomic nuclei are fermions (41.2.4) and
their spin equals $\hbar/2$. The nucleus of an atom has intrinsic
angular momentum, called nuclear spin, equal to
$L_N = \sqrt{I(I+1)}\,\hbar$, where $I$ is the inner (total) spin quantum
number (cf. Sect. 26.1.3), $\hbar = h/2\pi$, and $h$ is Planck’s constant
(Appendix II).

The number $I$ may assume any whole or half-integer value:
0, 1/2, 1, 3/2, 2, etc. Nuclei with even mass numbers $A$ have
integer spin (in $\hbar$ units) and comply with Bose-Einstein
statistics (41.2.1). Nuclei with odd mass numbers $A$ have
half-integer spin (in $\hbar$ units) and comply with Fermi-Dirac
statistics (41.2.4).

42.1.6 Nuclear particles have their intrinsic magnetic moments,
which determine the total nuclear magnetic moment $P_{mN}$. The
unit of measurement of nuclear magnetic moments is the nuclear
magneton $\mu_N$, analogous to Bohr’s magneton (26.1.4). Thus

$$\mu_N = \frac{e\hbar}{2m_p} \quad \text{(in SI units)},$$

$$\mu_N = \frac{e\hbar}{2m_p c} \quad \text{(in Gaussian units)},$$

where $e$ is the magnitude of the electron charge, $m_p$ is the mass
of the proton, and $c$ is the electrodynamic constant (Appendix II).
The nuclear magneton is a factor of $m_p/m_e = 1836.15$ times less
than the Bohr magneton. It follows that the magnetic properties
of atoms are determined by those of their electrons.

The following relation exists between nuclear spin $L_N$, measured
in $\hbar$ units, and the magnetic moment of the nucleus, measured
in nuclear magnetons:

$$P_{mN} = \gamma_N L_N,$$

where $\gamma_N$ is the nuclear gyromagnetic ratio (cf. Sect. 26.1.4).
The magnetic moments of the neutron and proton are associated
with their complex internal structure (43.6.2). The neutron
has a negative magnetic moment equal to
$-(1.91314 \pm 0.00005)\,\mu_N$. The spin of the neutron and its
magnetic moment are opposite in direction. The magnetic moment
of the proton is positive and is equal to $(2.79277 \pm 0.00003)\,\mu_N$.
Its direction coincides with the direction of proton spin.

42.1.7 In the general case, the distribution of the electric charge
of the protons in the nucleus is asymmetrical. A measure of the
deviation of this distribution from spherical symmetry is the
electric quadrupole moment of the nucleus $Q$. If the charge density
is assumed the same throughout the nucleus, $Q$ is determined
only by the shape of the nucleus. For an ellipsoid of revolution

$$Q = \frac{2}{5}\,Ze\,(b^2 - a^2),$$

where $b$ is the semiaxis of the ellipsoid along its spin, and $a$
is the semiaxis in the perpendicular direction. For a nucleus
stretched out along its spin, $b > a$ and $Q > 0$. For a nucleus
flattened in this direction, $b < a$ and $Q < 0$. For spherical
distribution of the charge in the nucleus, $b = a$ and $Q = 0$.
This is valid for nuclei with spin equal to 0 or to $(1/2)\,\hbar$.


42.2  Binding  Energy  of  Nuclei.  Mass  Defect 

42.2.1 In nuclei, nucleons are in states that differ essentially
from their free states. With the exception of the nucleus of
ordinary hydrogen, all nuclei contain at least two nucleons, between
which there exists a special nuclear strong interaction (attraction)
that ensures stability of the nucleus notwithstanding
the repulsion of the protons of like charge.

42.2.2 The binding energy of a nucleon in a nucleus is the
physical quantity equal to the work required to remove the nucleon
from the nucleus without imparting kinetic energy to it.

The nuclear binding energy is the amount of work required to
take the nucleus apart into its component nucleons without
imparting any kinetic energy to them. It follows from the law
of energy conservation that in forming a nucleus the same amount
of energy is released as is required in splitting the nucleus
into its component nucleons. The nuclear binding energy is the
difference between the total energy of all the free nucleons that
make up the nucleus and their total energy in the nucleus.

42.2.3 In forming a nucleus, its mass is reduced: the mass of the
nucleus is less than the total mass of its component nucleons.
This reduction in mass in the formation of a nucleus is due to
the release of the binding energy. If $\Delta E_b$ is the amount of energy
released in forming a nucleus, the corresponding mass $\Delta m$
(5.7.6), equal to

$$\Delta m = \frac{\Delta E_b}{c^2},$$

is called the mass defect and characterizes the reduction in the
total mass in forming a nucleus from its component nucleons.
If a nucleus of mass $m_N$ is formed of $Z$ protons with the mass $m_p$
and of $(A - Z)$ neutrons of mass $m_n$, then

$$\Delta m = Z m_p + (A - Z)\,m_n - m_N.$$

Instead of the mass $m_N$ of the nucleus, the quantity $\Delta m$ can be
expressed in terms of the atomic mass $m_a$. Thus

$$\Delta m = Z m_H + (A - Z)\,m_n - m_a,$$

where $m_H$ is the mass of the hydrogen atom.

In practice, in calculating $\Delta m$, the masses of all the particles
and atoms are expressed in atomic mass units (Appendix II).
The mass defect serves as a measure of the nuclear binding
energy. Thus

$$\Delta E_b = \Delta m\,c^2 = [Z m_p + (A - Z)\,m_n - m_N]\,c^2.$$

One atomic mass unit corresponds to one atomic unit of energy,
which is equal to 931.5016 MeV.
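
A short worked example of the mass-defect calculation for the helium nucleus ${}^{4}_{2}\mathrm{He}$ is sketched below, using the form of the formula written in terms of atomic masses. The masses of the hydrogen atom, the neutron and the helium atom are rounded tabulated values supplied here as assumptions; only the conversion factor 931.5016 MeV per atomic mass unit is taken from the text.

```python
U_TO_MEV = 931.5016  # energy equivalent of one atomic mass unit, MeV (from the text)
M_H_U = 1.007825     # mass of the hydrogen atom, u (assumed tabulated value)
M_N_U = 1.008665     # mass of the neutron, u (assumed tabulated value)


def mass_defect_u(z, a, atomic_mass_u):
    """Mass defect Z*m_H + (A - Z)*m_n - m_a, in atomic mass units."""
    return z * M_H_U + (a - z) * M_N_U - atomic_mass_u


def binding_energy_mev(z, a, atomic_mass_u):
    """Nuclear binding energy corresponding to the mass defect, in MeV."""
    return mass_defect_u(z, a, atomic_mass_u) * U_TO_MEV


M_HE4_U = 4.002603   # atomic mass of helium-4, u (assumed tabulated value)

dm = mass_defect_u(2, 4, M_HE4_U)
eb = binding_energy_mev(2, 4, M_HE4_U)
print(f"mass defect     = {dm:.6f} u")
print(f"binding energy  = {eb:.2f} MeV")
print(f"per nucleon     = {eb / 4:.2f} MeV")
```

Using atomic rather than nuclear masses is convenient because the electron masses cancel between the hydrogen atoms and the helium atom (the small electron binding energies are neglected).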

42.2.4 The specific binding energy $\Delta E_b^{sp}$ of the nucleus is the
binding energy per nucleon: $\Delta E_b^{sp} = \Delta E_b/A$. The quantity
$\Delta E_b^{sp}$ equals 8 MeV/nucleon on an average. A curve showing
the dependence of the specific binding energy on the mass number
$A$ and characterizing the different bonding strengths for
the nucleons in the nuclei of various chemical elements, is
given in Fig. 42.1. The nuclei of elements in the middle of the
periodic system (39.3.5) ($28 < A < 138$), i.e. from ${}^{28}_{14}\mathrm{Si}$ to
${}^{138}_{56}\mathrm{Ba}$, have the strongest bonds. In these nuclei $\Delta E_b^{sp}$ is
close to 8.7 MeV/nucleon. As the number of nucleons in the nucleus
increases, the specific binding energy decreases. The atomic
nuclei of chemical elements located at the end of the periodic
system (for instance, the nucleus of uranium) have the specific
binding energy $\Delta E_b^{sp} \approx 7.6$ MeV/nucleon. This explains the
feasibility of releasing energy by the fission of heavy nuclei
(42.9.2). In the region of low mass numbers there are sharp peaks
of specific binding energy. Maxima are typical for nuclei with even
numbers of protons and neutrons (${}^{4}_{2}\mathrm{He}$, ${}^{12}_{6}\mathrm{C}$, ${}^{16}_{8}\mathrm{O}$).
Minima are typical for nuclei with odd numbers of protons and
neutrons (${}^{6}_{3}\mathrm{Li}$, ${}^{10}_{5}\mathrm{B}$, ${}^{14}_{7}\mathrm{N}$).

Fig. 42.1


42.2.5 Data on the binding energy of the nucleus and an application
of the liquid-drop model of the nucleus* enabled certain
laws of nuclear structure to be established.

* The liquid-drop model of the nucleus is discussed in the physics
course of secondary schools.

The atomic nuclear stability criterion is the relation between the
numbers of protons and neutrons in a stable nucleus for the
given isobars (42.1.3) (i.e. $A$ = const). The condition for minimum
energy of the nucleus yields the following relation between
$Z_{st}$ and $A$:

$$Z_{st} = \frac{A}{1.98 + 0.015A^{2/3}}.$$

The stable atomic numbers $Z_{st}$ are the nearest whole numbers
to those obtained by this formula.
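
The stability criterion is easy to evaluate for a few mass numbers. The sketch below assumes the usual form of the relation, $Z_{st} = A/(1.98 + 0.015A^{2/3})$, and rounds to the nearest whole number as the text prescribes; the function name is illustrative. Since the formula is semi-empirical, the result may differ by a unit or so from the actual stable isobar.

```python
def stable_z(a):
    """Nearest whole number to Z_st = A / (1.98 + 0.015 * A**(2/3))."""
    return round(a / (1.98 + 0.015 * a ** (2.0 / 3.0)))


for a in (4, 16, 56, 120, 238):
    z = stable_z(a)
    print(f"A = {a:3d}: Z_st = {z:2d}, N/Z = {(a - z) / z:.2f}")
```

For light nuclei the result is close to $Z = A/2$, while for heavy nuclei the ratio $N/Z$ grows to about 1.5 to 1.6, reflecting the increasing share of neutrons needed to offset the Coulomb repulsion of the protons.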

At low and medium values of $A$ there are approximately equal
numbers of protons and neutrons in stable nuclei: $N \approx Z$ or
$A \approx 2Z$. Especially stable are nuclei (when $Z = A/2$) for
which $N$ or $Z$ = 2, 8, 20, 28, 50, 82 or 126. These have been
called magic numbers. Nuclei in which both $N$ and $Z$ are magic
numbers are said to be double-magic.

As $Z$ increases, the Coulomb forces of repulsion between the
protons increase in proportion to $Z(Z - 1) \approx Z^2$ (pairwise
proton interaction), and, to compensate for this repulsion
by nuclear attraction, the number of neutrons must increase
more rapidly than the number of protons.


42.3  Nuclear  Forces 

42.3.1 Nuclear interaction indicates that special forces exist
in nuclei that do not belong to any of the types of forces known
in classical physics (e.g. gravitational and electromagnetic
forces).

42.3.2 Nuclear forces are of short-range action. They only act
at the extremely short distances, of the order of $10^{-15}$ m, between
nucleons in a nucleus. This distance, ranging from 1.5 to
$2.2 \times 10^{-15}$ m, is called the action radius, or range, of the nuclear
forces.

42.3.3 Nuclear forces display charge independence: the attraction
between any two nucleons is the same and is independent of
the charge state of the nucleons, regardless of whether they
are protons or neutrons (42.1.1). The charge independence of
nuclear forces follows from a comparison of the binding energy
in mirror nuclei. These are nuclei having the same total number
of nucleons, but the number of protons in one is equal to the
number of neutrons in the other. The nuclei of helium ${}^{3}_{2}\mathrm{He}$ and
tritium ${}^{3}_{1}\mathrm{T}$ (superheavy hydrogen) can be taken as an example.
The binding energies (42.2.4) in these nuclei are 7.72 MeV and
8.49 MeV, respectively. Assuming charge independence of nuclear
forces, the difference in the nuclear binding energies, equal
to 0.77 MeV, is associated with the energy of Coulomb repulsion
between two protons in the nucleus of ${}^{3}_{2}\mathrm{He}$. Assuming this
quantity equal to $e^2/4\pi\varepsilon_0 r$ (16.2.6), the average distance $r$ between
the protons in the ${}^{3}_{2}\mathrm{He}$ nucleus is found to equal $1.9 \times 10^{-15}$ m.
This is in agreement with the action radius of nuclear forces
(42.3.2).
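
The estimate of the proton separation can be reproduced in a few lines. The sketch below sets the Coulomb energy $e^2/4\pi\varepsilon_0 r$ equal to the 0.77 MeV difference in binding energies and solves for $r$; the values of the elementary charge and the electric constant are standard constants supplied here, and the function name is illustrative.

```python
import math

E_CHARGE = 1.602176634e-19   # elementary charge, C
EPSILON_0 = 8.8541878128e-12  # electric constant, F/m


def proton_separation_m(coulomb_energy_mev):
    """Distance r at which e^2 / (4*pi*eps0*r) equals the given energy."""
    energy_j = coulomb_energy_mev * 1.0e6 * E_CHARGE  # MeV -> J
    return E_CHARGE ** 2 / (4.0 * math.pi * EPSILON_0 * energy_j)


DELTA_E_MEV = 8.49 - 7.72  # difference of the binding energies quoted in the text

r = proton_separation_m(DELTA_E_MEV)
print(f"r = {r:.2e} m")
```

The result, about $1.9 \times 10^{-15}$ m, falls inside the range of 1.5 to 2.2 in units of $10^{-15}$ m quoted for the action radius of nuclear forces.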

42.3.4 Nuclear forces have the property of saturation. This
is manifested in the fact that a nucleon in the nucleus interacts
with only a limited number of the closest nucleons among its
neighbours. Besides the short-range action of nuclear forces
(42.3.1), this saturation also follows from the linear dependence
of the binding energy in nuclei on the mass number $A$ (42.1.3).
If each nucleon in a nucleus interacted simultaneously with
all $(A - 1)$ nucleons in the nucleus, the binding energy of the
nucleus would be proportional to the possible number of
interacting pairs of nucleons in the nucleus, i.e. the number of
combinations of $A$ taken two at a time: $A(A - 1)/2$*. The
dependence, in this case, of the binding energy on $A$ would
be quadratic, rather than linear, and this is in contradiction with
the experimental data. Practically total saturation of nuclear
forces is reached for the alpha particle (42.4.2), which is a stable
structure.

* Assuming, of course, pairwise nuclear interaction.

42.3.5 Nuclear forces depend upon the orientation of the spins
of the interacting nucleons. This is confirmed by the different
kinds of scattering of neutrons by the molecules of ortho- and
parahydrogen. In a molecule of orthohydrogen, the spins of
the two protons are parallel to each other; in a molecule of
parahydrogen they are antiparallel. If the interaction of a neutron
with a proton were independent of the mutual orientation of
their spins, the scattering of neutrons by molecules of
orthohydrogen and parahydrogen would be the same, even though
in the case of parahydrogen the spin of the neutron is parallel
to that of one of the protons, and in the case of orthohydrogen
the spin of the neutron does not necessarily coincide with
the spins of the protons. Experiments showed that the scattering
of neutrons by parahydrogen is 30 times more than the scattering
by orthohydrogen. This is a clear indication of the dependence
of nuclear forces on the orientations of the spins of the
interacting nucleons.

Nuclear forces are not central forces (3.3.4). They cannot be
represented as ones acting along lines joining the centres of the
interacting nucleons.

42.3.6 No rigorous theory exists that can comprehensively
explain the nature and laws of nuclear forces. One of the simplest
theories is the exchange interpretation of nuclear forces, which
are also called exchange forces, or forces arising in an exchange
interaction. This theory is based on the idea that interaction
between two particles is accomplished by the exchange of a third
particle. The exchange theory of nuclear forces assumes that the
interaction between the nucleons of a nucleus is due to the
exchange of special particles called pions (43.3.4) and having the
rest mass $m_\pi$. The time $\Delta t$ during which nucleons exchange pions
(i.e. the time interval from the emission of a pion by one nucleon
to its absorption by another), should not, according to the
Heisenberg relation (38.6.5), exceed $\Delta t \approx \hbar/\Delta E$, where $\Delta E$
is the indeterminacy in the value of the nucleon energy. The
quantity $\Delta E$ is related to the rest energy $E_\pi$ of the new particle
created in the region of the action radius $R$ of the nuclear forces
(42.3.2). Assuming that $\Delta E = E_\pi = m_\pi c^2$, $\Delta t \approx \hbar/m_\pi c^2$. If
it is assumed that the pion leaves the nucleon at the velocity
$v_\pi \approx c$, it travels, during the time $\Delta t$, the distance

$$R_\pi = c\,\Delta t = c\,\frac{\hbar}{m_\pi c^2} = \frac{\hbar}{m_\pi c}.$$

It is quite natural to assume further that the pion travels a
distance equal to $R$ during the time $\Delta t$, i.e.
$R_\pi = R \approx 1.5 \times 10^{-15}$ m (42.3.2). Consequently, the mass of the
pion is $m_\pi \approx \hbar/R_\pi c \approx 250\,m_e$, where $m_e$ is the rest mass of
the electron. This agrees well with experimental data on the
masses of pions (see Table 43.1).
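
The estimate of the pion mass can be checked numerically. The sketch below assumes that the constant in the relation $m_\pi \approx \hbar/R_\pi c$ is the reduced Planck constant $\hbar$, and uses standard values of $\hbar$, $c$ and the electron mass that are supplied here rather than quoted in this section; the function name is illustrative.

```python
HBAR = 1.054571817e-34      # reduced Planck constant, J*s
C_LIGHT = 2.99792458e8      # speed of light, m/s
M_ELECTRON = 9.1093837e-31  # electron rest mass, kg


def exchange_particle_mass_kg(range_m):
    """Mass of the exchanged particle, m = hbar / (R * c), for a force of range R."""
    return HBAR / (range_m * C_LIGHT)


R_NUCLEAR = 1.5e-15  # action radius of nuclear forces used in the text, m

m_pi = exchange_particle_mass_kg(R_NUCLEAR)
print(f"m_pi = {m_pi:.3e} kg = {m_pi / M_ELECTRON:.0f} electron masses")
```

The result is roughly 250 to 260 electron masses, consistent with the order-of-magnitude value given in the text. Taking the upper end of the quoted range, $R = 2.2 \times 10^{-15}$ m, lowers the estimate to about 175 electron masses, which shows how sensitive it is to the assumed range.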


42.4  Radioactivity 

42.4.1 Radioactivity is the transformation of unstable isotopes
(42.1.3) of one chemical element into isotopes of another element,
accompanied by the emission of certain particles. Radioactivity is
also the mutual conversion of certain elementary particles into other
ones (for instance, the conversion of a neutron into a proton with the
creation of an electron and an electron antineutrino) (42.6.1).

TABLE 42.1

Radioactivity | Change in nuclear charge $Z$ | Change in mass number $A$ | Essence of the process
Alpha decay | $Z - 2$ | $A - 4$ | Emission of an alpha particle, which is a system of two protons and two neutrons bound together.
Beta decay | $Z \pm 1$ | $A$ | Mutual conversions in the nucleus of a neutron (${}^1_0\mathrm{n}$) into a proton (${}^1_1\mathrm{p}$) and vice versa.
$\beta^-$ decay | $Z + 1$ | $A$ | ${}^1_0\mathrm{n} \to {}^1_1\mathrm{p} + ({}^{0}_{-1}\mathrm{e} + {}^0_0\tilde{\nu}_e)$
$\beta^+$ decay | $Z - 1$ | $A$ | ${}^1_1\mathrm{p} \to {}^1_0\mathrm{n} + ({}^{0}_{+1}\mathrm{e} + {}^0_0\nu_e)$
Electron capture ($e$ or $K$ capture) | $Z - 1$ | $A$ | ${}^1_1\mathrm{p} + {}^{0}_{-1}\mathrm{e} \to {}^1_0\mathrm{n} + ({}^0_0\nu_e)$
Spontaneous fission | $Z - (1/2)Z$ | $A - (1/2)A$ | Fission of the nucleus into two fragments of approximately equal mass and charge.
Proton radioactivity | $Z - 1$ | $A - 1$ | Emission of a proton from the nucleus.
Double proton radioactivity | $Z - 2$ | $A - 2$ | Simultaneous emission of two protons from the nucleus.

Note: ${}^0_0\nu_e$ and ${}^0_0\tilde{\nu}_e$ are the electron neutrino
and antineutrino. Particles emitted from the nucleus are enclosed in
parentheses.

Natural radioactivity is that observed in the unstable isotopes
that exist in nature.

Artificial, or induced, radioactivity is that acquired by isotopes
as a result of nuclear reactions (42.9.1).

42.4.2  The  main  kinds  of  radioactivity  are  listed  in  Table  42.1. 

42.4.3 As a rule, all kinds of radioactivity are accompanied by the
emission of gamma rays, which are hard, short-wave electromagnetic
radiation. The emission of gamma rays is the main way in which the
excited products of radioactive transformations lose their energy.
A nucleus subject to radioactive decay is called a parent nucleus;
the nucleus that results from the decay is called a daughter, or
product, nucleus and, as a rule, is formed in an excited state. Its
transition to the ground state is accompanied by the emission of a
$\gamma$ photon.

42.4.4 Spontaneous decay of atomic nuclei obeys the radioactive
decay law:

$$N = N_0 e^{-\lambda t},$$

where $N_0$ is the number of nuclei in the given volume at the
initial instant of time $t = 0$, $N$ is the number of nuclei in the
same volume at the instant of time $t$, and $\lambda$ is the decay
constant, which is the probability that a nucleus will decay in one
second and is equal to the fraction of the nuclei that decays in
unit time.

The law of spontaneous radioactive decay is based on two assumptions:
(1) the decay constant is independent of external conditions, and
(2) the number of nuclei that decay during the time $dt$ is
proportional to the number of nuclei present. These assumptions
signify that radioactive decay is a statistical process and the decay
of any given nucleus is a chance event, having a definite
probability. The quantity $1/\lambda$ is the average life (or
lifetime) of a radioactive isotope. Indeed, the $|dN| = \lambda N\,dt$
nuclei that decay between $t$ and $t + dt$ have each lived for the
time $t$, so that their total lifetime is
$t\,|dN| = t \lambda N\,dt$. The average life $\tau$ of all initially
existing nuclei is

$$\tau = \frac{1}{N_0} \int_0^\infty t \lambda N\,dt = \lambda \int_0^\infty t e^{-\lambda t}\,dt = \frac{1}{\lambda}.$$

42.4.5 A characteristic of nuclear stability against decay is the
radioactive half-life (half-value period) $T_{1/2}$. This is the
length of time after which half of the initial number of nuclei of
the given radioactive substance has decayed. The relation between
$\lambda$ and $T_{1/2}$ is

$$T_{1/2} = \frac{\ln 2}{\lambda} = \frac{0.693}{\lambda} = 0.693\,\tau.$$
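
The decay law, the average life and the half-life are tied together by
simple arithmetic, which the following minimal sketch illustrates. The
function names and the numerical values (a half-life of 10 time units
and 1000 initial nuclei) are illustrative assumptions, not data from
the handbook.

```python
import math


def decay_constant(half_life: float) -> float:
    """Decay constant lambda from the half-life: lambda = ln 2 / T_1/2."""
    return math.log(2) / half_life


def remaining_nuclei(n0: float, lam: float, t: float) -> float:
    """Radioactive decay law N = N0 * exp(-lambda * t)."""
    return n0 * math.exp(-lam * t)


def mean_life(lam: float) -> float:
    """Average life tau = 1 / lambda."""
    return 1.0 / lam


half_life = 10.0                      # arbitrary time units (assumed)
lam = decay_constant(half_life)
print(remaining_nuclei(1000.0, lam, half_life))        # half of the nuclei remain
print(remaining_nuclei(1000.0, lam, mean_life(lam)))   # N0 / e remain after tau
print(mean_life(lam) / half_life)                      # tau / T_1/2 = 1 / ln 2
```

After one half-life half of the nuclei remain, whereas after one
average life the remaining fraction is 1/e, which is why $\tau$ is
longer than $T_{1/2}$ by the factor 1/0.693.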


42.4.6 Natural radioactivity (42.4.1) is observed for the nuclei of
atoms of chemical elements that are located beyond lead in
Mendeleev's periodic system (39.3.5). Natural radioactivity of light
and medium-weight nuclei is observed only for the nuclei
${}^{40}_{19}\mathrm{K}$, ${}^{87}_{37}\mathrm{Rb}$,
${}^{115}_{49}\mathrm{In}$, ${}^{138}_{57}\mathrm{La}$,
${}^{147}_{62}\mathrm{Sm}$, ${}^{176}_{71}\mathrm{Lu}$ and
${}^{187}_{75}\mathrm{Re}$.

The law of the conservation of electric charge (14.1.3) in nuclear
radioactive decay is

$$Z_N e = \sum_i Z_i e,$$

where $Z_N e$ is the charge of the parent nucleus (42.4.3), and
$Z_i e$ is the charge of the nuclei and particles created as a result
of the radioactive decay. This law is also applicable in
investigating all nuclear reactions (42.9.1). The rule for the
conservation of the mass numbers (42.1.3) in natural radioactive
phenomena is

$$A_N = \sum_i A_i,$$

where $A_N$ is the mass number of the parent nucleus, and $A_i$ is
the mass number of the nuclei or particles obtained as a result of
radioactive decay.

42.4.7 Soddy's displacement (Fajans-Soddy) law for radioactive
$\alpha$ and $\beta^-$ decays:

$${}^{A}_{Z}\mathrm{X} \to {}^{A-4}_{Z-2}\mathrm{Y} + {}^{4}_{2}\mathrm{He} \quad \text{for } \alpha \text{ decay},$$
$${}^{A}_{Z}\mathrm{X} \to {}^{A}_{Z+1}\mathrm{Y} + {}^{0}_{-1}\mathrm{e} \quad \text{for } \beta^- \text{ decay}.$$

Here ${}^{A}_{Z}\mathrm{X}$ is the parent nucleus, Y is the symbol for
the daughter nucleus, ${}^{4}_{2}\mathrm{He}$ is a helium nucleus, and
${}^{0}_{-1}\mathrm{e}$ is the symbol designating an electron, for
which $A = 0$ and $Z = -1$.

If the daughter nucleus is also radioactive, a radioactive series, or
chain, of transformations is formed. Natural radioactive nuclei
constitute three radioactive families, called the uranium series
(${}^{238}_{92}\mathrm{U}$), the thorium series
(${}^{232}_{90}\mathrm{Th}$) and the actinium series
(${}^{235}_{92}\mathrm{U}$, historically called actinouranium). Each
is named after the parent isotope of the series, which has the
longest half-life (42.4.5). After undergoing a sequence of $\alpha$
and $\beta^-$ decays, all the families terminate in one of the stable
isotopes of lead: ${}^{206}_{82}\mathrm{Pb}$,
${}^{208}_{82}\mathrm{Pb}$ and ${}^{207}_{82}\mathrm{Pb}$,
respectively. The neptunium series, beginning with the transuranium
element neptunium ${}^{237}_{93}\mathrm{Np}$, is obtained artificially
(neptunium being a man-made element) and terminates in bismuth
${}^{209}_{83}\mathrm{Bi}$.
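
The displacement law lends itself to a short bookkeeping sketch. The
code below is an illustration only: the names Nucleus, alpha_decay,
beta_minus_decay and follow_chain are hypothetical, and the decays are
applied in an arbitrary order rather than in the order in which they
actually occur in the natural series (the final nucleus depends only
on the numbers of $\alpha$ and $\beta^-$ decays).

```python
from typing import NamedTuple


class Nucleus(NamedTuple):
    Z: int  # charge number
    A: int  # mass number


def alpha_decay(n: Nucleus) -> Nucleus:
    """Alpha decay: Z decreases by 2 and A by 4."""
    return Nucleus(n.Z - 2, n.A - 4)


def beta_minus_decay(n: Nucleus) -> Nucleus:
    """Beta-minus decay: Z increases by 1, A is unchanged."""
    return Nucleus(n.Z + 1, n.A)


def follow_chain(parent: Nucleus, decays: str) -> Nucleus:
    """Apply a sequence of decays written as 'a' (alpha) and 'b' (beta-minus)."""
    steps = {"a": alpha_decay, "b": beta_minus_decay}
    nucleus = parent
    for kind in decays:
        nucleus = steps[kind](nucleus)
    return nucleus


uranium_238 = Nucleus(Z=92, A=238)
thorium_232 = Nucleus(Z=90, A=232)
print(follow_chain(uranium_238, "abb"))               # Nucleus(Z=92, A=234)
print(follow_chain(uranium_238, "a" * 8 + "b" * 6))   # Nucleus(Z=82, A=206)
print(follow_chain(thorium_232, "a" * 6 + "b" * 4))   # Nucleus(Z=82, A=208)
```

Eight $\alpha$ and six $\beta^-$ decays carry uranium-238 to lead-206,
and six $\alpha$ and four $\beta^-$ decays carry thorium-232 to
lead-208, in agreement with the end products named above.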

42.4.8 If there is a whole series of radioactive decays and, during
the time $dt$, $\lambda_p N_p\,dt$ nuclei decay from the total number
$N_p$ of parent nuclei, whereas $\lambda_d N_d\,dt$ daughter nuclei
decay during the same time, the change $dN_d/dt$ in unit time in the
number of nuclei of the daughter substance is expressed by the
relation

$$\frac{dN_d}{dt} = \lambda_p N_p - \lambda_d N_d.$$

In the case of dynamic equilibrium between the parent and daughter
substances, $dN_d/dt = 0$ and the conditions of radioactive
equilibrium are complied with. Thus

$$\lambda_p N_p = \lambda_d N_d,$$

from which

$$\frac{N_p}{N_d} = \frac{\lambda_d}{\lambda_p} = \frac{T_p}{T_d},$$

where $T_p$ and $T_d$ are the half-lives (42.4.5) of the parent and
daughter nuclei. The product $A = \lambda N$ is called the activity
of the given radioactive substance and represents the number of
decays in unit time. Activity referred to unit mass of the substance
is called specific activity. Activity is measured in decays per
second (s$^{-1}$); this unit is called the becquerel (Bq).
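
A numerical illustration of activity and of radioactive equilibrium
may be helpful. The sketch below uses radium-226 and radon-222 as an
example; the half-lives (about 1600 years and 3.82 days), the molar
mass of about 226 g/mol and the function names are assumptions
introduced here, not values taken from the handbook.

```python
import math

AVOGADRO = 6.02214076e23            # nuclei per mole
YEAR_S = 365.25 * 24 * 3600         # seconds in a year
DAY_S = 24 * 3600                   # seconds in a day


def activity_bq(n_nuclei: float, half_life_s: float) -> float:
    """Activity A = lambda * N in becquerels (decays per second)."""
    return math.log(2) / half_life_s * n_nuclei


def equilibrium_daughter_fraction(t_parent_s: float, t_daughter_s: float) -> float:
    """N_d / N_p = T_d / T_p under radioactive equilibrium."""
    return t_daughter_s / t_parent_s


T_RA226 = 1600 * YEAR_S             # assumed half-life of Ra-226
T_RN222 = 3.82 * DAY_S              # assumed half-life of Rn-222

n_radium = AVOGADRO / 226.0         # nuclei in 1 g of Ra-226 (approximate)
print(f"activity of 1 g of Ra-226: {activity_bq(n_radium, T_RA226):.2e} Bq")
print(f"N_Rn / N_Ra at equilibrium: "
      f"{equilibrium_daughter_fraction(T_RA226, T_RN222):.2e}")
```

Under equilibrium the activities of parent and daughter are equal, so
the short-lived daughter is present only in a correspondingly small
amount.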

42.5  Alpha  Decay 

42.5.1 Alpha decay is the emission of alpha particles by the nuclei
of certain chemical elements. Alpha decay (42.5.2) is a property of
heavy nuclei having mass numbers $A > 200$ and charge numbers
$Z > 82$. Separate alpha particles, each consisting of two protons
and two neutrons, are formed inside such nuclei. This is facilitated
by the saturation of nuclear forces (42.3.4). The alpha particles so
formed are subject to more Coulomb repulsion from the protons of the
nucleus than single protons are. At the same time, an alpha particle
is subject to less nuclear attraction to the nucleons in the nucleus
than single nucleons are.

42.5.2 The nucleus presents a potential barrier (38.7.1) to alpha
particles; its height $U$ is greater than $E$, the energy of the
alpha particle in the nucleus. Alpha decay occurs by the penetration
of alpha particles through the potential barrier by means of the
tunnel effect (38.7.2). The formula for the penetrability $D$ of the
potential barrier indicates that even negligible changes in the
energy of an alpha particle in the nucleus lead to a drastic change
of the quantity

$$\exp\left\{-\frac{2}{\hbar}\int_{x_1}^{x_2}\sqrt{2m_\alpha[U(x) - E]}\,dx\right\}.$$

This explains the great differences in the half-lives of alpha
emitters, from $10^{9}$ years to $10^{-7}$ s, at a comparatively
small increase in the energy of the alpha particles, from 4 to 9 MeV.

42.5.3 The decay constant $\lambda$ (42.4.4) is related to the
penetrability $D$ of the potential barrier for alpha particles. For
a simplified model of a potential barrier of length $L$ (38.7.2)

$$\lambda = Dn,$$

where $n$ is the number of collisions of the alpha particle with the
barrier wall in unit time, equal to $n = v/2L$, and
$v = \sqrt{2E/m_\alpha}$ is the velocity of the alpha particle in the
nucleus. The quantity $L$ is commonly taken equal to the radius $R$
of the nucleus ($2R$ is the width of the potential barrier).

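The simplified formula for the alpha-radioactive decay constant is

$$\lambda = \frac{1}{2R}\sqrt{\frac{2E}{m_\alpha}}\,\exp\left[-\frac{4}{\hbar}\sqrt{2m_\alpha(U_0 - E)}\,R\right],$$

where $U_0$ is the height of the barrier.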
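
The sensitivity of the decay constant to the energy of the alpha
particle can be explored with a short sketch. It assumes a rectangular
barrier of height $U_0$ and width $2R$, so that
$D = \exp[-(2/\hbar)\sqrt{2m_\alpha(U_0 - E)}\cdot 2R]$ and
$n = v/2R$; the values $U_0 = 25$ MeV and $R = 8$ fm are arbitrary
illustrative choices, not data from the handbook, and the function
names are hypothetical.

```python
import math

HBAR_C = 197.327            # hbar * c, MeV * fm
M_ALPHA_C2 = 3727.38        # rest energy of the alpha particle, MeV
C_FM_PER_S = 2.99792458e23  # speed of light, fm/s


def penetrability(E: float, U0: float, R: float) -> float:
    """Penetrability D of a rectangular barrier of height U0 (MeV), width 2R (fm)."""
    kappa = math.sqrt(2.0 * M_ALPHA_C2 * (U0 - E)) / HBAR_C   # fm^-1
    return math.exp(-2.0 * kappa * (2.0 * R))


def decay_constant(E: float, U0: float, R: float) -> float:
    """lambda = D * n with n = v / 2R (non-relativistic v), in s^-1."""
    v = C_FM_PER_S * math.sqrt(2.0 * E / M_ALPHA_C2)          # fm/s
    n = v / (2.0 * R)                                         # collisions per second
    return n * penetrability(E, U0, R)


U0, R = 25.0, 8.0           # assumed barrier height (MeV) and nuclear radius (fm)
for E in (4.0, 5.0, 6.0):
    print(f"E = {E} MeV: lambda = {decay_constant(E, U0, R):.3e} 1/s")
```

Even in this crude model an increase of the energy by 1 MeV changes
the decay constant several times over, because the energy enters
through the exponent.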

42.5.4 The Geiger-Nuttall rule, or relation, states that the larger
the decay constant $\lambda$ of a radioactive element, the longer the
range $R_\alpha$ (path length) of the alpha particles it emits in
air. Thus

$$\ln \lambda = A + B \ln R_\alpha,$$

where $A$ and $B$ are empirical constants, having different values
for the various radioactive families (42.4.7).

42.5.5 On the basis of the relations given in Sects. 42.5.3 and
42.5.4, it has been shown that one and the same alpha-radioactive
element emits several groups of alpha particles with different ranges
(path lengths). Constancy of the ranges is observed within each
group. It follows that the alpha particles emitted by the nuclei have
a definite energy spectrum and, consequently, the atomic nuclei have
discrete energy levels.


42.6  Beta  Decay 

42.6.1 The concept of beta decay unites three kinds of nuclear
transformations: electron decay ($\beta^-$), positron decay
($\beta^+$) and electron capture ($e$ or $K$ capture). The first two
kinds of transformations consist in the emission by the nucleus of an
electron (or positron) and an electron antineutrino (or electron
neutrino). These processes involve the transformation of one kind of
nucleon in the nucleus into another: a neutron into a proton or a
proton into a neutron. The transformations occur as follows:

$${}^1_0\mathrm{n} \to {}^1_1\mathrm{p} + {}^{0}_{-1}\mathrm{e} + {}^0_0\tilde{\nu}_e \quad (\beta^- \text{ decay}),$$
$${}^1_1\mathrm{p} \to {}^1_0\mathrm{n} + {}^{0}_{+1}\mathrm{e} + {}^0_0\nu_e \quad (\beta^+ \text{ decay}).$$

Here ${}^1_0\mathrm{n}$ and ${}^1_1\mathrm{p}$ are the symbols for the
neutron and proton, ${}^{0}_{-1}\mathrm{e}$ and ${}^{0}_{+1}\mathrm{e}$
are the symbols for the electron and positron, and ${}^0_0\nu_e$ and
${}^0_0\tilde{\nu}_e$ are the symbols for the electron neutrino and
antineutrino (42.4.2).

In the case of $e$ capture, the proton is transformed into a neutron
as follows:

$${}^1_1\mathrm{p} + {}^{0}_{-1}\mathrm{e} \to {}^1_0\mathrm{n} + {}^0_0\nu_e.$$

This process consists in the disappearance of one of the electrons
from the $K$ shell of the atom, the shell closest to the nucleus. The
proton, in being transformed into a neutron, seems to capture an
electron. This gave rise to the term electron capture (or $e$
capture). A feature of this kind of beta decay is the emission from
the nucleus of only a single particle, the neutrino ${}^0_0\nu_e$. An
example of $e$ capture is the transformation of the radioactive
nucleus of beryllium ${}^{7}_{4}\mathrm{Be}$ into a stable nucleus of
lithium ${}^{7}_{3}\mathrm{Li}$. In contrast to $\beta^\pm$ decay,
electron capture is accompanied by characteristic X radiation,
belonging to the $K$ series of the corresponding element (39.7.9).

42.6.2 $\beta^-$ decay occurs in both natural and artificial
radioactive nuclei; $\beta^+$ decay is typical only of artificial
radioactivity, i.e. of radioactivity induced in nuclei by the action
of alpha particles, neutrons or other particles. Such action violates
the conditions of stability of the atomic nucleus (42.2.5). For
example, the artificially radioactive isotope of carbon
${}^{14}_{6}\mathrm{C}$ is obtained from the stable nucleus of
nitrogen by the action of neutrons and with the emission of protons:

$${}^{14}_{7}\mathrm{N} + {}^1_0\mathrm{n} \to {}^{14}_{6}\mathrm{C} + {}^1_1\mathrm{p}$$

and, after undergoing beta decay, is again transformed into the
stable isotope ${}^{14}_{7}\mathrm{N}$. Thus

$${}^{14}_{6}\mathrm{C} \to {}^{14}_{7}\mathrm{N} + {}^{0}_{-1}\mathrm{e} + {}^0_0\tilde{\nu}_e.$$

Violation of the stability conditions (42.2.5) by introducing surplus
protons into the nucleus leads to artificial $\beta^+$ decay. This is
evident from the following example:

$${}^{10}_{5}\mathrm{B} + {}^{4}_{2}\mathrm{He} \to {}^{14}_{7}\mathrm{N} \to {}^{13}_{7}\mathrm{N} + {}^1_0\mathrm{n},$$

$${}^{13}_{7}\mathrm{N} \to {}^{13}_{6}\mathrm{C} + {}^{0}_{+1}\mathrm{e} + {}^0_0\nu_e,$$

where ${}^{0}_{+1}\mathrm{e}$ is a positron and ${}^0_0\nu_e$ is an
electron neutrino.

42.6.3 In natural $\beta^-$ decay a neutron ${}^1_0\mathrm{n}$ is
spontaneously transformed into a proton. The rest energy (5.7.3) of
the neutron exceeds that of a hydrogen atom (i.e. the proton and
electron taken together) by 782 keV. Hence, a transformation of the
type (42.6.1)

$${}^1_0\mathrm{n} \to {}^1_1\mathrm{p} + {}^{0}_{-1}\mathrm{e} + {}^0_0\tilde{\nu}_e$$

is feasible with respect to energy outside the nucleus as well. The
radioactive decay of free neutrons has been observed in
high-intensity streams of neutrons released in nuclear reactors. This
decay takes place with a half-life (42.4.5) of
$(1.01 \pm 0.03) \times 10^{3}$ s. In heavy nuclei, overloaded with
neutrons, such transformations lead to natural $\beta^-$
radioactivity.

Transformations (42.6.1) of the type
${}^1_1\mathrm{p} \to {}^1_0\mathrm{n} + {}^{0}_{+1}\mathrm{e} + {}^0_0\nu_e$
are only possible in nuclei, where the required energy is borrowed
from neighbouring particles. This transformation leads to artificial
$\beta^+$ decay.
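
The energy balance of the two transformations can be verified from the
rest energies of the particles. The sketch below assumes rounded
rest-energy values and neglects the neutrino mass and the binding
energy of the hydrogen atom; the variable names are illustrative.

```python
# Rest energies in MeV (rounded values, assumed for this illustration).
M_NEUTRON = 939.56542
M_PROTON = 938.27209
M_ELECTRON = 0.51099895

# n -> p + e- + antineutrino: energy released, in keV
q_beta_minus_kev = (M_NEUTRON - M_PROTON - M_ELECTRON) * 1000.0

# p -> n + e+ + neutrino for a free proton: energy balance, in keV
q_beta_plus_kev = (M_PROTON - M_NEUTRON - M_ELECTRON) * 1000.0

print(f"free-neutron decay releases about {q_beta_minus_kev:.0f} keV")
print(f"free-proton conversion would require about {-q_beta_plus_kev:.0f} keV")
```

The first balance is positive and reproduces the figure of 782 keV;
the second is negative, which is why the conversion of a proton into a
neutron can take place only inside a nucleus.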

42.6.4 The half-lives of beta decays (42.4.5) vary for various
sources of $\beta^\pm$ radioactive radiation over an extensive time
range from $10^{-2}$ s to $10^{12}$ years, which is incommensurably
long compared to nuclear time ($10^{-22}$ to $10^{-23}$ s) (43.2.6).
This indicates that beta decay is due to weak interaction (43.2.8).

42.6.5 The decisive experimental fact in understanding the mechanism
of $\beta^-$ decay and establishing a suitable theory was an
investigation of the energy spectrum of the emitted electrons. This
spectrum was found to be continuous, extending to $E = E_{\max}$
(Fig. 42.2). The energy $E_{\max}$ is called the upper energy limit
of the $\beta^-$ spectrum; it is a characteristic of the source of
$\beta^-$ radioactive radiation. Electrons with an energy exceeding
$E_{\max}$ cannot be obtained with the given source.

42.6.6 To reconcile the continuity of the electron energy spectrum
with the discrete nature of the energy levels of the nucleus
(42.5.5), it is necessary to assume that, together with the electron
${}^{0}_{-1}\mathrm{e}$, another particle, the electron antineutrino*
${}^0_0\tilde{\nu}_e$, is emitted from the nucleus. The total energy
lost by the nucleus in a beta decay is equal to $E_{\max}$, but it is
variously distributed between the electron and the electron
antineutrino**. In particular, the end point of the curve in
Fig. 42.2 corresponds to all the energy of the decay being carried
away by the electron. Zero energy of the electron on the curve would
correspond to all the energy being carried away by the antineutrino
${}^0_0\tilde{\nu}_e$.

For the $\beta^-$ radioactivity of free neutrons $E_{\max} = 782$ keV,
which fully agrees with the value discussed in Sect. 42.6.3.

42.6.7 Neither the mass number $A$ nor the spin of the nucleus
(42.1.5) changes in a beta decay. The conservation of spin is
explained by the fact that the spin of the electron antineutrino is
$\hbar/2$ (see Table 43.1), and the combined spin of this particle
and of the electron, whose spin is also equal to $\hbar/2$, is equal
to zero when the spins of the two particles are oppositely oriented.


42.7  Gamma  Rays 

42.7.1 Gamma rays constitute hard electromagnetic radiation whose
energy is released in nuclear transitions from excited energy states
to the ground or less excited states (42.5.5), as well as in nuclear
reactions. In the first case, according to the Bohr frequency
condition (39.1.6), the energy of a gamma photon is equal to the
difference in the energies of the initial and final energy levels of
the nucleus:

$$h\nu_{ik} = E_i - E_k = \Delta E_{ik},$$

where $\nu_{ik}$ is the frequency of the photon corresponding to the
transition of the nucleus from the state with energy $E_i$ to the
state with energy $E_k$. The magnitude of $\Delta E_{ik}$ is of the
order of 0.1 MeV and considerably exceeds the differences between the
energies of the electron levels in the atom (39.1.5). Gamma rays are
extremely short-wave electromagnetic radiation of a wavelength not
exceeding $10^{-2}$ nm, i.e. 0.1 Å. The discrete line spectrum of
gamma rays is confirmation of the aforesaid (see Sect. 42.7.4).

* Antiparticles are discussed in Sect. 43.5.1.

** It is evident that $E_{\max}$ determines the difference
$\Delta E_{ik}$ in the energies of two levels of the nucleus.
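
The connection between the transition energy and the wavelength quoted
in Sect. 42.7.1 follows from $h\nu = \Delta E$ and $\lambda = c/\nu$.
A minimal sketch, assuming rounded constants and hypothetical function
names:

```python
H_EV_S = 4.135667696e-15    # Planck constant, eV*s
C_M_PER_S = 2.99792458e8    # speed of light, m/s


def photon_frequency_hz(energy_ev: float) -> float:
    """Frequency of a photon from the Bohr condition h*nu = Delta E."""
    return energy_ev / H_EV_S


def photon_wavelength_nm(energy_ev: float) -> float:
    """Wavelength c / nu of a photon of the given energy, in nanometres."""
    return C_M_PER_S / photon_frequency_hz(energy_ev) * 1e9


for energy_ev in (1.0e5, 1.29e5):   # 0.1 MeV and the 129 keV transition
    print(f"{energy_ev:.3g} eV -> {photon_wavelength_nm(energy_ev):.4f} nm")
```

A transition energy of 0.1 MeV corresponds to a wavelength of about
0.012 nm, i.e. of the order of $10^{-2}$ nm as stated above.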

42.7.2 Gamma radiation is not an independent type of radioactivity
(42.4.2). It accompanies the processes of $\alpha$ and $\beta$ decays
and causes no changes in the charge or mass number of nuclei. It has
been established that gamma rays are emitted by a daughter nucleus
(42.4.3) which turns out to be excited at the instant it is formed
(42.7.4). The excited nucleus loses its energy during a time ranging
from $10^{-13}$ to $10^{-14}$ s, which is considerably less than the
lifetime of the excited atom (approx. $10^{-8}$ s).

42.7.3 The origin and properties of gamma rays are confirmed by the
laws of internal conversion of gamma rays, which consists in the
photoelectric ejection (37.1.1) of electrons from the atom's inner
shells by the gamma radiation of its own nucleus. Electrons ejected
by such an internal photoelectric effect are called conversion
electrons.

In many cases all the energy of the gamma rays is expended on
internal conversion phenomena, and only conversion electrons are
observed instead of gamma radiation. The energy $eV_n$ of a
conversion photoelectron is related to the energy $h\nu$ of a gamma
photon by Einstein's photoelectric emission equation (37.1.4):

$$eV_n = h\nu - \Phi_n,$$

where $\Phi_n$ is the work function of an electron from the $n$th
electron shell of the atom, numerically equal to the energy of an
electron located at the corresponding energy level in the atom
(39.1.5). These energies are known from data on characteristic X-ray
spectra of atoms (39.7.9). A gamma photon of the energy $h\nu$ can
remove an electron from any inner shell ($K$, $L$, $M$, etc.) of the
atom (39.3.6). Accordingly, the energies of the conversion electrons
are expressed by

$$eV_L = h\nu - \Phi_L, \quad eV_M = h\nu - \Phi_M, \quad eV_N = h\nu - \Phi_N, \text{ etc.},$$

where $\Phi_L$, $\Phi_M$, $\Phi_N$, etc. are the X-ray energy levels
of the atom.

42.7.4 The result of internal conversion (42.7.3) is the loss of
electrons from the inner shells of the atom and, consequently, the
provision of conditions for producing the characteristic X-ray
spectrum (39.7.9). Internal conversion is therefore accompanied by
the emission of characteristic X rays. A measurement of the energy of
the conversion electrons and the data on the quantities $\Phi_L$,
$\Phi_M$, etc. (42.7.3) enabled it to be shown that the nucleus can
emit a definite series of monochromatic gamma rays, i.e. that gamma
rays have a line spectrum. Besides, these data showed that gamma rays
are emitted by a daughter nucleus rather than a parent one.

42.7.5 Gamma rays have a very strong effect on matter, in
particular, on biological substances. The effect of gamma rays and
other kinds of ionizing radiation is evaluated by the absorbed
radiation dose $D$, the ratio of the absorbed energy of radiation to
the mass of the irradiated matter. The unit of the dose is J/kg,
which is the dose of absorbed radiation at which a mass of 1 kg of
irradiated matter receives 1 joule of energy of ionizing radiation*.
This unit is called the gray (Gy). Also used is the off-system unit
called the rad: 1 rad = $10^{-2}$ Gy.

The dose rate $N$ of radiation is the absorbed dose $D$ referred to
unit time. Thus

$$N = D/t.$$

The unit of dose rate is a watt per kilogram: W/kg = Gy/s.

42.7.6 The energy characteristic of radiation, evaluated on the
basis of the ionization of dry atmospheric air, is the exposure dose
$D_e$. It is measured in coulombs per kilogram (C/kg). One coulomb
per kilogram is the exposure dose of X or gamma radiation at which
the sum of the electric charges of ions of the same sign, produced by
the electrons released in irradiated air of 1 kg mass when their
ionizing capacity is fully utilized, is equal to one coulomb. An
off-system unit of exposure dose is the roentgen (R), with
1 R = $2.58 \times 10^{-4}$ C/kg.

With an exposure dose equal to 1 R, a total charge of ions of the
same sign equal to $(1/3) \times 10^{-9}$ C is obtained in
$10^{-6}$ m$^3$ of dry air at standard atmospheric pressure.

The exposure dose rate $N_e$ is the exposure dose per unit time:
$N_e = D_e/t$, measured in amperes per kilogram (A/kg). One ampere
per kilogram is the exposure dose rate of electromagnetic radiation
at which the exposure dose increases by one coulomb per kilogram in
one second. The off-system units of the exposure dose rate are:
1 R/s = $2.58 \times 10^{-4}$ A/kg, 1 R/min = $4.30 \times 10^{-6}$
A/kg and 1 R/h = $7.17 \times 10^{-8}$ A/kg.

* Owing to the specific nature of information on the biological
effect of gamma rays, the pertinent units of physical quantities
are given here instead of in the Appendix.
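
The unit relations of Sects. 42.7.5 and 42.7.6 amount to simple
conversions, sketched below. The density of dry air at standard
conditions (taken here as 1.293 kg/m^3) and all names are assumptions
introduced for the illustration.

```python
RAD_IN_GY = 1.0e-2                  # 1 rad expressed in grays
ROENTGEN_IN_C_PER_KG = 2.58e-4      # 1 R expressed in C/kg
AIR_DENSITY = 1.293                 # kg/m^3, dry air at standard conditions (assumed)


def exposure_rate_a_per_kg(roentgens: float, seconds: float) -> float:
    """Exposure dose rate in A/kg for a dose in roentgens received in `seconds`."""
    return roentgens * ROENTGEN_IN_C_PER_KG / seconds


def roentgen_from_charge(charge_c: float, volume_m3: float) -> float:
    """Exposure dose in C/kg for a given ion charge produced in a volume of air."""
    return charge_c / (volume_m3 * AIR_DENSITY)


print(exposure_rate_a_per_kg(1.0, 1.0))      # 1 R/s in A/kg
print(exposure_rate_a_per_kg(1.0, 60.0))     # 1 R/min in A/kg
print(exposure_rate_a_per_kg(1.0, 3600.0))   # 1 R/h in A/kg
print(roentgen_from_charge(1.0e-9 / 3.0, 1.0e-6))   # close to 2.58e-4 C/kg
```

The last line checks that a charge of $(1/3) \times 10^{-9}$ C in
$10^{-6}$ m$^3$ of air is consistent with the value of the roentgen in
C/kg.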

42.7.7 A radiation dose can also be evaluated on the basis of its
biological effect. The unit applied for this purpose is called the
roentgen equivalent man (rem). This is the absorbed energy of
radiation that produces the same biological effect as that produced
by one roentgen. Thus

1 rem = $10^{-2}$ J/kg.

The safe dose rate for humans is considered to be about 250 times
that produced by the background of cosmic radiation and by
radioactive radiation from the earth's interior.


42.8 Mössbauer Effect

42.8.1 All excited energy levels of the nucleus have energy values
determined to an accuracy of $\Delta E$, which is found from the
indeterminacy relation (38.6.6). Thus

$$\Delta E \approx \frac{\hbar}{\Delta t},$$

where $\Delta t$ is the lifetime of the nucleus in the excited state.
Only for the ground state of a stable nucleus is $\Delta t = \infty$
and $\Delta E = 0$, i.e. the energy value of the nucleus is exactly
equal to $E$ (this is equally true for the energy levels of electrons
in atoms). For example, a nucleus of iridium
${}^{191}_{77}\mathrm{Ir}$ goes over from the excited state with the
energy $E = 129$ keV to the ground state, emitting a gamma photon,
during the time $\Delta t$, which can be taken equal to the half-life
(42.4.5) $T = 10^{-10}$ s. The energy indeterminacy is found to equal
$\Delta E \approx 5 \times 10^{-6}$ eV. The finite lifetime of the
excited energy states of the nucleus leads to a lack of
monochromaticity of the gamma radiation that accompanies the
transition of the nucleus from the excited to the ground state. This
lack of monochromaticity is called the natural linewidth of gamma
radiation, and the indeterminacy $\Delta E$ of the energy of an
excited state is called the natural width $\Gamma$ of the energy
level of the nucleus*.

42.8.2 Resonance absorption of gamma rays by nuclei implies the
absorption by the nuclei of gamma photons of such frequency $\nu$
that the energy $h\nu$ of a photon is equal to the difference in the
energies of one of the excited states and the ground state of the
nucleus. The term resonance means that the gamma photon emitted in
the transition of the nucleus from that excited state to its ground
state has the same frequency $\nu$.

When gamma photons are emitted or absorbed by a nucleus, the recoil
of the nucleus must be taken into account. In a transition of a
nucleus from an excited state with the energy $E$ to the ground state
(of an energy taken equal to zero), the gamma photon acquires the
energy $h\nu_{em}$, equal to

$$h\nu_{em} = E - E_N < E,$$

where $E_N$ is the nuclear recoil energy.

To excite the nucleus from the ground state to the state with the
energy $E$, the gamma photon should have the energy $h\nu_{abs}$,
equal to $h\nu_{abs} = E + E_N > E$.

The frequencies at the maxima of the emission lines ($\nu_{em}$) and
absorption lines ($\nu_{abs}$) are shifted with respect to one
another by the amount

$\nu_{abs} - \nu_{em} = \Delta\nu$, which is such that
$h\,\Delta\nu = 2E_N$.

The energy $E_N$ of nuclear recoil can be determined from the
momentum $p_{ph}$ of the photon (37.2.2), which, in the processes of
emitting and absorbing a gamma photon, should equal the momentum of
the nucleus ($p_{ph} = p_N$):

$$E_N = \frac{p_N^2}{2M_N} = \frac{p_{ph}^2}{2M_N} = \left(\frac{h\nu}{c}\right)^2 \frac{1}{2M_N},$$

where $M_N$ is the mass of the nucleus. For the nucleus
${}^{191}_{77}\mathrm{Ir}$, with the energy $E = 129$ keV of its
excited state, calculations yield $E_N = 0.05$ eV and a shift
$\Delta\nu = 2E_N/h$ of the maxima of the emission and absorption
lines. Here $h\,\Delta\nu = 0.1$ eV, which considerably exceeds the
natural width $\Gamma$ of the level (42.8.1).

* This definition also holds for transitions of atoms from an
excited to the ground state. But here the lack of monochromaticity
is called the natural spectral linewidth, and the indeterminacy
is the natural width of energy levels of electrons in the atom.
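
The comparison between the recoil shift and the natural width can be
reproduced with a few lines. The sketch below approximates the nuclear
mass by $A$ atomic mass units and takes the natural width as
$\hbar/\Delta t$; both are assumptions made for the illustration, as
are the function names.

```python
HBAR_EV_S = 6.582119569e-16     # reduced Planck constant, eV*s
AMU_EV = 931.494e6              # rest energy of one atomic mass unit, eV


def natural_width_ev(lifetime_s: float) -> float:
    """Natural width of a level, Gamma ~ hbar / Delta t, in eV."""
    return HBAR_EV_S / lifetime_s


def recoil_energy_ev(photon_energy_ev: float, mass_number: int) -> float:
    """Recoil energy E_N = (h*nu)^2 / (2 M_N c^2), with M_N ~ A atomic mass units."""
    return photon_energy_ev ** 2 / (2.0 * mass_number * AMU_EV)


E_GAMMA = 129.0e3               # transition energy of Ir-191, eV
gamma_width = natural_width_ev(1.0e-10)
e_recoil = recoil_energy_ev(E_GAMMA, 191)

print(f"natural width   ~ {gamma_width:.1e} eV")
print(f"recoil energy   ~ {e_recoil:.3f} eV")
print(f"line shift 2E_N ~ {2.0 * e_recoil:.2f} eV")
print(f"shift / width   ~ {2.0 * e_recoil / gamma_width:.0f}")
```

The width obtained in this way is of the same order as the value
quoted in Sect. 42.8.1, and the shift between the emission and
absorption lines exceeds it by about four orders of magnitude, so that
resonance absorption by free nuclei at rest is practically excluded.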

42.8.3 A drastic reduction in the energy of nuclear recoil in
absorbing and emitting gamma rays is observed when these processes
take place in nuclei bound in a crystal lattice. Under these
conditions, the momentum and energy are imparted to the crystal
lattice as a whole, rather than to the single nucleus that emits (or
absorbs) a gamma photon. The mass of the crystal is substantially
greater than that of a nucleus, and the loss of energy $E_N$ in
emission or absorption of gamma rays becomes extremely small. In this
case, resonance absorption and emission of gamma photons of a
strictly definite frequency $\nu$ is observed and the linewidth is
comparable to the natural width $\Gamma$ (42.8.1). The phenomenon of
resonance absorption (or emission) of gamma rays without recoil is
called the Mössbauer effect.

42.8.4 The Mössbauer effect is made use of in nuclear spectroscopy
to make precise measurements of the energy levels of atomic nuclei.
In the gamma transition in ${}^{57}\mathrm{Fe}$ nuclei, for instance,
with the energy of the transition $E = 14.4$ keV, the change in the
energy of the level is determined with an accuracy of
$\Gamma/E = 3 \times 10^{-13}$. For the gamma transition in
${}^{67}\mathrm{Zn}$ with the energy of the transition $E = 93$ keV,
the quantity $\Gamma/E$ was found to equal $5 \times 10^{-16}$.

The Mössbauer effect has been used to check the conclusion on the
shift of spectral line frequencies in a gravitational field (6.2.1).
As a photon travels in a gravitational field its energy is changed by
the amount $\Delta E = -m(\varphi_2 - \varphi_1)$, where
$m = h\nu/c^2$ is the mass of the photon (37.2.1), and $\varphi_1$
and $\varphi_2$ are the potentials of the gravitational field at
points 1 and 2 (6.2.4). The minus sign indicates that an increase in
the potential energy of a photon in a gravitational field is at the
expense of its own energy $E = h\nu$:

$$h\,\Delta\nu = -m\,\Delta\varphi.$$

The relative change in frequency of a photon as it passes through a
gravitational potential difference $\Delta\varphi$ is

$$\frac{\Delta\nu}{\nu} = -\frac{\Delta\varphi}{c^2}.$$

Here $\Delta\varphi > 0$ because the gravitational potential of the
sun increases with the distance from the sun: at the earth's surface
it is greater than at the surface of the sun. Consequently,
$\Delta\nu/\nu < 0$ and all frequencies of spectral lines of the sun
and stars, recorded on the earth, are shifted toward the red end of
the spectrum. This effect is called the gravitational red shift.

The Mössbauer effect enabled the gravitational frequency shift of
the gamma photon to be observed in its motion in the earth's
gravitational field. In vertical motion from the floor to the ceiling
of the laboratory through a height $l$ of about 10 m, the relative
change in frequency is

$$\frac{|\Delta\nu|}{\nu} = \frac{\Delta\varphi}{c^2} = \frac{gl}{c^2} \approx 10^{-15},$$

where $g$ is the acceleration due to gravity (7.3.3). To record this
frequency shift it is necessary to provide for resonance absorption
of gamma photons in such manner that the source and detector of the
gamma rays have a relative linewidth equal to or less than
$10^{-15}$. Then there will be no absorption if the frequency of the
gamma photon incident on the nucleus differs from the frequency of
the photon that the nucleus can absorb by the amount
$\Delta\nu = 10^{-15}\,\nu$.

This experiment was conducted with two identical crystal
sources of gamma rays, one of which could be placed at a height
of 20 m above the other. When the detector of gamma photons
was at the same height as the source of gamma photons, resonance
absorption occurred. As the detector was raised 20 m the
absorption ceased owing to the gravitational shift in frequency.
The Doppler effect (30.6.1) was employed to restore absorption.
At a definite velocity at which the detector and source approach
each other, the Doppler increase in frequency exactly compensated
for its gravitational reduction, and resonance absorption
was restored. This experiment was a confirmation under
laboratory conditions of the gravitational red shift.
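
The orders of magnitude quoted above can be checked numerically. The
sketch below evaluates $gh/c^2$ for the two heights mentioned in the
text and the approach speed at which a first-order Doppler shift
$v/c$ would cancel the gravitational shift. The numerical values of
$g$ and $c$ and the function names are assumptions introduced for
illustration only.

```python
# Gravitational frequency shift over a laboratory height and the
# Doppler speed that would compensate it (illustrative sketch).
G_ACCEL = 9.81       # m/s^2, assumed standard value of g
C_LIGHT = 2.998e8    # m/s, speed of light in free space


def relative_shift(height_m):
    """Magnitude of the relative frequency shift g*h/c^2."""
    return G_ACCEL * height_m / C_LIGHT**2


def compensating_speed(height_m):
    """Approach speed v with v/c = g*h/c^2 (first-order Doppler effect)."""
    return relative_shift(height_m) * C_LIGHT


if __name__ == "__main__":
    for h in (10.0, 20.0):
        print(f"h = {h:4.0f} m: shift = {relative_shift(h):.2e}, "
              f"v = {compensating_speed(h):.2e} m/s")
```

For a height of 20 m the compensating speed comes out below one
micrometre per second, which illustrates how narrow the resonance
line must be.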


42.9  Nuclear  Reactions 

42.9.1 Nuclear reactions are transformations of atomic nuclei
brought about by their interactions with one another or with
elementary particles (43.1.1). As a rule, nuclear reactions involve
two nuclei and two particles. One pair, a nucleus and a
particle, is said to be the initial, and the other, the final pair.
Nuclear reactions are represented in one of the following ways:

$$A + a \to B + b \quad\text{or}\quad A(a, b)B,$$

where A and B are the initial and final nuclei, and a and b are
the initial and final particles in the reaction. In many cases
a nuclear reaction can proceed in more than one way, the preceding
reaction being $A + a \to C + c$, i.e. $A(a, c)C$, or some
other version. The possible ways in which a nuclear reaction
may proceed are called its channels. The initial phase of a reaction
is called its entrance channel.

42.9.2 A nuclear reaction is characterized by its energy $Q$ (called
the nuclear reaction energy), which is equal to the difference
between the kinetic energies of the final and initial pairs in the
reaction (42.9.1). When $Q < 0$, the reaction proceeds with the
absorption of energy and is said to be endothermic (or endoergic);
when $Q > 0$, the reaction proceeds with the release of energy
and is said to be exothermic (or exoergic). An endothermic nuclear
reaction is found to be feasible at a certain minimum
(threshold) kinetic energy $E_{th}$ of the nuclei or particles causing
the reaction:

$$E_{th} = \frac{M_A + M_a}{M_A}\,|Q|,$$

where $M_A$ is the mass of the stationary (target) nucleus, and
$M_a$ is the mass of the particle (or nucleus) bombarding the
target nucleus.
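
As a worked illustration of the threshold formula, the sketch below
computes the reaction energy and the threshold energy for the
reaction of nitrogen with alpha particles discussed in Sect. 42.9.6.
The atomic masses (in atomic mass units) and the conversion factor
are approximate tabulated values assumed here for illustration; they
do not come from this handbook, and the function names are
hypothetical.

```python
# Reaction energy Q and threshold energy for N-14(alpha, p)O-17.
# Atomic masses in u are approximate tabulated values (assumption).
U_TO_MEV = 931.494
MASS_U = {"N14": 14.003074, "He4": 4.002603,
          "O17": 16.999132, "H1": 1.007825}


def reaction_energy_mev(initial, final):
    """Q = (sum of initial masses - sum of final masses) * c^2."""
    m_initial = sum(MASS_U[name] for name in initial)
    m_final = sum(MASS_U[name] for name in final)
    return (m_initial - m_final) * U_TO_MEV


def threshold_energy_mev(q_mev, target_mass, projectile_mass):
    """E_th = (M_A + M_a) / M_A * |Q| for Q < 0, otherwise zero."""
    if q_mev >= 0:
        return 0.0
    return (target_mass + projectile_mass) / target_mass * abs(q_mev)


if __name__ == "__main__":
    q = reaction_energy_mev(["N14", "He4"], ["O17", "H1"])
    e_th = threshold_energy_mev(q, MASS_U["N14"], MASS_U["He4"])
    print(f"Q = {q:.2f} MeV, E_th = {e_th:.2f} MeV")
```

With these masses the reaction is endothermic by about 1.2 MeV, and
the alpha particle needs roughly 1.5 MeV in the laboratory frame.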

Nuclear reactions comply with the laws of conservation of energy,
momentum, electrical charge and mass number. If the kinetic
energy of the particles entering a reaction is sufficient for
the creation of a nucleon-antinucleon pair (43.5.6), the mass
number may change. In addition, nuclear physics has certain
conservation laws not found in other branches of physics (see Sects.
43.4.1 through 43.4.4).

42.9.3 The efficiency of a nuclear reaction is determined by
the magnitude of the effective cross section $\sigma$ of the given
reaction (reaction cross section). The quantity $\sigma$ has the
dimension of area and characterizes the "yield" of the reaction per
particle irradiating the nucleus. Thus

$$\sigma = \frac{dn_0}{n_0 N_0\,dx}.$$

It is assumed in this definition that a plane-parallel stream
containing $n_0$ particles is incident in unit time on unit area of
the cross section of a substance containing $N_0$ nuclei in unit
volume; $dn_0$ is the number of the particles that undergo the
nuclear reaction in a layer of the thickness $dx$.
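
The definition can be turned around to estimate what fraction of the
incident particles reacts in a layer. The sketch below does this for
a thin layer, $dn_0/n_0 = \sigma N_0\,dx$, and, under the additional
assumption (not stated in the text) that each reaction removes a
particle from the stream, for a thick layer, where the surviving
fraction falls off as $\exp(-\sigma N_0 x)$. All numerical values
and function names are illustrative.

```python
import math


def reacted_fraction_thin(sigma_m2, nuclei_per_m3, dx_m):
    """Fraction dn0/n0 reacting in a thin layer: sigma * N0 * dx."""
    return sigma_m2 * nuclei_per_m3 * dx_m


def surviving_fraction(sigma_m2, nuclei_per_m3, x_m):
    """Fraction left after thickness x, assuming every reaction
    removes the particle from the stream (exponential attenuation)."""
    return math.exp(-sigma_m2 * nuclei_per_m3 * x_m)


if __name__ == "__main__":
    sigma = 1e-28      # m^2, i.e. one barn (illustrative)
    density = 5e28     # nuclei per m^3 (illustrative)
    print(reacted_fraction_thin(sigma, density, 1e-3))
    print(surviving_fraction(sigma, density, 0.1))
```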

42.9.4 Depending upon the nature of the interaction of particle
a with the target A, distinction is made between direct interaction,
in which nuclear reactions proceed in a single stage according
to some process (42.9.1), and nuclear reactions that proceed
in two stages with the formation of a compound nucleus.
In the first stage the incident particle is captured by the target
nucleus and the particle's energy is not imparted to any particular
nucleon, but is shared equally by all the particles of the
compound nucleus. Thus none of the particles gains sufficient
energy to be ejected from the nucleus*. The compound nucleus
is dealt with as an excited statistical system of particles, having
random motions similar to those of particles in a drop of liquid.
Such rapid redistribution of energy between the particles of
the nucleus is possible only upon frequent collisions of the
particles, this being typical for the redistribution of energy between
the particles in a drop of liquid. As the result of chance deviations
from uniform distribution of the excitation energy amongst
the particles of the compound nucleus, sufficient energy is
concentrated on one of them for it to be ejected from the nucleus. This
second stage of the nuclear reaction occurs after a lapse of time
$(10^7$ to $10^8)\,t_n$ following the first stage, where $t_n$ is the
characteristic nuclear time (43.2.6).

The sequence of a nuclear reaction proceeding with the formation
of a compound nucleus is as follows:

$${}^{A}_{Z}\mathrm{X} + a \to {}^{A_1}_{Z_1}\mathrm{Y} \to {}^{A_2}_{Z_2}\mathrm{C} + b,$$

where ${}^{A}_{Z}\mathrm{X}$ is the initial target nucleus, a is the incident particle,
${}^{A_1}_{Z_1}\mathrm{Y}$ is the compound nucleus, ${}^{A_2}_{Z_2}\mathrm{C}$ is the product nucleus and b
is the particle ejected from the nucleus as a result of the reaction.
If a = b, particles are being scattered by the nucleus (scattering
is elastic or inelastic according to whether the energy of the
particles is the same or different before and after scattering).
If b ≠ a, a nuclear reaction is proceeding in the true sense of the
term.

* The "missile" captured by the nucleus, for instance, an alpha
particle or deuteron, can consist of several particles.

42.9.5 Nuclear reactions are classified according to various
of their features: according to the energy of the particles that
initiate them, according to the energy of the particles participating
in them, and according to the nature of the nuclear
transformations that occur in them. Nuclear reactions at low energies
(of the order of 1 eV) proceed mainly with the participation
of neutrons. Reactions at medium energies (up to several MeV)
are induced, in addition to neutrons, by various charged particles
(alpha particles, protons, deuterons, i.e. nuclei of heavy
hydrogen, etc.), as well as gamma photons. Charged particles that
cause nuclear reactions may be multiply charged ions of heavy
chemical elements, as well as charged particles accelerated in
accelerators. Reactions at high energies (hundreds and thousands
of MeV) lead to the creation of elementary particles (mesons,
hyperons, etc.) that are absent in the free state.

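42.9.6 The following are examples of nuclear reactions initiated
by alpha particles and deuterons ${}^{2}_{1}\mathrm{D}$:

(a) historically, the first nuclear reaction for the transformation
of nitrogen into oxygen was the following:

$${}^{14}_{7}\mathrm{N} + {}^{4}_{2}\mathrm{He} \to ({}^{18}_{9}\mathrm{F}) \to {}^{17}_{8}\mathrm{O} + {}^{1}_{1}p,$$

or in shorter notation:

$${}^{14}_{7}\mathrm{N}(\alpha, p)\,{}^{17}_{8}\mathrm{O};$$

(b) the nuclear reaction in which neutrons were first obtained
was:

$${}^{9}_{4}\mathrm{Be} + {}^{4}_{2}\mathrm{He} \to {}^{12}_{6}\mathrm{C} + {}^{1}_{0}n;$$

(c) nuclear reactions initiated by deuterons ${}^{2}_{1}\mathrm{D}$, the nuclei of
heavy hydrogen (deuterium), can lead to the fusion of heavier
nuclei, tritium ${}^{3}_{1}\mathrm{H}$ or the light isotope of helium ${}^{3}_{2}\mathrm{He}$, with the
formation of a proton or neutron:

$${}^{2}_{1}\mathrm{D} + {}^{2}_{1}\mathrm{D} \to {}^{3}_{1}\mathrm{H} + {}^{1}_{1}p \quad\text{and}\quad {}^{2}_{1}\mathrm{D} + {}^{2}_{1}\mathrm{D} \to {}^{3}_{2}\mathrm{He} + {}^{1}_{0}n$$

(see also 42.9.13).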
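
Each of the reactions listed above conserves electric charge and
mass number (42.9.2). The sketch below checks this bookkeeping by
summing the mass numbers and charge numbers on both sides. The
dictionary of particles and the function name are introduced here
for illustration only.

```python
# Each entry is (mass number A, charge number Z).
PARTICLES = {
    "n": (1, 0), "p": (1, 1), "D": (2, 1), "T": (3, 1),
    "He3": (3, 2), "He4": (4, 2), "Be9": (9, 4),
    "C12": (12, 6), "N14": (14, 7), "O17": (17, 8),
}


def is_balanced(initial, final):
    """True if total A and total Z agree on both sides."""
    def totals(names):
        return (sum(PARTICLES[x][0] for x in names),
                sum(PARTICLES[x][1] for x in names))
    return totals(initial) == totals(final)


REACTIONS = [
    (["N14", "He4"], ["O17", "p"]),   # item (a)
    (["Be9", "He4"], ["C12", "n"]),   # item (b)
    (["D", "D"], ["T", "p"]),         # item (c), first reaction
    (["D", "D"], ["He3", "n"]),       # item (c), second reaction
]

if __name__ == "__main__":
    for initial, final in REACTIONS:
        print(" + ".join(initial), "->", " + ".join(final),
              "balanced:", is_balanced(initial, final))
```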

42.9.7 The action of neutrons ${}^{1}_{0}n$ leads to the formation of
artificial radioactive isotopes, such as radiocarbon ${}^{14}_{6}\mathrm{C}$ with
a half-life (42.4.5) exceeding 5000 years:

$${}^{14}_{7}\mathrm{N} + {}^{1}_{0}n \to {}^{14}_{6}\mathrm{C} + {}^{1}_{1}p. \qquad (*)$$

Subsequent decay proceeds according to the process

$${}^{14}_{6}\mathrm{C} \to {}^{14}_{7}\mathrm{N} + {}^{0}_{-1}e + {}^{0}_{0}\tilde{\nu}_e,$$

where ${}^{0}_{-1}e$ and ${}^{0}_{0}\tilde{\nu}_e$ represent the electron and electron
antineutrino. The long half-life of ${}^{14}_{6}\mathrm{C}$ is the basis for the radiocarbon
(carbon-14) dating technique used in archaeology. By determining
the relative amount of undecayed radioactive carbon,
which ceased to be accumulated in a living organism when it
died, it is possible, according to reaction (*), to establish the
time when the organism no longer absorbed the isotope ${}^{14}_{6}\mathrm{C}$ from
the atmosphere. This isotope is formed in the atmosphere from
nitrogen by the action of cosmic-ray neutrons.
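
The dating procedure amounts to inverting the law of radioactive
decay (42.4.5). The sketch below shows the arithmetic, assuming a
half-life of 5730 years for carbon-14 (a commonly quoted value that
is not given in the text) and a hypothetical function name.

```python
import math

HALF_LIFE_YEARS = 5730.0  # assumed half-life of carbon-14


def age_years(remaining_fraction):
    """Time since death from the fraction N/N0 of undecayed C-14.

    Uses N/N0 = 2 ** (-t / T), i.e. t = T * log2(N0 / N).
    """
    if not 0.0 < remaining_fraction <= 1.0:
        raise ValueError("fraction must lie in (0, 1]")
    return HALF_LIFE_YEARS * math.log(1.0 / remaining_fraction) / math.log(2.0)


if __name__ == "__main__":
    for fraction in (1.0, 0.5, 0.25, 0.1):
        print(f"N/N0 = {fraction:4.2f}: t = {age_years(fraction):8.0f} years")
```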

42.9.8 Slow neutrons are ones whose de Broglie wavelength
(38.1.4) exceeds the nuclear radius $R$: $\lambda \approx \hbar/mv$ and, accordingly,
$v < \hbar/mR$. Their energy does not exceed 100 keV. Neutrons
with energies up to 0.5 eV are called thermal neutrons. As such
neutrons pass through matter they undergo capture, the absorption
of the neutrons by the nuclei. If the energy of the neutrons
coincides with an energy level of the compound nucleus (42.9.4),
resonance absorption (resonance capture) of the neutrons occurs. This
process is the basis of the technique for obtaining transuranic
chemical elements. The transuranic element neptunium ${}^{239}_{93}\mathrm{Np}$
is formed upon resonance capture of neutrons by the most abundant
isotope of uranium ${}^{238}_{92}\mathrm{U}$ according to the reaction

$${}^{238}_{92}\mathrm{U} + {}^{1}_{0}n \to {}^{239}_{92}\mathrm{U} \xrightarrow[23\ \text{min}]{\beta^-} {}^{239}_{93}\mathrm{Np},$$

where the half-life of radioactive isotope ${}^{239}_{92}\mathrm{U}$ is indicated,
which is transformed into ${}^{239}_{93}\mathrm{Np}$. Next, the nucleus of isotope
${}^{239}_{93}\mathrm{Np}$ is transformed into plutonium ${}^{239}_{94}\mathrm{Pu}$:

$${}^{239}_{93}\mathrm{Np} \xrightarrow[2.3\ \text{days}]{\beta^-} {}^{239}_{94}\mathrm{Pu}.$$


Owing to its efficient fission by the action of thermal neutrons
(42.9.10), plutonium plays a prominent role in the production
of nuclear energy. Plutonium ${}^{239}_{94}\mathrm{Pu}$ is radioactive by alpha particle
emission and has a huge half-life (24 000 years). It is transformed
into the stable isotope of uranium ${}^{235}_{92}\mathrm{U}$ according to the
reaction:

$${}^{239}_{94}\mathrm{Pu} \xrightarrow[2.4 \times 10^{4}\ \text{years}]{\alpha} {}^{235}_{92}\mathrm{U}.$$


42.9.9 Nuclear reactions using accelerated nuclei of chemical
elements enabled advancement in Mendeleev's table up to the
element with the number Z = 107. The element with Z = 105
was named nielsbohrium (Ns). Upon bombarding the isotope
${}^{242}_{94}\mathrm{Pu}$ with nuclei of neon ${}^{22}_{10}\mathrm{Ne}$ a chemical element with Z = 104
(kurchatovium ${}^{260}_{104}\mathrm{Ku}$) is obtained. After capture of the nucleus
${}^{22}_{10}\mathrm{Ne}$ by plutonium, the compound nucleus of kurchatovium
${}^{264}_{104}\mathrm{Ku}$ is formed. Then, in one out of ten thousand million compound
nuclei, the nucleus of the element ${}^{260}_{104}\mathrm{Ku}$ is produced
after emitting four neutrons. Thus

$${}^{242}_{94}\mathrm{Pu} + {}^{22}_{10}\mathrm{Ne} \to {}^{264}_{104}\mathrm{Ku} \to {}^{260}_{104}\mathrm{Ku} + 4\,{}^{1}_{0}n.$$

In the most powerful beams of accelerated neon nuclei, one nucleus
of kurchatovium is created in several hours.

42.9.10 A heavy compound nucleus, excited in the resonance
capture of a neutron, can divide into two approximately equal
parts (fission reaction of heavy nuclei). The parts that are formed
are called fission fragments. The instability of heavy nuclei
against fission is due to the large number of protons they contain,
which are subject to Coulomb repulsion from one another.

The fission of a heavy nucleus into two fragments is accompanied
by the evolution of a huge amount of energy. This follows from
a comparison of the specific binding energies of nuclei of elements
located in the middle and at the end of Mendeleev's periodic
system. The energy released in the fission reaction is proportional
to the difference in the specific binding energies of
an unstable "friable" nucleus and two "tightly packed" stable
fission fragments. This difference constitutes 1.1 MeV per nucleon
(42.2.4). In the fission of a nucleus of uranium ${}^{238}_{92}\mathrm{U}$, containing
238 nucleons, the evolved energy should be of the order
of 200 MeV. In the fission of nuclei contained in 1 gram of uranium
${}^{235}_{92}\mathrm{U}$, the energy produced is $8 \times 10^{10}$ J or 22 000 kWh.
Heavy nuclei are capable of fission if they comply with the
condition $Z^2/A \ge 17$, where $Z^2/A$ is called the fission parameter.
This condition is complied with for all nuclei beginning with
silver ${}^{108}_{47}\mathrm{Ag}$, for which $Z^2/A \approx 20$. Nuclei for which
$Z^2/A \ge (Z^2/A)_{cr} \approx 49$ (the critical fission parameter) are entirely
unstable with respect to fission and cannot exist. For kurchatovium
(Z = 104) (42.9.9), $Z^2/A \approx 41$. Spontaneous nuclear fission is possible at
values $Z^2/A < (Z^2/A)_{cr}$; it proceeds similarly to radioactivity
by alpha-particle emission (42.5.1), employing the tunnel effect
(38.7.2). The half-life (42.4.5) for spontaneous nuclear fission
is from $10^{10}$ to $10^{17}$ years.
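
The fission parameter and the energy yield per gram quoted above
are easy to reproduce. The sketch below evaluates $Z^2/A$ for a few
nuclei and converts 200 MeV per fission into joules and
kilowatt-hours per gram. The Avogadro constant, the MeV-to-joule
factor, the choice of mass number 235 and the function names are
assumptions made for this illustration.

```python
AVOGADRO = 6.022e23      # nuclei per mole
MEV_TO_J = 1.602e-13     # joules per MeV
J_PER_KWH = 3.6e6        # joules per kilowatt-hour


def fission_parameter(z, a):
    """Fission parameter Z^2 / A."""
    return z * z / a


def fission_energy_per_gram_j(mass_number, mev_per_fission=200.0):
    """Energy released if every nucleus in one gram undergoes fission."""
    return AVOGADRO / mass_number * mev_per_fission * MEV_TO_J


if __name__ == "__main__":
    for name, z, a in (("Ag-108", 47, 108), ("U-238", 92, 238),
                       ("Ku-260", 104, 260)):
        print(f"{name}: Z^2/A = {fission_parameter(z, a):.1f}")
    energy = fission_energy_per_gram_j(235)
    print(f"{energy:.2e} J per gram, {energy / J_PER_KWH:.0f} kWh")
```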

42.9.11 At the instant they are formed fission fragments have
a neutron excess, i.e. the number of neutrons is in excess of
the number of protons. Excess neutrons, emitted by the fragments,
are called fission neutrons. Their number may vary and
the nuclear fission process is accompanied by neutron multiplication,
characterized by the average number $\langle\nu\rangle$ of emitted
neutrons per fission event. For nuclei of plutonium ${}^{239}_{94}\mathrm{Pu}$ and
uranium ${}^{235}_{92}\mathrm{U}$, in which fission is due to thermal neutrons, $\langle\nu\rangle$
equals 3.0 and 2.5, respectively. Fission neutrons include prompt
(secondary) and delayed neutrons. Prompt neutrons are emitted
coincident with nuclear fission during a time of the order of
$10^{-14}$ s. Delayed neutrons are emitted by the fission products at
some time after fission occurs.

42.9.12 If each of the prompt neutrons emitted in a fission
reaction (42.9.11) interacts with neighbouring nuclei of the
fissile substance, inducing fission in them, there will be an
avalanche-like build-up of acts of nuclear division. This is
called a chain fission reaction. A condition for the initiation of
a nuclear chain reaction is the availability of neutron multiplication.

The neutron multiplication factor $k$ is the ratio of the number of
neutrons emitted at a given stage of the reaction to the number
of such neutrons in the preceding stage. The condition for a
self-sustaining reaction is $k \ge 1$. The practical feasibility of nuclear
fission chain reactions has been proved by the development of
nuclear power engineering, a branch of engineering in which various
types of nuclear reactors have been designed and developed.
Such reactors are devices in which controlled nuclear chain reactions
are carried out.
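
Since $k$ is the ratio of neutron numbers in successive stages, the
number of neutrons after $i$ stages is $N_i = N_0 k^i$. The short
sketch below, with a hypothetical function name, shows how sharply
the behaviour changes on either side of $k = 1$.

```python
def neutrons_after(stages, k, initial=1.0):
    """Number of neutrons after a given number of stages: N0 * k**stages."""
    return initial * k ** stages


if __name__ == "__main__":
    for k in (0.5, 1.0, 2.0):
        print(f"k = {k}: after 10 stages N/N0 = {neutrons_after(10, k)}")
```

For $k < 1$ the reaction dies out, for $k = 1$ it is just
self-sustaining, and for $k > 1$ it grows geometrically.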

42.9.13 The reaction of the fusion of nuclei of tritium or helium
from nuclei of deuterium, discussed in item (c) of Sect. 42.9.6,
is a second method of producing intranuclear energy in addition
to the fission of heavy nuclei. The specific binding energies
(42.2.4) in three nuclei, ${}^{2}_{1}\mathrm{D}$, ${}^{3}_{1}\mathrm{H}$ and ${}^{4}_{2}\mathrm{He}$, are approximately
in the ratio 1:3:6. This means that the nuclear reactions discussed
in Sect. 42.9.6 are accompanied by the evolution of
great amounts of energy: 4.04 MeV in the first reaction and 3.27
MeV in the second. Still more energy, 17.58 MeV, is released
in the reaction

$${}^{2}_{1}\mathrm{D} + {}^{3}_{1}\mathrm{H} \to {}^{4}_{2}\mathrm{He} + {}^{1}_{0}n.$$

Per particle this energy is equal to 17.6/5 MeV = 3.5 MeV,
i.e. approximately four times as much as in the fission of uranium
${}^{238}_{92}\mathrm{U}$ (42.9.10): 200/238 MeV = 0.85 MeV. Even more
efficient with respect to the specific evolution of energy is the
fusion of nuclei of helium ${}^{4}_{2}\mathrm{He}$ from four protons. This energy
amounts to 6.70 MeV per particle.
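
The energies quoted above follow from the mass differences between
the initial and final nuclei. The sketch below recomputes them from
approximate atomic masses in atomic mass units; these tabulated
values and the conversion factor are assumptions brought in for
illustration and are not taken from this handbook.

```python
# Approximate atomic masses in u (assumed tabulated values).
U_TO_MEV = 931.494
MASS_U = {"n": 1.008665, "H1": 1.007825, "D": 2.014102,
          "T": 3.016049, "He3": 3.016029, "He4": 4.002603}


def reaction_energy_mev(initial, final):
    """Energy released: (initial masses - final masses) * c^2."""
    m_initial = sum(MASS_U[name] for name in initial)
    m_final = sum(MASS_U[name] for name in final)
    return (m_initial - m_final) * U_TO_MEV


if __name__ == "__main__":
    q_dd_t = reaction_energy_mev(["D", "D"], ["T", "H1"])
    q_dd_he = reaction_energy_mev(["D", "D"], ["He3", "n"])
    q_dt = reaction_energy_mev(["D", "T"], ["He4", "n"])
    print(f"D + D -> T + p    : {q_dd_t:.2f} MeV")
    print(f"D + D -> He-3 + n : {q_dd_he:.2f} MeV")
    print(f"D + T -> He-4 + n : {q_dt:.2f} MeV "
          f"({q_dt / 5:.2f} MeV per nucleon)")
```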

42.9.14 The fusion reaction of light nuclei, requiring the
overcoming of the potential energy of their repulsion, can be
efficiently carried out at super-high temperatures of the order of
$10^{8}$ or $10^{9}$ K, which exceeds the temperature in the central
regions of the sun ($T \approx 1.3 \times 10^{7}$ K). Such reactions are called
thermonuclear reactions and they take place in matter in a plasma
state (22.6.1). Thermonuclear reactions are evidently the sources
of energy of the stars, compensating for their loss of energy
through radiation. The sun radiates $3.8 \times 10^{26}$ J of energy per
second. This corresponds to the evolution of energy per unit
mass per second of only $1.88 \times 10^{-4}$ J/(s·kg). This is only one
per cent of the specific energy production of a living organism
in the process of metabolism.

It is thought that thermonuclear reactions take place on the sun
in the form of thermonuclear chains, or cycles, in which energy
is released by the transformation of hydrogen nuclei into helium
nuclei.

One of the versions of the proton-proton chain, or deuterium
cycle, begins with the combination of two protons to form deuterium
with the emission of a positron and an electron neutrino:

$${}^{1}_{1}p + {}^{1}_{1}p \to {}^{2}_{1}\mathrm{D} + {}^{0}_{+1}e + {}^{0}_{0}\nu_e.$$

The chain continues according to the reaction

$${}^{2}_{1}\mathrm{D} + {}^{1}_{1}p \to {}^{3}_{2}\mathrm{He} + \gamma,$$

where gamma radiation is emitted not only as excess energy,
but also in the combination of the positron with electrons, which
always exist in plasma (43.5.4). A probable continuation of the
chain is a reaction with the evolution of energy:

$${}^{3}_{2}\mathrm{He} + {}^{3}_{2}\mathrm{He} \to {}^{4}_{2}\mathrm{He} + 2\,{}^{1}_{1}p,$$

where ${}^{4}_{2}\mathrm{He}$ is the symbol for the alpha particle.

In the carbon-nitrogen cycle, the carbon nuclei are "catalysts"
in the reaction for the fusion of hydrogen nuclei into a helium
nucleus.

At the beginning of the cycle a fast proton penetrates a carbon
nucleus:

$${}^{12}_{6}\mathrm{C} + {}^{1}_{1}p \to {}^{13}_{7}\mathrm{N} + \gamma.$$

The radioactive isotope of nitrogen ${}^{13}_{7}\mathrm{N}$, with a half-life of
14 min, decays (42.6.1): ${}^{1}_{1}p \to {}^{1}_{0}n + {}^{0}_{+1}e + {}^{0}_{0}\nu_e$, and a nucleus
of a carbon isotope is formed:

$${}^{13}_{7}\mathrm{N} \to {}^{13}_{6}\mathrm{C} + {}^{0}_{+1}e + {}^{0}_{0}\nu_e.$$

Approximately after each 2.7 million years, a nucleus of ${}^{13}_{6}\mathrm{C}$
captures a proton, forming the nucleus of the stable nitrogen
isotope ${}^{14}_{7}\mathrm{N}$:

$${}^{13}_{6}\mathrm{C} + {}^{1}_{1}p \to {}^{14}_{7}\mathrm{N} + \gamma.$$

On an average of 32 million years later, the nucleus of ${}^{14}_{7}\mathrm{N}$
captures a proton and is transformed into a nucleus of oxygen ${}^{15}_{8}\mathrm{O}$:

$${}^{14}_{7}\mathrm{N} + {}^{1}_{1}p \to {}^{15}_{8}\mathrm{O} + \gamma.$$

The unstable nucleus of ${}^{15}_{8}\mathrm{O}$, with a half-life of 3 min, emits a
positron and a neutrino as it is transformed into the nucleus
of ${}^{15}_{7}\mathrm{N}$:

$${}^{15}_{8}\mathrm{O} \to {}^{15}_{7}\mathrm{N} + {}^{0}_{+1}e + {}^{0}_{0}\nu_e.$$

The cycle is completed by a reaction that takes place approximately
after 100 thousand years:

$${}^{15}_{7}\mathrm{N} + {}^{1}_{1}p \to {}^{12}_{6}\mathrm{C} + {}^{4}_{2}\mathrm{He}.$$

The result of the cycle is the transformation of four protons into
a helium nucleus with the emission of two positrons and gamma
radiation. The released energy amounts to 26.8 MeV per helium
nucleus. This corresponds to an energy output of 700 thousand
kWh per gram-atom of helium. This energy is sufficient to compensate
for the energy the sun loses in radiation. Various stages
of the cycle are separated by tremendous lengths of time when
judged according to the time scale on the earth. But this cycle
is closed and continuous. Hence, all the stages of the cycle
proceed simultaneously on the sun, having begun at various
instants of time.
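
Two statements about the carbon-nitrogen cycle can be checked with
simple bookkeeping: every step conserves mass number and charge,
with the carbon nucleus regenerated at the end, and 26.8 MeV per
helium nucleus corresponds to roughly 700 thousand kWh per
gram-atom. The sketch below does both. The constants, the
representation of each particle as a pair (A, Z), with the positron
written as (0, 1), and the function names are assumptions made for
illustration.

```python
AVOGADRO = 6.022e23
MEV_TO_J = 1.602e-13
J_PER_KWH = 3.6e6

# Each step: (initial particles, final particles) as (A, Z) pairs.
# Photons and neutrinos carry neither charge nor mass number.
CYCLE = [
    ([(12, 6), (1, 1)], [(13, 7)]),          # C-12 + p -> N-13
    ([(13, 7)], [(13, 6), (0, 1)]),          # N-13 -> C-13 + e+
    ([(13, 6), (1, 1)], [(14, 7)]),          # C-13 + p -> N-14
    ([(14, 7), (1, 1)], [(15, 8)]),          # N-14 + p -> O-15
    ([(15, 8)], [(15, 7), (0, 1)]),          # O-15 -> N-15 + e+
    ([(15, 7), (1, 1)], [(12, 6), (4, 2)]),  # N-15 + p -> C-12 + He-4
]


def step_balanced(initial, final):
    """True if total A and total Z agree on both sides of a step."""
    return (sum(a for a, _ in initial) == sum(a for a, _ in final)
            and sum(z for _, z in initial) == sum(z for _, z in final))


def kwh_per_gram_atom(mev_per_nucleus):
    """Energy per mole of product nuclei, in kilowatt-hours."""
    return mev_per_nucleus * AVOGADRO * MEV_TO_J / J_PER_KWH


if __name__ == "__main__":
    print(all(step_balanced(i, f) for i, f in CYCLE))
    print(f"{kwh_per_gram_atom(26.8):.3g} kWh per gram-atom")
```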

42.9.15 Conditions close to those prevailing in the interior of
the sun were provided in a thermonuclear, or fusion, bomb
(more commonly called the hydrogen, or H, bomb). In this bomb
a self-sustaining thermonuclear fusion reaction of an explosive
nature is initiated in a mixture of deuterium and tritium.
Thus

$${}^{2}_{1}\mathrm{D} + {}^{3}_{1}\mathrm{H} \to {}^{4}_{2}\mathrm{He} + {}^{1}_{0}n. \qquad (*)$$

The high temperature required for a thermonuclear reaction
was obtained by the explosion of an "ordinary" atomic bomb,
operating on the principle of a rapid chain reaction in the fission
of heavy nuclei.

The theoretical basis for controlled thermonuclear fusion is a
reaction of the type

$${}^{2}_{1}\mathrm{D} + {}^{3}_{1}\mathrm{H} \to {}^{4}_{2}\mathrm{He} + {}^{1}_{0}n, \qquad (*)$$

as well as of the type

$${}^{2}_{1}\mathrm{D} + {}^{2}_{1}\mathrm{D} \to {}^{3}_{1}\mathrm{H} + {}^{1}_{1}p \quad\text{or}\quad {}^{2}_{1}\mathrm{D} + {}^{2}_{1}\mathrm{D} \to {}^{3}_{2}\mathrm{He} + {}^{1}_{0}n. \qquad (**)$$

The feasibility of these reactions reduces to the necessity to
meet two requirements: the availability of a certain minimum
temperature and a definite restriction on the product $n\tau$:

$n\tau > 10^{16}$ and $T > 10^{9}$ K for reaction (**),
$n\tau > 10^{14}$ and $T > 10^{8}$ K for reaction (*).

Here $n$ is the concentration of matter in the plasma, and $\tau$
is the time the particles are confined in the plasma. The preceding
inequalities indicate that the number of nuclei reacting
in thermonuclear fusion should be proportional to the concentration
of matter and the time of existence of the nucleus.
The practical realization of controlled thermonuclear fusion
reactions is a timely problem of enormous importance and its
solution should be reached in the near future.
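
The two requirements can be expressed as a simple check. The sketch
below encodes the thresholds quoted above and tests whether given
plasma parameters satisfy them. The text does not state the units
of the product $n\tau$; the sketch simply compares numbers and
assumes the caller uses the same (unstated) units as the handbook.
The dictionary keys and function name are hypothetical.

```python
# Thresholds from the text: (minimum n*tau, minimum temperature in K).
# Units of n*tau are not given in the source and are assumed to match.
CRITERIA = {
    "D-T": (1e14, 1e8),   # reaction (*)
    "D-D": (1e16, 1e9),   # reaction (**)
}


def meets_criterion(reaction, n_tau, temperature_k):
    """True if both the n*tau and temperature requirements are met."""
    min_n_tau, min_temperature = CRITERIA[reaction]
    return n_tau > min_n_tau and temperature_k > min_temperature


if __name__ == "__main__":
    print(meets_criterion("D-T", 2e14, 2e8))
    print(meets_criterion("D-D", 2e14, 2e9))
```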


CHAPTER 43 ELEMENTARY PARTICLES

43.1  Preliminary  Information  on  Elementary  Particles 

43.1.1 Elementary, or fundamental, particles are ones to which
an internal structure that is a simple combination of other stable
particles cannot be attributed at the present state of knowledge.
In its interaction with other particles and fields, an elementary
particle behaves as an integral unit. The term "elementary"
particle is essentially of a conditional nature.

43.1.2 In energy regions less than $2m_0c^2$ (where $m_0$ is the rest
mass of the particle), the structure of particles has no effect on
interaction between them and other particles and fields. In
these cases elementary particles are dealt with as particles in
the sense of material points, but possessing many properties,
such as: rest mass, electric charge, spin and a dominant decay
mode (Table 43.1). The concept of a point elementary particle
meets the requirements of relativity theory. An extended particle
of finite dimensions, which is an integral whole, should
not be deformable as otherwise independent movements of
its parts would be possible. But this would mean that external
action on such an elementary particle would be instantaneously
transmitted from certain parts to others, and this contradicts
relativity theory (5.1.1).

TABLE 43.1

Where two values are given, the first refers to the particle and the second to the antiparticle.

Particles and antiparticles | Symbol | Charge, e | Mass, MeV | Lifetime, s | Spin, ħ | Baryon charge | Strangeness | Isospin | Projection of isospin | Dominant decay mode
--- | --- | --- | --- | --- | --- | --- | --- | --- | --- | ---
Photon | $\gamma$ | 0 | 0 | Stable | 1 | 0 | | | |
Leptons | | | | | | | | | |
Neutrinos: electron neutrino and antineutrino | $\nu_e$, $\tilde{\nu}_e$ | 0, 0 | 0 | Stable | 1/2 | 0, 0 | | | |
mu-meson neutrino and antineutrino | $\nu_\mu$, $\tilde{\nu}_\mu$ | 0, 0 | 0 | Stable | 1/2 | 0, 0 | | | |
Electrons: electron and positron | $e^-$, $e^+$ | -1, +1 | 0.511 | Stable | 1/2 | 0, 0 | | | |
Muons: $\mu^-$ meson and $\mu^+$ meson | $\mu^-$, $\mu^+$ | -1, +1 | 106 | $2.2 \times 10^{-6}$ | 1/2 | 0, 0 | | | | $\mu^- \to e^- + \nu_\mu + \tilde{\nu}_e$
Mesons | | | | | | | | | |
Pions: $\pi^+$ meson and $\pi^-$ meson | $\pi^+$, $\pi^-$ | +1, -1 | 140 | $2.6 \times 10^{-8}$ | 0 | 0, 0 | 0, 0 | 1 | +1, -1 | $\pi^+ \to \mu^+ + \nu_\mu$
$\pi^0$ meson | $\pi^0$ | 0 | 135 | $0.8 \times 10^{-16}$ | 0 | 0 | 0 | 1 | 0 | $\pi^0 \to 2\gamma$
Kaons: $K^+$ meson and $K^-$ meson | $K^+$, $K^-$ | +1, -1 | 494 | $1.2 \times 10^{-8}$ | 0 | 0, 0 | +1, -1 | 1/2 | +1/2, -1/2 | $K^+ \to \mu^+ + \nu_\mu$
$K^0$ meson, anti-$K^0$ meson | $K^0$, $\tilde{K}^0$ | 0, 0 | 498 | $K^0_S$: $0.86 \times 10^{-10}$; $K^0_L$: $5.38 \times 10^{-8}$ | 0 | 0, 0 | +1, -1 | 1/2 | -1/2, +1/2 | $K^0_S \to 2\pi$; $K^0_L \to \pi\mu\nu$, $3\pi$, $\pi e \nu$
$\eta^0$ meson | $\eta^0$ | 0 | 549 | $2.4 \times 10^{-19}$ | 0 | 0 | 0 | 0 | 0 | $\eta^0 \to 3\pi^0$, $\pi^+\pi^-\pi^0$
Baryons | | | | | | | | | |
Nucleons: proton and antiproton | $p$, $\tilde{p}$ | +1, -1 | 938.2 | Stable | 1/2 | +1, -1 | 0, 0 | 1/2 | +1/2, -1/2 |
neutron and antineutron | $n$, $\tilde{n}$ | 0, 0 | 939.6 | $0.93 \times 10^{3}$ | 1/2 | +1, -1 | 0, 0 | 1/2 | -1/2, +1/2 | $n \to p + e^- + \tilde{\nu}_e$
Hyperons: $\Lambda^0$ hyperon and anti-$\Lambda^0$ hyperon | $\Lambda^0$, $\tilde{\Lambda}^0$ | 0, 0 | 1116 | $2.5 \times 10^{-10}$ | 1/2 | +1, -1 | -1, +1 | 0 | 0, 0 | $\Lambda^0 \to p + \pi^-$
$\Sigma^+$ hyperon and anti-$\Sigma^+$ hyperon | $\Sigma^+$, $\tilde{\Sigma}^-$ | +1, -1 | 1189 | $0.8 \times 10^{-10}$ | 1/2 | +1, -1 | -1, +1 | 1 | +1, -1 | $\Sigma^+ \to N + \pi$
$\Sigma^-$ hyperon and anti-$\Sigma^-$ hyperon | $\Sigma^-$, $\tilde{\Sigma}^+$ | -1, +1 | 1197 | $1.5 \times 10^{-10}$ | 1/2 | +1, -1 | -1, +1 | 1 | -1, +1 | $\Sigma^- \to n + \pi^-$
$\Sigma^0$ hyperon and anti-$\Sigma^0$ hyperon | $\Sigma^0$, $\tilde{\Sigma}^0$ | 0, 0 | 1192 | $< 10^{-14}$ | 1/2 | +1, -1 | -1, +1 | 1 | 0, 0 | $\Sigma^0 \to \Lambda^0 + \gamma$
$\Xi^-$ hyperon and anti-$\Xi^-$ hyperon | $\Xi^-$, $\tilde{\Xi}^+$ | -1, +1 | 1321 | $1.7 \times 10^{-10}$ | 1/2 | +1, -1 | -2, +2 | 1/2 | -1/2, +1/2 | $\Xi^- \to \Lambda^0 + \pi^-$
$\Xi^0$ hyperon and anti-$\Xi^0$ hyperon | $\Xi^0$, $\tilde{\Xi}^0$ | 0, 0 | 1315 | $3 \times 10^{-10}$ | 1/2 | +1, -1 | -2, +2 | 1/2 | +1/2, -1/2 | $\Xi^0 \to \Lambda^0 + \pi^0$
$\Omega^-$ hyperon and anti-$\Omega^-$ hyperon | $\Omega^-$, $\tilde{\Omega}^+$ | -1, +1 | 1672 | $1.3 \times 10^{-10}$ | 3/2? | +1, -1 | -3, +3 | 0 | 0, 0 | $\Omega^- \to \Xi^- + \pi^0$, $\Lambda^0 + K^-$

43.1.3 A structureless charged elementary particle has an infinite
potential (16.2.6) and infinite potential energy. Present-day
particle physicists are working out various methods of introducing
nonpoint-like interaction. These methods eliminate the
"infinity" of point elementary particles, and introduce definite
structures for them without contradicting relativity theory.

It follows from the indeterminacy relations (38.6.2) that a
study of the structure of elementary particles requires the
investigation of processes taking place at the extremely short distances
$\delta$ to which the momentum indeterminacy $\Delta p \ge \hbar/\delta$ corresponds.
The shorter the distances $\delta$ associated with the structure of
elementary particles, the greater the momentum $p$ of the particles
must be, because it cannot be less than $\Delta p$. Consequently,
the structure of elementary particles can be cleared up only by
research involving high energies. For this reason, particle physics,
the new branch of physics engaged in the study of elementary
particles, is now more commonly called high-energy physics.
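
The link between short distances and high energies can be made
quantitative. The sketch below evaluates the minimum momentum
$\Delta p \approx \hbar/\delta$, expressed as $pc$ in MeV, for a few
distances in femtometres. The value $\hbar c \approx 197.3$ MeV·fm is
a standard constant assumed here, and the function name is
hypothetical.

```python
HBAR_C_MEV_FM = 197.327  # hbar * c in MeV * fm (assumed standard value)


def min_momentum_mev(delta_fm):
    """Minimum p*c in MeV needed to resolve a distance delta (in fm)."""
    return HBAR_C_MEV_FM / delta_fm


if __name__ == "__main__":
    for delta in (1.0, 0.1, 0.01):
        print(f"delta = {delta:5.2f} fm: pc >= {min_momentum_mev(delta):.0f} MeV")
```

Resolving a hundredth of a femtometre already calls for momenta of
about 20 GeV/c, which is why such studies require accelerators.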


43.2  Classification  of  Elementary  Particles  and  Their 
Interaction 

43.2.1 There are several groups of elementary particles which
differ in their properties and the nature of the interaction between
one another.

The  main  classification  of  elementary  particles  is  based  on 
their  various  rest  masses:  distinction  is  made  between  leptons 
(light  particles),  mesons  (intermediate  particles)  and  baryons 
(heavy  particles),  with  the  last  being  further  subdivided  into 
nucleons  and  hyperons. 

43.2.2 As to their electric charge, measured in units of the
elementary charge $e$ (14.1.2), particles may be positive, negative
or neutral. The existence of particles with fractional electric
charges, called quarks, has been predicted.

With respect to their spin (26.1.3), elementary particles are
subdivided into ones with integer, half-integer and zero spin
(in units of $\hbar$). Depending upon their spin, assemblies of particles
comply with different quantum statistics (41.2.1 and
41.2.4).

43.2.3 Elementary particles are either stable or unstable. This
classification characterizes their lifetime and the dominant
decay mode. The characteristics of elementary particles are listed
in Table 43.1. The rest masses of the particles are given in
energy units, as is now customary practice, on the basis of the
relation $E = mc^2$ (5.7.2). Since the rest mass of the electron
is approximately 0.5 MeV, the mass of a particle in electron
mass units can be found by multiplying its mass in MeV by
two.

43.2.4 In addition to the elementary particles listed in Table 43.1,
there are also a great many particles called resonances
(resonons or resonance particles). Their lifetimes are characteristic
of strong interaction (43.2.5); they have definite properties,
energies and momenta, and definite decay modes. These
features enable resonances to be regarded as special particles.

43.2.5 Mainly three of the four known kinds of interaction exist
between elementary particles: strong, electromagnetic and weak.
Gravitational interaction between elementary particles is usually
not considered. Each of the interactions is characterized by
its interaction coupling constant, determining its comparative
magnitude, and its characteristic interaction time (time constant
of interaction).

43.2.6 Strong interactions refer to processes involving baryons,
as well as pions and kaons (Table 43.1). Strong interactions
result from nuclear forces between nucleons (42.3.1), as well
as the processes of creation and decay of mesons and hyperons
in nuclear reactions at high energies. Strong interaction is
characterized by the Fermi constant $\beta = f^2/\hbar c \approx 1$, where the
constant $f$ has the dimension of an electric charge. Processes that
display strong interaction are said to be fast and have a characteristic
nuclear time $t_n$. In order of magnitude this time equals
the Compton wavelength of the nucleon $\hbar/m_p c$ (where $m_p$ is the
mass of the nucleon), divided by the velocity of light in free
space, i.e. $\hbar/m_p c^2$.
Particles that take part in strong interactions are called hadrons
(large and heavy particles).

43.2.7  Electromagnetic  interactions  characterize  processes  that 
take  place  between  particles  that  have  electric  charges.  For 
instance,  Coulomb  repulsion  in  nuclei,  and  the  processes  of 
creation and annihilation of electron-positron pairs are associated
with electromagnetic interaction. A measure of this
interaction is a dimensionless constant called the fine-structure
constant: $\alpha = e^2/\hbar c \approx 1/137$. It has been thus named because $\alpha$
determines the magnitude of fine splitting of the atom’s energy
level into a number of sublevels. This is caused by the effect
of the relativistic dependence of mass on velocity (5.6.1) on
the electron’s energy in the atom. Processes accompanying
electromagnetic interaction are called electromagnetic processes
and they proceed during a time of the order of $10^{-20}$ to $10^{-18}$ s
in accordance with the fact that $\alpha/\beta \approx 10^{-2}$ (43.2.6).
Electromagnetic interaction is the only one of the interactions of the
microcosm that is also manifested in macroscopic phenomena
and processes.
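The value 1/137 can be checked numerically. The sketch below uses the SI form of the same constant, $\alpha = e^2/(4\pi\varepsilon_0\hbar c)$, which is equivalent to the Gaussian expression $e^2/\hbar c$ used in the text. The numerical constants are CODATA values assumed here; the handbook quotes only the result.

```python
import math

# CODATA values (assumed here; the handbook quotes only the result 1/137).
E_CHARGE = 1.602176634e-19    # elementary charge, C
HBAR = 1.054571817e-34        # reduced Planck constant, J*s
C = 2.99792458e8              # speed of light in free space, m/s
EPS0 = 8.8541878128e-12       # electric constant, F/m


def fine_structure_constant():
    """alpha = e^2 / (4 pi eps0 hbar c), the SI form of e^2 / (hbar c)."""
    return E_CHARGE**2 / (4.0 * math.pi * EPS0 * HBAR * C)


alpha = fine_structure_constant()
print(f"alpha = {alpha:.6f}, 1/alpha = {1.0 / alpha:.2f}")
```

With $\beta \approx 1$ for strong interaction, the ratio $\alpha/\beta$ is then of the order of $10^{-2}$, as stated above.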

TABLE 43.2

Interaction       Relative strength   Interaction time, s
Strong            1                   about $10^{-22}$
Electromagnetic   1/137               $10^{-20}$ to $10^{-18}$
Weak              $10^{-14}$          $10^{-10}$ to $10^{-8}$

43.2.8 Weak interactions are processes involving leptons (Table 43.1).
Examples of such processes are beta decay of nuclei
(42.6.1), and interaction between nuclei and muons, electrons
(or positrons), and neutrinos (or antineutrinos).

Weak interaction is characterized by the weak interaction constant
$G$, having the dimension of the inverse square of mass.
The dimensionless weak interaction constant is $g^2 = G m_p^2 \approx 10^{-5}$,
where $m_p$ is the mass of the proton. Weak interaction is the
slowest of the interactions manifested in the microcosm and
proceeds during a time of the order of $10^{-10}$ s. A comparison of
the various kinds of interaction between elementary particles
is given in Table 43.2.
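The order of magnitude of the dimensionless weak constant can be reproduced in units where $\hbar = c = 1$, so that $G$ is expressed in GeV$^{-2}$ and the proton mass in GeV. Both numerical values in the sketch below are reference values assumed for illustration; they are not quoted in this handbook.

```python
# Dimensionless weak-interaction constant g^2 = G * m_p^2 in units where
# hbar = c = 1. Both numbers are assumed reference values, not handbook data.
G_FERMI = 1.1663787e-5    # weak (Fermi) coupling constant, GeV^-2
M_P_GEV = 0.93827         # proton rest energy, GeV


def weak_coupling(g_fermi=G_FERMI, mass_gev=M_P_GEV):
    """Return the dimensionless product G * m^2."""
    return g_fermi * mass_gev**2


g2 = weak_coupling()
print(f"g^2 = {g2:.2e}")
```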


43.3 Certain Information on Various
Elementary Particles

43.3.1  Various  kinds  of  elementary  particles  (Table  43.1)  are 
produced  in  the  interaction  of  cosmic  rays  with  the  nuclei  of 
the  nitrogen  and  oxygen  atoms  in  the  atmosphere.  Since  the 
development  of  modern  accelerators,  the  creation,  mutual  trans¬ 
formations  and  structure  of  elementary  particles  are  investigated 
in  the  laboratory. 

43.3.2  The  feasibility  of  creating  new  particles  in  the  interac¬ 
tion  between  protons  and  nuclei  follows  from  the  relation  between 
mass and energy (5.7.2). When an energy of $10^4$ GeV is imparted
to the proton, approximately $10^4$ times more than its rest energy
$m_p c^2$ (where $m_p$ is the rest mass of the proton and c is the velocity
of  light  in  free  space),  a  collision  between  the  proton  and  a 
nucleus  leads  to  splitting  of  the  latter.  High  kinetic  energy  is 
imparted  to  the  fission  products  and  new  particles  are  created, 
some  having  a  rest  mass  and  others  without  one. 

43.3.3 Muons (mu mesons) are the electrically charged particles
$\mu^+$ and $\mu^-$ with a rest mass equal approximately to 200 $m_e$,
where $m_e$ is the rest mass of the electron. The main characteristics
of muons are listed in Table 43.1. Muons are not stable,
and decay according to the modes:

$\mu^+ \to {}_{+1}^{0}e + {}_0^0\nu_e + {}_0^0\tilde{\nu}_\mu$ and $\mu^- \to {}_{-1}^{0}e + {}_0^0\tilde{\nu}_e + {}_0^0\nu_\mu$,

where ${}_{-1}^{0}e$ and ${}_{+1}^{0}e$ are the electron and positron, ${}_0^0\nu_e$ and ${}_0^0\tilde{\nu}_e$
are the electron neutrino and antineutrino, and ${}_0^0\nu_\mu$ and ${}_0^0\tilde{\nu}_\mu$
are the muon neutrino and antineutrino.

The  muon  neutrino  (or  antineutrino)  is  a  particle  having  nei¬ 
ther  rest  mass  nor  electric  charge,  and  which  is  emitted  in  the 
creation  and  decay  of  mesons.  These  particles  differ  from  the 
neutrino  (or  antineutrino)  emitted  together  with  electrons  in 
beta-decay processes (42.6.1). Therefore, the two types of neutrino
(and antineutrino) are designated by different symbols:
${}_0^0\nu_e$ (${}_0^0\tilde{\nu}_e$) and ${}_0^0\nu_\mu$ (${}_0^0\tilde{\nu}_\mu$). The spin of the mu mesons equals $\pm\hbar/2$
and the muon decay reaction complies with the conservation
laws (42.9.2). The interaction of muons with nuclei is an example
of weak interaction (43.2.8), and muons are nuclearly inactive
particles.

43.3.4 Pions (pi mesons) are neutral ($\pi^0$) and electrically charged
($\pi^+$ and $\pi^-$) particles with a rest mass of approximately 300 $m_e$,
where $m_e$ is the rest mass of the electron. The main characteristics
of pions are listed in Table 43.1. Pions, like muons, are
unstable and decay according to the modes:

$\pi^+ \to \mu^+ + {}_0^0\nu_\mu$, $\pi^- \to \mu^- + {}_0^0\tilde{\nu}_\mu$ and $\pi^0 \to \gamma + \gamma$.

In these decay reactions $\mu^+$ and $\mu^-$ are positive and negative
muons, ${}_0^0\nu_\mu$ and ${}_0^0\tilde{\nu}_\mu$ are the muon neutrino and antineutrino,
and $\gamma$ is a gamma-ray quantum (photon). Shown schematically
in Fig. 43.1 is the sequence for the ($\pi$-$\mu$-e) decays. Pions are spinless
particles. This follows from their decay reactions and the
requirement that these reactions comply with the conservation
laws.

Fig. 43.1

Fig. 43.2

43.3.5 Pions are formed in cosmic rays as the result of the disintegration
of the nuclei of atmospheric gas atoms by fast
protons and alpha particles. To produce pions the energy of
the protons should be about 300 MeV.

The arrangement for obtaining pions in the laboratory is illustrated
in Fig. 43.2. Target A of beryllium or carbon is bombarded
by fast protons ${}_1^1p$. This knocks pions out of the target
at arbitrary angles. The magnetic field of the accelerator causes
the ejected pions to follow circular paths whose radii are determined
by the velocities of the pions. Pions ejected in the forward
direction are divided into two beams, the negative ones
being deflected outside the accelerator chamber and the positive
ones inside. The opposite is true for pions ejected in the backward
direction (not shown in Fig. 43.2).

The masses of charged pions are determined by their energies
and  momenta  by  methods  employing  the  deflection  of  particles 
in  magnetic  and  electric  fields.  The  lifetime  of  pions  (Table  43.1) 
is  established  by  measuring  the  time  intervals  between  the 
instant  of  creation  of  a  pion  and  its  instant  of  decay  (43.3.4). 
Pions  are  nuclearly  active  particles.  Their  interaction  with 
nuclei  is  an  example  of  strong  interaction  (43.2.6). 

Similar methods are applied for investigating other charged
particles. Since neutral particles are not deflected in a magnetic
field, they are investigated by means of the conservation laws
and interaction reactions of the given particle with other particles.
For example, the reaction of a negative pion with a proton
proceeds according to the mode

$\pi^- + {}_1^1p \to \pi^0 + {}_0^1n$.

The negative pion is transformed into a neutral one, and the
proton into a neutron. The neutral pion $\pi^0$ decays into two gamma-ray
quanta (43.3.4). The mass of the neutral pion (Table 43.1)
is determined from the known masses and energies of the proton,
neutron and negative pion, from the energy of gamma-ray quanta,
as well as from data on the momenta of particles in these
transformations.

43.3.6 The principle of applying the law of conservation of
momentum for studying the transformations of elementary particles
can be illustrated by the example of the $\beta^-$ decay in which
an electron antineutrino ${}_0^0\tilde{\nu}_e$ is created (42.6.1). If only a single
electron were ejected by the nucleus in a $\beta^-$ decay, the nucleus
would be subject to recoil in the direction directly opposite to
that of the ejected electron. The momentum of the nucleus
that was stationary before the decay reaction would be equal
in magnitude but opposite in direction to that of the ejected
electron. If, however, in addition to the electron, the nucleus
also ejects an antineutrino, the vector sum of the three momenta,
that of the electron, antineutrino and recoil nucleus, should,
according to the law of conservation of momentum (2.7.1),
be equal to zero, as before the decay reaction (Fig. 43.3).
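The momentum balance in this three-body decay can be sketched numerically. In the example below the electron and antineutrino momenta are made-up illustrative numbers (in MeV/c), and the function names are hypothetical; the recoil momentum of the initially stationary nucleus is fixed by requiring the vector sum to vanish.

```python
# Recoil momentum of a nucleus at rest that emits an electron and an
# antineutrino. The input momenta are made-up illustrative numbers in MeV/c.

def recoil_momentum(p_electron, p_antineutrino):
    """Return the nuclear recoil momentum that makes the vector sum zero."""
    return tuple(-(pe + pv) for pe, pv in zip(p_electron, p_antineutrino))


def vector_sum(*momenta):
    """Component-wise sum of any number of momentum vectors."""
    return tuple(sum(components) for components in zip(*momenta))


p_e = (0.8, 0.0, 0.0)      # hypothetical electron momentum
p_nu = (-0.3, 0.5, 0.0)    # hypothetical antineutrino momentum
p_recoil = recoil_momentum(p_e, p_nu)

print("recoil momentum:", p_recoil)
print("total momentum: ", vector_sum(p_e, p_nu, p_recoil))
```

Because the antineutrino carries momentum, the recoil is in general not directed opposite to the electron; this is what distinguishes the three-body case from the two-body case described above.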

43.3.7 Kaons, or K mesons, are neutral ($K^0$) and charged ($K^+$
and $K^-$) particles with rest masses close to 1000 $m_e$, where $m_e$
is the rest mass of the electron. Like other mesons, kaons are
unstable, but, in contrast to the decay reactions of pions and
muons (43.3.3 and 43.3.4), the reaction channels of their decays
are not unambiguous. As a result of kaon decay, pions, muons and
neutrinos (or antineutrinos) of both types are formed. Like pions,
kaons are spinless. All of these particles (pions and kaons) are
united in a single group, the mesons (Table 43.1).

43.3.8 A large group of particles, called hyperons, have rest masses
greater than that of nucleons, and range approximately from
2183 $m_e$ for the lambda-zero hyperon ($\Lambda^0$) to 3280 $m_e$ for the
omega-minus hyperon ($\Omega^-$). Hyperons are unstable (see
Table 43.1). The lighter hyperons decay into nucleons and pions.
The heavier hyperons decay into lighter hyperons, pions and
kaons. All hyperons have a spin equal to $\hbar/2$, except the $\Omega^-$
hyperon, whose spin is equal to $3\hbar/2$. Nucleons (protons and neutrons)
and hyperons are related particles and they belong to
a single class of heavy particles called baryons (Table 43.1).

43.4  Certain  Conservation  Laws  in 
Elementary-Particle  Physics 

43.4.1 The differences in the rest masses of charged $\pi^\pm$ and the
neutral $\pi^0$ pions, like the difference in the rest masses of the
two states of nucleons (proton and neutron) in the nucleus,
are the result of charge independence of nuclear forces (42.3.3)
and of the presence of electromagnetic interactions in addition
to strong nuclear interactions. With no electromagnetic interactions
in the nucleus, charge independence of nuclear forces
would lead to a condition in which the proton and neutron have
equal rest mass, and the masses of all types of pions coincide.
When the additional electromagnetic interaction is taken into
account, the double state of the nucleon (proton-neutron doublet)
is split into two different states in accordance with their rest
masses. The masses of the neutron and proton are found to differ.
For the same reason, the masses of the particles that make up
the pion triplet are also not equal to one another. The masses
of the charged $\pi^\pm$ pions exceed that of the $\pi^0$ pion (Table 43.1).

43.4.2 Introduced to characterize the multiplicity of states of
certain kinds of particles are the concepts of isotopic spin $T$
(isospin) and the projection $T_\zeta$ of isotopic spin on a certain “axis”.
The introduction of these characteristics of elementary particles
makes it possible to formulate special features of the properties
and transformation of elementary particles that follow from
charge independence in strong interactions. Isotopic spin $T$
characterizes the number of particles in the given “charge
multiplet”; this number is equal to $(2T + 1)$. Thus the isotopic
spin of the nucleon equals $T = 1/2$, and that of the pion is
$T = 1$, in accordance with the number of particles (nucleons
and pions) in the doublet and triplet.

The projection $T_\zeta$ of isotopic spin of the nucleon is introduced
in such a way that it is related to the charge $Z$ (in e units) of
a particle that is a member of a charge doublet as follows:

$Z = T_\zeta + 1/2$.

For the neutron $T_\zeta = -1/2$, for the proton $T_\zeta = 1/2$.

For the triplet of pions, this relation is of a more complex nature
(43.4.3) and leads to the situation in which the projection $T_\zeta$
of isotopic spin is equal to the charge of the particle (in e units):
$T_\zeta = 0$ for the $\pi^0$ meson, $T_\zeta = 1$ for the $\pi^+$ meson and
$T_\zeta = -1$ for the $\pi^-$ meson.

The  law  of  conservation  of  isotopic  spin  states  that  the  total  iso¬ 
topic  spin  T  of  all  the  particles  of  an  isolated  system  remains 
constant  through  all  transformations  caused  by  charge-inde¬ 
pendent  strong  interactions. 

43.4.3 The concept of a baryon or nuclear (nucleon) charge $B$
can be introduced for all elementary particles. For baryons
(Table 43.1) $B = +1$, for antibaryons $B = -1$, whereas for
particles that do not belong to the baryon class $B = 0$.

The law of conservation of baryon charge states that in all nuclear
transformations in an isolated system the total baryon charge
remains constant.

This law, like the law of conservation of electric charge, is
valid both in strong and in electromagnetic interactions. Making
use of the baryon charge $B$, the relation between the electric
charge $Z$ and the projection of isotopic spin $T_\zeta$ of a particle in
the charge multiplets of nucleons and pions can be written as

$Z = T_\zeta + B/2$.

For nucleons (proton and neutron) $B = 1$ and $Z = T_\zeta + 1/2$
(43.4.2). For pions $B = 0$ and $Z = T_\zeta$.

Since $Z$ and $B$ are conserved both in strong and electromagnetic
interactions, $T_\zeta$ is also conserved in these interactions, whereas
$T$ is conserved only in strong interactions.

43.4.4  The  investigations  of  kaons  and  hyperons  (Table  43.1) 
revealed  certain  of  their  unusual  properties.  The  creation  of 
these  particles  is  brought  about  by  strong  interaction,  and  the 
time  for  their  creation  corresponds  to  nuclear  time  (43.2.6). 
At  the  same  time,  when  they  decay  into  nuclearly  active  pions, 
kaons  have  a  lifetime  typical  of  weak  interaction  (Table  43.2). 
Moreover,  it  was  established  that  kaons  and  hyperons  are  al¬ 
ways  created  in  pairs,  but  not  in  arbitrary  combinations.  Fi¬ 
nally,  a  substantial  difference  was  found  in  the  conditions  for 
the  creation  of  kaons  and  the  reactions  of  their  interaction  with 
other  particles.  These  features  made  it  necessary  to  consider 
kaons  and  hyperons  to  be  strange  particles.  A  special  character¬ 
istic  of  elementary  particles  was  introduced  to  explain  the 
behaviour  of  these  strange  particles.  It  is  called  strangeness  S 
and is nonzero only for strange particles ($S = \pm 1, \pm 2, \ldots$)
and zero for “ordinary” particles. For strange particles the relation
in Sect. 43.4.3 is of the form

$Z = T_\zeta + \frac{B + S}{2}$.


From  the  conservation  law  formulated  in  Sect.  43.4.3  follows 
the  law  of  conservation  of  strangeness  which  states  that  the  total 
strangeness  of  an  isolated  system  remains  constant  in  strong 
and  electromagnetic  interactions.* 
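The charge relation for strange and ordinary particles can be checked against a few familiar cases. The quantum-number assignments in the sketch below are standard values entered here as assumptions (Table 43.1 is not reproduced in this section), and the function name `charge` is introduced only for illustration.

```python
from fractions import Fraction as F

# (isospin projection, baryon charge B, strangeness S, electric charge Z in e)
# The quantum numbers are standard assignments, listed here as assumptions;
# the handbook's Table 43.1 is not reproduced in this section.
PARTICLES = {
    "p":       (F(1, 2),  1,  0,  1),
    "n":       (F(-1, 2), 1,  0,  0),
    "pi+":     (F(1),     0,  0,  1),
    "pi0":     (F(0),     0,  0,  0),
    "pi-":     (F(-1),    0,  0, -1),
    "K+":      (F(1, 2),  0,  1,  1),
    "K0":      (F(-1, 2), 0,  1,  0),
    "Lambda0": (F(0),     1, -1,  0),
    "Omega-":  (F(0),     1, -3, -1),
}


def charge(t_proj, baryon, strangeness):
    """Z = T_proj + (B + S) / 2, in units of e."""
    return t_proj + F(baryon + strangeness, 2)


for name, (t_proj, b, s, z) in PARTICLES.items():
    print(f"{name:8s} Z = {charge(t_proj, b, s)} (tabulated {z})")
```

For the ordinary particles ($S = 0$) the function reduces to the relation of Sect. 43.4.3, so the same check covers nucleons and pions.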


* The information given in this section does not cover all the
conservation laws in high-energy physics. But information on the
other laws is beyond the scope of this handbook.

43.5 Antiparticles

43.5.1 The majority of elementary particles have corresponding
antiparticles. Examples of particles and antiparticles are: the
electron ${}_{-1}^{0}e$ and positron ${}_{+1}^{0}e$, muons $\mu^+$ and $\mu^-$, pions $\pi^+$ and $\pi^-$,
kaons $K^+$ and $K^-$, the electron and muon neutrinos, ${}_0^0\nu_e$ and
${}_0^0\nu_\mu$, and antineutrinos, ${}_0^0\tilde{\nu}_e$ and ${}_0^0\tilde{\nu}_\mu$. The rest masses, spins
and lifetimes of particles and their antiparticles are the same.
The electric charges of particles and their antiparticles are
the same in magnitude, but opposite in sign.

Particles whose properties are completely identical with those
of their antiparticles are called truly neutral particles. They
include the photon and the neutral mesons, $\pi^0$ and $K^0$.

43.5.2 The existence in nature of pairs of charged particles
and their antiparticles expresses the principle of charge conjugation,
which states that each charged elementary particle
must have an antiparticle. In accordance with this principle,
the proton has an antiparticle, the antiproton ${}_{-1}^{1}\tilde{p}$ (Table 43.1).
The principle of charge conjugation extends to the neutral
particles, neutron and neutrino, as well. There is the antineutron
${}_0^1\tilde{n}$, as well as the electron antineutrino ${}_0^0\tilde{\nu}_e$ and muon
antineutrino ${}_0^0\tilde{\nu}_\mu$.

43.5.3 When a particle combines with its antiparticle, the
energy released is not less than twice the rest energy of each of
them. The formation of a “particle-antiparticle” pair requires
an energy input that exceeds twice the rest energy of each particle.
This is due to the necessity to impart momentum and kinetic
energy to the pair being produced.

43.5.4 The process of producing an electron-positron pair occurs
in the collision of a hard gamma-ray quantum with some kind
of charged particle, for instance, with an electron on the shell
of an atom or with an atomic nucleus*. This process proceeds
according to the reaction $\gamma \to {}_{-1}^{0}e + {}_{+1}^{0}e$ and is feasible with
a photon energy not less than $2m_e c^2$, i.e. $h\nu \geq 2m_e c^2 = 1.022$
MeV, where $m_e c^2 = 0.511$ MeV is the rest energy of the electron
(or positron). It follows from the reaction for producing a pair
by the action of a gamma-ray quantum (photon) that the spin
of the photon should equal 0 or 1 (in units of $\hbar$), because the
spins of electron and positron are each equal to $\hbar/2$. A great
many experimentally established facts and a great deal of theoretical
investigation lead to the conclusion that the spin of the
photon is equal to $\hbar$.

* The need for a third particle is due to the fact that in producing
two particles with the rest mass $m_0 \neq 0$ from a photon, the
total momentum of the two particles is less than the momentum
$h\nu/c$ of the photon. Hence, a third particle is required to take
up part of the momentum of the photon.
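The footnote's statement that the pair alone cannot take up the photon's momentum can be made explicit. In the sketch below $E_1$, $E_2$ and $\mathbf{p}_1$, $\mathbf{p}_2$ denote the total energies and momenta of the two particles; these symbols are introduced here for the argument and do not appear in the handbook.

```latex
\begin{align*}
h\nu &= E_1 + E_2, \qquad
E_i = \sqrt{p_i^2 c^2 + m_0^2 c^4} > p_i c \quad (m_0 \neq 0), \\
|\mathbf{p}_1 + \mathbf{p}_2| &\le p_1 + p_2
  < \frac{E_1 + E_2}{c} = \frac{h\nu}{c}.
\end{align*}
```

If the photon's energy goes entirely into the pair, the pair's total momentum is therefore strictly smaller than $h\nu/c$, and conservation of momentum requires a third body to absorb the remainder.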

43.5.5 The process of uniting an electron and a positron is called
positron-electron annihilation. In the great majority of cases,
the process results in the production of two gamma-ray quanta:

${}_{-1}^{0}e + {}_{+1}^{0}e \to 2\gamma$.

Before combining the particles ${}_{-1}^{0}e$ and ${}_{+1}^{0}e$, the total momentum
of the two particles, in the coordinate system fixed in the centre
of mass (2.3.3) of the electron-positron system, was equal to
zero. After annihilation of the pair, two photons are produced
whose momenta are in opposite directions, and their total momentum
should be equal to zero*. Each photon carries away
energy equal to $h\nu = m_e c^2 = 0.511$ MeV.

43.5.6 The minimum energy required to produce a proton-antiproton
pair in a coordinate system where one nucleon is at
rest is $6m_p c^2 = 5.6$ GeV. This energy is reduced to 4.3 GeV
when the ${}_1^1p$-${}_{-1}^{1}\tilde{p}$ pair is actually produced. Considerably more
energy is required to produce a hyperon-antihyperon pair than
for a pair of nucleons. This is due to the comparatively large
mass of the hyperons. For example, the production of the lightest
antihyperon, the antilambda-zero hyperon ($\tilde{\Lambda}^0$), requires an
energy from 5.3 to 5.8 GeV.
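The figure $6m_pc^2$ follows from relativistic kinematics for a free nucleon target. The sketch below assumes the reaction p + p → p + p + p + antiproton (the simplest channel that conserves baryon charge) and uses the invariant-mass condition at threshold; the function name and the numerical proton rest energy are assumptions introduced for illustration.

```python
M_P = 0.93827  # proton rest energy in GeV (assumed reference value)


def threshold_kinetic_energy(m_beam, m_target, final_masses):
    """Minimum beam kinetic energy (GeV) for a target at rest.

    At threshold all final particles are at rest in the centre-of-mass
    frame, so the invariant mass equals the sum of the final rest masses:
    m_beam^2 + m_target^2 + 2 * m_target * E_beam = (sum of final masses)^2.
    """
    m_final = sum(final_masses)
    return (m_final**2 - (m_beam + m_target)**2) / (2.0 * m_target)


# p + p -> p + p + p + antiproton (baryon charge is conserved: 2 = 3 - 1)
t_min = threshold_kinetic_energy(M_P, M_P, [M_P] * 4)
print(f"T_min = {t_min:.2f} GeV = {t_min / M_P:.1f} m_p c^2")
```

The same function applies to any final state once the rest energies are supplied; it describes a free target nucleon only and does not model the lower value quoted above for actual production.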

43.5.7 A feature of antiparticles is their capacity to combine
rapidly with their corresponding particles, for instance, positrons
with electrons, antiprotons with protons, and antineutrons
with neutrons. The reason is that the matter of which all
of nature surrounding us is built up consists of electrons, protons
and neutrons. Antiparticles, artificially produced, inevitably
meet their much more numerous particles in substances,
combine with them and cease to exist, giving rise to the production
of new particles.

* Annihilation with the production of three photons, complying
with the laws of conservation of momentum and energy, is
also possible.

43.5.8 The difference between neutral particles and their antiparticles
is established by the nature of the interactions of the
particles and antiparticles with substances. Examples are the
interaction reactions of the antineutrino ${}_0^0\tilde{\nu}_e$ with protons ${}_1^1p$
and the neutrino ${}_0^0\nu_e$ with neutrons:

${}_0^0\tilde{\nu}_e + {}_1^1p \to {}_0^1n + {}_{+1}^{0}e$ and ${}_0^0\nu_e + {}_0^1n \to {}_1^1p + {}_{-1}^{0}e$ (*)

43.5.9 The difference between electron and muon neutrinos is
established by investigating the decays of charged pions (43.3.4).
If the neutrinos produced in these decays are separated out and
their capture by neutrons is accomplished, then instead of the
reaction (*) of Sect. 43.5.8, the process will be the following:

${}_0^0\nu_\mu + {}_0^1n \to {}_1^1p + \mu^-$,

where $\mu^-$ is the negative muon. This demonstrates the difference
between the electron and muon neutrinos (and antineutrinos).


43.6  Structure  of  the  Nucleon 

43.6.1 The structure of the nucleon (or any other elementary
particle) is understood to mean its extent in space and what it
consists of. The structure of any elementary particle cannot be
treated by itself. It is related to the structure and properties
of other particles. Special experiments for studying their structure
have been conducted only for nucleons*. Effective methods
for investigating the structure of nucleons are elastic collisions
of pions with protons and elastic collisions of fast electrons
with protons and neutrons. The first method showed that the
pion is only slightly deflected by the collision from its initial
direction, and the proton received only slight recoil, so that
the momentum $\Delta p$ transmitted to the proton is very small.
It follows from the indeterminacy relation (38.6.2) that the
process of the collision of a pion with a proton takes place in
a certain region of space with the linear dimensions $a \geq \hbar/\Delta p$,
where $a$ characterizes the size of the nucleon.

* The problem of the structure of elementary particles, as
well as attempts to establish certain fundamental particles which
all others are built up of are the frontiers of high-energy physics.
The present handbook contains only general information on the
structure of nucleons.

43.6.2 The emission and absorption of particles and antiparticles
proceed continuously in a nucleon. A nucleon is treated
as a complex, incessantly changing assembly of many particles.

At the central part of the nucleon (bare nucleon) is the nucleon
core having a radius of (0.2 to 0.4) $\times 10^{-16}$ m. In this region,
heavy particles, resonances and nucleon-antinucleon pairs, play
some special role that is not yet quite clear. The outer part of
the nucleon forms the pion cloud (Fig. 43.4). This concept of
nucleon structure enables the difference in masses of the neutron
and proton to be cleared up, as well as the difference of the
magnetic moment of the proton and neutron from the nuclear
magneton $\mu_N$. These differences are due to energy
of electromagnetic interaction between the “core” of the nucleon
and the pion cloud.

43.6.3 The scattering of fast electrons having energies up to
550 MeV by protons enabled the investigation of electric charge
density distribution in the proton as a function of the distance $r$
from the centre of the “core”. It is necessary here to take into
account the fact that the charge of the proton is indivisible and
is always manifested as a unified whole. Therefore, the distribution
of the electric charge in the proton does not imply the possibility
of experimentally separating out a definite part of this
charge. Shown in Fig. 43.5a is the dependence of the radial density
of charge distribution $\rho_q = dq/dr$ on $r$ (measured in fermi
units), where $dq$ is the charge contained in a spherical layer with
the radii $r$ and $r + dr$. The curve in the figure has a sharply
defined maximum at the distance $R_e = 0.77 \times 10^{-15}$ m, which
is called the “electric radius” of the proton. The area under the
curve is equal to the charge $e$ of the proton. The results obtained
in similar experiments in scattering fast electrons by neutrons
are given in Fig. 43.5b.

Fig. 43.4

Scattering proceeds as if the “electric radius” of the neutron
was equal to zero. In interaction with fast electrons the neutron
behaves as if its pion cloud coincides in dimensions with its
“core”. The electric charge is negative in the inner and outer regions
of the neutron; in the middle region it is positive. The
total charge of the neutron, equal to the area under the curve,
is zero.

Fig. 43.5

The structure of elementary particles is being intensively investigated
at the present time.

PART NINE APPENDICES


I.  SYSTEMS  OF  UNITS  OF  PHYSICAL  QUANTITIES 

1.1  A  unit  of  a  physical  quantity  is  an  arbitrarily  selected 
physical  quantity  having  the  same  physical  meaning  as  the  one 
being  considered.  A  system  of  units  is  the  whole  set  of  units, 
pertaining  to  a  certain  system  of  quantities  and  worked  out 
on  the  basis  of  accepted  rules.  The  base,  or  fundamental,  units  of 
a  given  system  are  the  units  of  several  dissimilar  physical  quan¬ 
tities,  arbitrarily  selected  in  establishing  the  system.  The  cor¬ 
responding  physical  quantities  are  called  base  quantities.  A  sy¬ 
stem  of  units  is  said  to  be  absolute  if  its  base  physical  quanti¬ 
ties  are  length,  mass  and  time. 

Derived  units  are  ones  established  from  other  units  of  the  given 
system  by  means  of  physical  laws  that  express  the  relations 
between  the  corresponding  physical  quantities. 

1.2  The  dimensional  formula  of  any  physical  quantity  is  an  expres¬ 
sion  characterizing  the  relations  of  the  units  of  this  physical 
quantity  with  the  base  units  of  the  given  system  of  units.  This 
expression  is  a  monomial  in  the  form  of  the  product  of  symbols 
representing  the  base  units  raised  to  the  corresponding  power 
(whole  or  fractional,  positive  or  negative).  A  physical  quantity 
is  said  to  be  dimensionless  if  in  its  dimensional  formula  all 
base  units  are  in  zero  power.  The  numerical  value  of  a  dimen¬ 
sionless  quantity  is  independent  of  the  choice  of  the  system  of 
units. 
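The monomial form of a dimensional formula lends itself to a simple representation: a mapping from base-quantity symbols to exponents. The sketch below is one possible illustration, with the symbols L, M and T standing for length, mass and time; the helper names are hypothetical and are not part of the handbook.

```python
from fractions import Fraction

# A dimensional formula as a mapping: base-quantity symbol -> exponent.
# L, M, T stand for length, mass and time.


def multiply(*dims):
    """Dimensions of a product: exponents of like base symbols add."""
    out = {}
    for dim in dims:
        for symbol, exp in dim.items():
            out[symbol] = out.get(symbol, 0) + Fraction(exp)
    # Drop base quantities that ended up in zero power.
    return {s: e for s, e in out.items() if e != 0}


def power(dim, exponent):
    """Dimensions of a quantity raised to a whole or fractional power."""
    return {s: Fraction(e) * Fraction(exponent) for s, e in dim.items()}


def is_dimensionless(dim):
    """True if every base quantity enters in zero power."""
    return not multiply(dim)


LENGTH, MASS, TIME = {"L": 1}, {"M": 1}, {"T": 1}
VELOCITY = multiply(LENGTH, power(TIME, -1))      # L T^-1
FORCE = multiply(MASS, LENGTH, power(TIME, -2))   # L M T^-2

print(VELOCITY, FORCE)
print(is_dimensionless(multiply(VELOCITY, power(VELOCITY, -1))))
```

Exponents are stored as `Fraction` objects so that fractional powers, which the definition above allows, are handled exactly.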

1.3  Prefixes  and  their  symbols  for  decimal  multiples  and  sub¬ 
multiples  are  listed  in  Table  1.1. 

1.4  Preferred  in  most  countries  today  is  the  International  Sys¬ 
tem  (SI),  which  is  based  on  the  application  of  seven  base  units: 
metre,  kilogram,  second,  ampere,  kelvin,  mole  and  candela, 
as well as two supplementary units, the radian and steradian.
The definitions and notation of these units are listed in Tables
1.2a and 1.2b.

1.5  Derived  units  of  the  International  System  have  been  worked 
out  according  to  equations  relating  the  physical  quantities, 
corresponding  to  the  simplest  cases  of  phenomena  and  bodies. 
The  SI  derived  units  for  electric  and  magnetic  quantities  have 
been  established  in  accordance  with  the  rationalized  form  of 
electromagnetic  field  equations  (Sects.  14.2.7  and  23.2.2). 
The  most  important  derived  units  are  listed  in  Table  1.3. 

1.6 In addition to SI units, it is permissible in certain cases
to use the CGS system (for example, in scientific papers in physics).
The CGS system makes use (within the scope of mechanics) of
three base units: the centimetre (cm) for length, the gram (g)
for mass, and the second (s) for time. There are also two supplementary
units: the radian (rad) for a plane angle and the
steradian (sr) for a solid angle. All of these units either coincide
with the corresponding SI units or are decimal fractions of them:
1 cm = $10^{-2}$ m and 1 g = $10^{-3}$ kg.

The  derived  CGS  units  for  mechanical  quantities  and  their  rela¬ 
tions  to  SI  units  are  listed  in  Table  1.4. 
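Because the CGS base units are decimal fractions of the SI ones, the SI value of any derived mechanical CGS unit follows from its dimensional formula. A minimal sketch, assuming the two familiar derived units dyne (force) and erg (energy) as examples; the function name is hypothetical.

```python
# Conversion factors of the CGS base units to SI, as stated in the text.
CM_IN_M = 1e-2   # 1 cm = 10^-2 m
G_IN_KG = 1e-3   # 1 g  = 10^-3 kg
S_IN_S = 1.0     # the second is common to both systems


def cgs_to_si_factor(length_power, mass_power, time_power):
    """SI value of one CGS unit with dimensions L^a M^b T^c."""
    return (
        CM_IN_M ** length_power
        * G_IN_KG ** mass_power
        * S_IN_S ** time_power
    )


dyne_in_newton = cgs_to_si_factor(1, 1, -2)   # force:  L M T^-2
erg_in_joule = cgs_to_si_factor(2, 1, -2)     # energy: L^2 M T^-2
print(dyne_in_newton, erg_in_joule)
```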

1.7  Three  systems  of  units  for  electrical  and  magnetic  quanti¬ 
ties,  based  on  the  CGS  units  for  mechanical  quantities,  are  used 
in  physics.  They  are  the  absolute  electrostatic  (CGS-electrostatic) 
system,  the  absolute  electromagnetic  (CGS-electromagnetic)  sys¬ 
tem  and  the  absolute  Gaussian,  or  symmetrical,  system  of  units. 

In  the  CGS-electrostatic  system,  the  proportionality  factor  k 
(14.2.4)  in  Coulomb’s  law  is  considered  to  be  dimensionless  and 
equal  to  unity.  Accordingly,  Coulomb’s  law  and  all  other  rela¬ 
tions  of  electrostatics  and  electrodynamics  are  not  written  in 
the  rationalized  form. 

In the CGS-electromagnetic system, the proportionality factor k
in the Biot-Savart-Laplace law (23.2.1) is considered to be dimensionless
and equal to unity. Hence, this law and all other relations
of electromagnetism are written in the unrationalized
form. The quantity c, indicating the number of units of electric
charge (or electric current) in the CGS-electrostatic system that
is equal to one unit of electric charge (or electric current) in the
CGS-electromagnetic system, is called the electrodynamic
constant. The electrodynamic constant is equal to the velocity of
light in free space: $c = 2.99792458 \times 10^{10}$ cm/s $\approx 3 \times 10^{10}$
cm/s. 
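
The role of c as a conversion factor between the two CGS systems can be illustrated numerically. The Python sketch below is only an illustration: the function and constant names are assumptions, and the SI value of the CGS-electromagnetic unit of charge (10 C) is inferred from the entry for electric charge in Table 1.5.

```python
# Electrodynamic constant: numerical value of the velocity of light in cm/s.
C_CM_PER_S = 2.99792458e10

# Assumption inferred from Table 1.5: one CGS-electromagnetic unit of
# charge corresponds to 10 coulombs.
EMU_CHARGE_IN_COULOMB = 10.0


def emu_to_esu_charge(q_emu):
    """Convert charge from CGS-electromagnetic to CGS-electrostatic units."""
    return q_emu * C_CM_PER_S


def esu_charge_to_coulomb(q_esu):
    """Convert charge from CGS-electrostatic units to coulombs (10/c per unit)."""
    return q_esu * EMU_CHARGE_IN_COULOMB / C_CM_PER_S


# One CGS-electrostatic unit of charge in coulombs, roughly 3.34e-10 C.
one_esu_in_coulomb = esu_charge_to_coulomb(1.0)
```

Under these assumptions, converting one CGS-electromagnetic unit of charge to CGS-electrostatic units and then to coulombs returns 10 C, which is a quick consistency check on the factor c.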
TABLE 1.1

Prefixes Used with SI Units

Factor    Prefix name   Symbol        Factor    Prefix name   Symbol
10^18     exa           E             10^-1     (deci)        d
10^15     peta          P             10^-2     (centi)       c
10^12     tera          T             10^-3     milli         m
10^9      giga          G             10^-6     micro         μ
10^6      mega          M             10^-9     nano          n
10^3      kilo          k             10^-12    pico          p
10^2      (hecto)       h             10^-15    femto         f
10^1      (deca)        da            10^-18    atto          a

Note. The prefixes enclosed in parentheses may be used only in the names of decimal multiples and fractions of units that are already widely used (for example, hectowatt, decalitre, decimetre, and centimetre).
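
The prefix table lends itself to a simple lookup. The following Python sketch shows one way to reduce a prefixed value to the unprefixed unit; the dictionary and function names are illustrative assumptions, and exact fractions are used so that negative powers of ten introduce no rounding.

```python
from fractions import Fraction

# Power of ten associated with each SI prefix symbol of Table 1.1.
SI_PREFIX_EXPONENTS = {
    "E": 18, "P": 15, "T": 12, "G": 9, "M": 6, "k": 3, "h": 2, "da": 1,
    "d": -1, "c": -2, "m": -3, "μ": -6, "n": -9, "p": -12, "f": -15, "a": -18,
}


def to_unprefixed(value, prefix_symbol):
    """Express a value given in a prefixed unit in the unprefixed unit."""
    exponent = SI_PREFIX_EXPONENTS[prefix_symbol]
    return Fraction(value) * Fraction(10) ** exponent


three_km_in_m = to_unprefixed(3, "k")     # 3 km expressed in metres
five_cm_in_m = to_unprefixed(5, "c")      # 5 cm expressed in metres
one_micro_in_base = to_unprefixed(1, "μ")  # 1 micro-unit in base units
```

For example, the sketch evaluates 3 km as 3000 m and 5 cm as 1/20 m.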


TABLE 1.2a

Base Units of the International System (SI)

Quantity             Dimensions  Name of   Unit    Definition
                                 unit      symbol

Length               L           metre     m       One metre is 1 650 763.73 wavelengths in vacuum
                                                   of the radiation corresponding to the transition
                                                   between electronic energy levels 2p_10 and 5d_5
                                                   of the krypton-86 atom

Mass                 M           kilogram  kg      One kilogram is the mass of the platinum standard
                                                   kept at the International Bureau of Weights and
                                                   Measures at Sèvres, France

Time                 T           second    s       One second is the duration of 9 192 631 770
                                                   periods of the radiation corresponding to the
                                                   transition between two hyperfine levels of the
                                                   ground state of the cesium-133 atom

Electric current     I           ampere    A       One ampere is the intensity of constant current
                                                   which, when maintained in two parallel straight
                                                   conductors of infinite length and negligible
                                                   cross section placed one metre apart in a vacuum,
                                                   would produce between them a force equal to
                                                   2 × 10^-7 N per metre of length

Thermodynamic        Θ           kelvin    K       One kelvin is equal to the fraction 1/273.16 of
temperature                                        the thermodynamic temperature of the triple point
                                                   of water

Amount of            N           mole      mol     One mole is the amount of substance of a system
substance                                          that contains as many elementary units (atoms,
                                                   molecules, ions, electrons, other particles, or
                                                   specified groups of particles) as there are atoms
                                                   in 0.012 kilogram of the pure nuclide carbon-12

Luminous             J           candela   cd      One candela is the luminous intensity, in the
intensity                                          perpendicular direction, of a surface of
                                                   1/600 000 square metre of a blackbody radiator at
                                                   the temperature of freezing platinum under a
                                                   pressure of 101 325 newtons per square metre,
                                                   i.e. 101 325 Pa

Most extensively used in physics is the CGS-Gaussian, or symmetrical, system of units, in which the units for measuring all electrical quantities are the same as in the CGS-electrostatic system and those for measuring magnetic quantities are the same as in the CGS-electromagnetic system. In the Gaussian system, as in the CGS-electrostatic system, the proportionality factor k (14.2.4) in Coulomb’s law is dimensionless and equal to unity, whereas the proportionality factor k in the Biot-Savart-Laplace law (23.2.1) is taken equal to k = 1/c, where c is the electrodynamic constant (23.2.2).
1.8 Derived CGS units (Gaussian system) for electrical quantities are listed in Table 1.5, and for magnetic quantities, in Table 1.6.

1.9  Off-system,  or  arbitrary,  units,  permitted  for  use  in  physics 
and  astronomy,  are  listed  in  Table  1.7. 

1.10 Off-system units permitted for use in conjunction with SI units are listed in Table 1.8.


TABLE 1.2b

Supplementary Units of the International System (SI)

Quantity       Name of     Unit    Definition
               unit        symbol

Plane angle    radian      rad     One radian is the angle subtended at the centre of a
                                   circle by an arc the length of which is equal to the
                                   radius

Solid angle    steradian   sr      One steradian is the solid angle subtended at the centre
                                   of a sphere by an area of surface equal to the square of
                                   the radius
Quantity                                   Dimensions     Name of unit                         Symbol        Note
Angular momentum                           L^2 M T^-1     kilogram-metre squared per second    (kg·m^2)/s
Moment of force                            L^2 M T^-2     newton-metre                         N·m
Impulse                                    L M T^-1       newton-second                        N·s
Pressure, mechanical stress, modulus of    L^-1 M T^-2    pascal                               Pa            1 Pa = 1 N/m^2
TABLE 1.4

Derived Units of the CGS System, Mechanical Quantities

Quantity                        Dimensions     Name of unit                          Symbol       Expressed in SI units
Density                         L^-3 M         gram per cubic centimetre             g/cm^3       10^3 kg/m^3
Moment of inertia               L^2 M          gram-centimetre squared               g·cm^2       10^-7 kg·m^2
Momentum                        L M T^-1       gram-centimetre per second            (g·cm)/s     10^-5 (kg·m)/s
Angular momentum                L^2 M T^-1     gram-centimetre squared per second    (g·cm^2)/s   10^-7 (kg·m^2)/s
Force                           L M T^-2       dyne                                  dyn          10^-5 N
Moment of force                 L^2 M T^-2     dyne-centimetre                       dyn·cm       10^-7 N·m
Impulse                         L M T^-1       dyne-second                           dyn·s        10^-5 N·s
Pressure, mechanical stress,    L^-1 M T^-2    dyne per square centimetre            dyn/cm^2     10^-1 Pa
modulus of elasticity
Surface tension                 M T^-2         dyne per centimetre                   dyn/cm       10^-3 N/m
Work, energy                    L^2 M T^-2     erg                                   erg          10^-7 J
Power                           L^2 M T^-3     erg per second                        erg/s        10^-7 W
Dynamic viscosity               L^-1 M T^-1    poise                                 P            10^-1 Pa·s
Kinematic viscosity             L^2 T^-1       stokes                                St           10^-4 m^2/s



TABLE 1.5

Derived Units of the CGS System, Electrical Quantities

Quantity                            Dimensions                 Name of unit   Expressed in SI units
Electric current                    L^(3/2) M^(1/2) T^-2       —              10/c A
Electric current density            L^(-1/2) M^(1/2) T^-2      —              10^5/c A/m^2
Electric charge                     L^(3/2) M^(1/2) T^-1       —              10/c C
Electric charge density:
  (a) linear                        L^(1/2) M^(1/2) T^-1       —              10^3/c C/m
  (b) surface                       L^(-1/2) M^(1/2) T^-1      —              10^5/c C/m^2
  (c) volume                        L^(-3/2) M^(1/2) T^-1      —              10^7/c C/m^3
Polarization                        L^(-1/2) M^(1/2) T^-1      —              10^5/c C/m^2
Electric dipole moment              L^(5/2) M^(1/2) T^-1       —              1/(10c) C·m
Electric displacement flux          L^(3/2) M^(1/2) T^-1       —              10/(4πc) C
Electric displacement               L^(-1/2) M^(1/2) T^-1      —              10^5/(4πc) C/m^2
Electric potential, voltage, emf    L^(1/2) M^(1/2) T^-1       —              10^-8 c V
Electric field strength             L^(-1/2) M^(1/2) T^-1      —              10^-6 c V/m
Capacitance                         L                          centimetre     10^9/c^2 F
Electric resistance                 L^-1 T                     —              10^-9 c^2 Ω
Electric resistivity                T                          —              10^-11 c^2 Ω·m
Electric conductance                L T^-1                     —              10^9/c^2 S

TABLE 1.6

Derived Units of the CGS System, Magnetic Quantities

Quantity                               Dimensions                Name of unit   Symbol   Expressed in SI units
Magnetic flux                          L^(3/2) M^(1/2) T^-1      maxwell        Mx       10^-8 Wb
Magnetic induction                     L^(-1/2) M^(1/2) T^-1     gauss          G        10^-4 T
Magnetomotive force                    L^(1/2) M^(1/2) T^-1      gilbert        Gb       10/(4π) A
Magnetic field strength                L^(-1/2) M^(1/2) T^-1     oersted        Oe       10^3/(4π) A/m
Inductance, mutual inductance          L                         centimetre     cm       10^-9 H
Magnetic moment of electric current    L^(5/2) M^(1/2) T^-1      —              —        10^-3 A·m^2
Magnetization                          L^(-1/2) M^(1/2) T^-1     —              —        10^3 A/m
Magnetic resistance (reluctance)       L^-1                      —              —        10^9/(4π) A/Wb

TABLE 1.7

Off-System Units Permitted for Use in Physics and Astronomy

Quantity   Name of unit                  Symbol   Expressed in SI units
Length     astronomical unit             A.U.     1.49600 × 10^11 m
           light-year                    lt-yr    9.4605 × 10^15 m
           parsec                        pc       3.0857 × 10^16 m
           X-unit                        X        1.00206 × 10^-13 m
Mass       atomic mass unit (unified)    u        1.66057 × 10^-27 kg
Area       barn                          b        10^-28 m^2
Energy     electronvolt                  eV       1.60219 × 10^-19 J

TABLE 1.8

Off-System Units Permitted for Use in Conjunction With SI Units

Quantity                    Name of unit      Symbol   Expressed in SI units
Area (plots of ground)      hectare           ha       10^4 m^2
Volume                      litre             l        10^-3 m^3
Plane angle                 degree            °        π/180 rad
                            minute            ′        π/10 800 rad
                            second            ″        π/648 000 rad
Time                        minute            min      60 s
                            hour              h        3600 s
                            day               d        86 400 s
                            week              w        —
                            month             mo       —
                            year              yr       —
Mass                        tonne             t        10^3 kg
Celsius temperature,        Celsius degree    °C       Celsius temperature t = T − 273.15, where T
temperature difference                                 is the Kelvin temperature. One Celsius degree
                                                       is equal to one kelvin
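
Two of the conversions for units used alongside the SI are simple enough to sketch in code. The Python fragment below is an illustration only, with hypothetical function names; it takes the degree, minute and second of arc as π/180, π/10 800 and π/648 000 rad, and uses the relation t = T − 273.15 for the Celsius temperature.

```python
import math


def celsius_from_kelvin(kelvin_temperature):
    """Celsius temperature t = T - 273.15 for a Kelvin temperature T."""
    return kelvin_temperature - 273.15


def angle_to_radians(degrees, minutes=0, seconds=0):
    """Convert an angle given in degrees, minutes and seconds of arc to radians."""
    return (degrees * math.pi / 180
            + minutes * math.pi / 10800
            + seconds * math.pi / 648000)


triple_point_celsius = celsius_from_kelvin(273.16)   # triple point of water
straight_angle = angle_to_radians(180)               # expected to equal pi
```

Since a temperature difference is the same in both scales, only temperatures themselves, and not differences, need the 273.15 offset.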

II.  FUNDAMENTAL  PHYSICAL  CONSTANTS 

II.1 Listed in Table II.1 are the fundamental constants compiled under the auspices of the CODATA Task Group on Fundamental Constants. They have been officially adopted by CODATA and are taken originally from J. Phys. Chem. Ref. Data, Vol. 2, No. 4, p. 663 (1973) and CODATA Bulletin No. 11 (December 1973).
FUNDAMENTAL CONSTANTS


*  The  numbers  in  parentheses  are  the  one  standard-deviation  uncertainties  in  the  last  di¬ 
gits  of  the  quoted  values. 

**  The  uncertainty  (relative  error)  is  given  in  parts  per  million  (ppm). 


INDEX


Aberration,  geometric,  477 
Absolute  zero  temperature,  unat¬ 
tainability  of,  194 
Absorptance,  514 
Absorption  band,  484 
coefficient  for  elastic  waves, 
linear, 410
factor,  483 
of  light,  482 
of  light,  resonance,  484 
negative,  573 
resonance,  646,  652 
spectrum,  continuous,  484 
wave,  410 
Absorptivity,  483 
Acceleration,  27 
absolute,  116 
angular,  32 
average,  27 
centripetal,  29 
Coriolis,  117 
due  to  gravity,  121 
of  free  fall,  121,  694 
instantaneous,  27 
linear,  33 
normal,  29 
of  pendulum,  392 
relative,  116 
rotary,  33 
tangential, 28
of transportation, 117
Accelerator,  linear  electrostatic, 
319 

linear  resonant,  319 
particle,  318 
resonant  cyclic,  319 
Acceptor  level,  619 
Action  radius,  631 
range,  631 

Active  medium,  574 
Activity,  637 
specific,  637 
Adiabat,  148 
Adiabatic  equation,  148 
Affinity,  electron,  565 
Ammeter  shunt,  277 
Ampere,  685,  688 
Ampere’s  force,  291 
Ampere’s  law,  291 
Amplitude,  beat,  377 
of  oscillation,  363 
wave,  403 


Analyser,  496 
Angle,  position,  30 
of  rotation,  30 

Angular  frequency  of  wave,  403 
Angular  momentum,  about  a 
fixed  axis,  72 
about  a  fixed  point,  72 
intrinsic,  336 
orbital,  335,  337 
variation  law  of,  77 
Anharmonicity,  coefficient  of,  581 
Anisotropy, crystalline, 580
Annihilation,  positron-electron, 
672 

Anode,  278 
Antinode,  417 
standing  wave,  417 
Antiparticle,  671 
Antiproton,  671 
Anti-Stokes  line,  568 
Anti-Stokes  luminescent  radia¬ 
tion,  571 

Aperture,  numerical,  477 
Aperture diaphragm, 476
Arm,  of  couple,  73 
of  dipole,  227 

Aston  mass  spectrograph,  317 
Astronomical  unit,  689 
Atom,  554 
donor,  618 

Atomicity  of  electric  charges, 
280 

Atomic  mass  unit,  689,  691 
Atomic  nuclear  stability  crite¬ 
rion,  631 

Atomic  plane,  475 
Autophasing,  320 
Average  values,  128 
Avogadro  constant,  691 
Avogadro’s  law,  134 
Avogadro’s  number,  133,  691 
Axis,  free,  83 
of  rotation,  30 
instantaneous,  32 


Babinet’s  theorem,  473 
Balmer-Rydberg  formula,  554 
Balmer  series,  555 
Barn,  689 

Barometric  height  formula,  160 
Barrier, potential, 66
Barrier  layer,  622 
effect,  622 
Baryon,  662,  669 
charge,  669 

Basis,  orthonormal,  21 
Beat,  377 
Beer’s  law,  483 
Billet  split  lens,  447 
Biot-Savart-Laplace  law,  292 
Biot-Savart  Law,  see  Biot-Sa- 
vart-Laplace  law 
Biot’s  law,  510 
Birefringence,  498 
Blackbody, 514

Blackness,  total  degree  of,  516 
Blocking  layer,  623 
Body,  homogeneous,  39 
perfectly  elastic,  20 
perfectly  inelastic,  20 
perfectly  rigid,  20 
of  reference,  20 
rigid,  20 

Bohr  frequency  condition,  557 
Bohr  magneton,  337,  692 
Bohr  radius,  559,  692 
Bohr’s  correspondence  principle, 
543 

Bohr’s  postulates,  557 
Boiling,  218 
Boiling  point,  219 
Boltzmann  constant,  134,  693 
Boltzmann  distribution  law,  159 
Boltzmann relation, 188

Bond,  covalent,  565 
interatomic  chemical,  563 
ionic,  564 
length,  564 
metallic,  580 

Bose-Einstein  distribution  func¬ 
tion,  590 
Boson,  589 

Bouguer-Lambert-Fabrikant  law, 
574 

Bouguer-Lambert law, 482, 574
Boundary,  continuous-spectrum, 
572 

Brackett  series,  555 
Bragg’s  law,  see  Wulff-Bragg 
condition 

Branch  point,  276,  307 
Brewster’s  angle,  438,  496 
Brewster’s  law,  496 
Brightness  temperature,  523 
Brownian  movement,  192 


Candela,  686,  689 
Capacitance,  244 
mutual,  245 

Capacitive  reactance,  397 
Capacitor  (s),  246 
cylindrical,  247 
parallel-connected,  248 
parallel-plate,  246 
plate,  246 

series-connected,  248 
spherical,  246 
Capillary  tube,  216 
Capture,  neutron,  652 
resonance,  652 
Carbon-nitrogen  cycle,  656 
Carnot  cycle,  178 
reverse,  180 
Carnot  theorem,  184 
Cathode,  278 
photoelectric,  525 
Cation,  278 
Causality,  88 
Celsius,  degree,  690 
scale,  130 

Centigrade scale, international,
130 

Centre,  impurity,  617 
of  vaporization,  218 
CGS  system  of  units,  677 
Change  in  momentum,  law  of, 
43 

Charge,  bound,  255 
conjugation  principle,  671 
density,  linear,  227 
surface,  227 
volume,  227 

distribution,  continuous,  226 
discrete,  226 
free,  255 
of  nucleus,  626 
Charge-to-mass  ratio,  317 
Cherenkov  radiation,  see  Vavi- 
lov-Cherenkov  radiation 
Circle,  osculating,  22 
Circuit,  active  portion  of,  273 
branched,  275 
magnetic,  306 
passive  portion  of,  273 
Circulation,  of  field  strength  vec¬ 
tor,  235 

of  magnetic  induction  vector, 
304 

Clapeyron-Clausius  equation,  see 
Clausius-Clapeyron  equation 
Clapeyron equation, 133
Classical  mechanics,  19 
Clausius-Clapeyron  equation,  219, 
587 

Clock,  paradox,  94 
synchronism,  87 
Closed  mechanical  system,  38 
Coercive  force,  260,  348 
Coherence,  dimension,  spatial, 
452 

length,  446 
spatial,  452 

of  light,  degree  of,  450 
spatial,  452 
volume  of,  453 

Colliding  beam  technique,  323 
Collision,  66 
central,  66 
glancing,  66 
head-on,  66 
perfectly  elastic,  68 
inelastic,  67 

Collisions,  mean  number  of,  161 
Colour  temperature,  524 
Composition  of  velocities,  97 
Compound  pendulum,  equivalent 
length  of,  370 
Compression,  uniaxial,  584 
Compton,  effect,  530 
recoil  electron,  531 
wavelength,  530 
of  electron,  693 
of  neutron,  693 
of  proton,  693 

Concentration  of  solution,  equiv¬ 
alent,  283 

Conducting  direction,  623 
Conduction,  band,  614 
electron,  616 
extrinsic,  618 
heat,  138 
hole,  617 
impurity,  618 
intrinsic,  616 

in  semiconductors,  electrical, 
616 

n-type,  616 
p-type,  617 
unilateral,  622 
Conductivity,  electrical,  268 
Configuration,  of  system,  59 
zero,  60 

Conservation,  of  angular  momen¬ 
tum,  81 

of  baryon  charge,  670 
of  electric  charge,  220 


of  energy,  102,  140 
of isotopic spin, 669
laws  of  elementary-particle 
physics,  668 
of  mass,  39 

of  mechanical  energy,  63 
of  momentum,  46 
of  relativistic  mass,  99 
of  relativistic  momentum,  99 
of  strangeness,  670 
Conservative  system,  55 
Constraint,  37 
Contact  layer,  620 
Continuum,  four-dimensional 
space-time,  111 
Contrast,  degree  of,  449 
Conversion,  internal,  643 
Cooler,  179 
Corkscrew  rule,  290 
Cotton-Mouton,  constant,  510 
effect,  509 
Coulomb,  684 
force,  222 
law,  222 
Couple,  73 
Critical  angle,  440 
Crystal,  biaxial,  498 
covalent,  580 
dextrorotary,  510 
ionic,  580 
lattice,  579 
levorotary,  510 
molecular,  580 

optically  positive,  uniaxial,  501 
uniaxial,  498 
valence,  580 
Crystallite,  579 
Curie,  law,  344 
point,  259,  347 
Current,  265 
carrier,  264 
conduction,  264 
convection,  264 
direct,  265 
electric,  264 
induced,  323 
molecular,  344 
orbital,  335 
photoelectric,  525 
quasi-stationary,  372 
saturation  photoelectric,  526 
self-induction,  329 
Curvature,  centre  of,  22 
Curve,  adiabatic,  148 
potential  energy,  66 
Cybotaxic  region,  208 



Cycle,  177 
direct,  177 
reverse,  178 
Cyclotron,  319 
frequency-modulated,  321 


Damped  oscillations,  free,  385 
Damping,  logarithmic  decrement 
of,  386 

of  oscillations,  384 
Damping  factor,  384 
De  Broglie,  formula,  536 
wave,  536 

Debye  characteristic  temperature, 
609 

number,  287 

Debye-Hückel screening radius,
286 

Debye-Langevin  formula,  253 
shielding  length,  286 
law,  610 

Debye-Scherrer-Hull  powder  pho¬ 
tograph,  476 
Decay constant, 635
Dee,  319 

De-excitation  time,  average,  432 
Deformation,  582 
plastic,  583 
relative,  583 

Degeneracy  parameter,  593 
Degenerate  system,  592 
Degree,  690 
of  freedom,  23 
Deionization,  280 
of  gas,  289 
Density,  39 
average,  39 

of  current  heating  capacity, 
volume,  269 

displacement  current,  353 
probability,  538 
radiant  flux,  522 
Depletion  range,  623 
Derived  units,  682 
Detailed  balance  principle,  515 
Diagram,  pT,  587 
Diamagnetic  material,  340 
Diatomic  gas  molecule,  163 
Dichroism,  504 
Dielectric,  249 
constant,  222 
absolute,  224 
complex,  490 
nonpolar,  249 


polar,  250 
polarized,  252 
Diffraction  angle,  467 
circular  aperture,  466,  469 
convergent  ray,  465 
grating,  constant,  470 
one-dimensional,  470 
space,  474 
light,  465 

minimum,  principal,  471 
narrow  long  slit,  467 
parallel  ray,  465 
small  disk,  467 
spectrum,  472 
Diffusion,  170 
coefficient,  171 
Dimension,  proper,  91 
formula,  676 

Dipole,  moment,  permanent,  250 
quasi-elastic  (induced),  250 
rigid,  250 

Discharge,  gas,  283 
Dispersion,  of  light,  487 
rotary,  510 
of  waves,  411 

Displacement,  electric,  231 
magnetic,  290 
static,  390 

Dissipative  system,  64 
Dissociation,  degree  of,  280 
Distinguishability  of  identical 
particles,  151 

Distribution,  law,  free  path,  160 
molecular  velocity,  154 
Disturbance,  mechanical,  399 
Domain,  258,  348 
Donor  level,  618 
Doppler  effect,  420,  440 
longitudinal,  442 
transverse,  443 

Doppler  spectral  line  broaden¬ 
ing,  443 

Dose  rate  of  radiation,  644 
Double-magic  number,  631 
Drift,  281 

Drude-Lorentz  electron  theory, 
267 

Dulong-Petit  law,  605 
Dynamic  viscosity  coefficient, 
172 

Dynamics,  19 
relativistic,  100 

of  rigid  body  rotating  about 
fixed  axis,  fundamental  law 
of,  79 
Dyne,  687 




Effective cross section, 649
Eigenfunction,  540 
Einstein’s, general theory of relativity, 111

photoelectric  emission  equation, 
527 

relativity  principle,  35,  85,  441 
Elastic  body,  398 
Elasticity,  coefficient  of,  583 
degree  of,  583 
modulus  of,  584 
of  volume,  398 

Elastic  limit,  proportional,  583 
Elastic  wave,  longitudinal,  400 
sinusoidal,  402 
transverse,  400 

Electric  charge,  negative,  220 
point,  222 
positive,  220 
Electric  constant,  224 
Electric  dipole,  230 
moment,  230 
induced,  249 

Electric  displacement  flux,  231 
Electric  energy  source,  265 
Electric  fields,  superposition  of, 
227 

Electric layer, double, 620
Electric  radius  of  proton,  674 
Electrochemical  equivalent,  278 
Electrode,  278 

Electrodynamic constant, 677
Electrodynamics,  264 
principal  problem  of,  350 
Electrolytic  dissociation,  278 
Electromagnetic  induction,  323 
Electromagnetic  system,  absolute, 
676 

Electromagnetic  wave,  422 
intensity  of,  429 
pressure  of,  430 
Electromotive  force,  272 
Electron,  220 
charge  to  mass  ratio,  692 
collective,  267 
conduction,  267,  616 
conversion,  643 
optical,  481 

magnetic  moment,  692 
radius,  classical,  692 
rest  energy,  694 
mass,  691 
shell,  562 
temperature,  289 


valence,  563 

Electron-hole  junction,  624 
Electronvolt,  689 
Electrostatics,  220 
Electrostatic  system,  absolute, 
677 

Electrostriction, 223
Elementary  charge,  691 
Elementary  particles,  660 
Emf,  272 

Emissive  power,  514 
total,  513 

Emittance,  radiant,  513 
Energy,  51 
activation,  209,  616 
atomic  unit  of,  629 
band,  611 
binding,  555 
bound,  188 

in  central  force  field,  potential, 
60 

of  charged  capacitor,  261 
of  charged  conductor,  intrin¬ 
sic,  260 

of  currents,  mutual,  334 
density,  volume,  262,  333,  427 
kinetic,  407 
potential,  408 
dissipation  of,  64 
dissociation,  564 
efficiency,  photoluminescent, 
571 

eigenvalues  of,  540 
of  elastic  body,  potential,  62 
of  electrostatic  field,  261 
equipartition  of,  164 
flux,  409 
free,  187 
internal,  135 
intrinsic,  332 
kinetic,  56,  367 
level,  543 
impurity,  617 
quasi-continuous,  543 
width,  natural,  611 
magnetic  field,  332 
mutual  potential,  196 
nuclear  reaction,  649 
of  nucleon,  binding,  628 
potential,  59,  367 
quantum,  519 
rest,  102 

source,  electric,  271 
spectrum  of  particle,  540 
of  system,  binding,  104 
Energy, total, 134
mechanical,  63 
transfer  velocity,  409 
of  translational  motion,  average 
kinetic,  154 

of  two  particles,  mutual  po¬ 
tential,  62,  108 
unavailable,  188 
in  uniform  field,  potential,  60 
zero-point,  545 
vibrational,  168 
Entropy,  186 

Equation,  in  dynamics  of  transla¬ 
tion,  basic,  44 

of  forced  electrical  oscillation, 
differential,  394 

of  forced  oscillation,  differen¬ 
tial,  388 

of  state,  calorific,  136 
of  state,  thermal,  131 
Equations  of  motion,  21 
Equilibrium,  mechanical,  64 
Equilibrium  state,  129 
Equipotential  surface,  242 
Equivalence  principle,  123 
local,  124 
Erg,  687 

Euler  formula  for  complex  num¬ 
bers,  365 

Exchange  interaction,  quantum- 
mechanical,  565 
Excited  state,  555 
Evaporation,  217 
Exitance, radiant, 514
Expansion,  thermal,  580 
Expansion  coefficient,  580 
Exponent,  adiabatic,  148 
Exposure  dose  rate,  644 
Extension,  longitudinal,  584 
External  body,  38 
Extraordinary  ray,  499 


Fabry-Perot  etalon,  457 
Fajans-Soddy  law,  636 
Farad,  685 

Faraday,  constant,  692 
effect,  511 

law  of  electromagnetic  induc¬ 
tion,  323 

laws  of  electrolysis,  278,  279 
number,  692 
Fermi,  constant,  663 
energy,  595 
level,  596 


Fermi-Dirac  distribution  func¬ 
tion,  591 
Fermions,  590 
Ferroelectric  material,  257 
Ferromagnetic  material,  346 
Fick’s  law,  170 
Field,  36 

of  central  forces,  conservative, 
60 

conservative  nonstationary,  55 
stationary,  54 
electromagnetic,  225 
electrostatic,  224 
force,  224 
gravitational,  107 
induced  electric,  323 
of  interference,  447 
magnetic,  290 
physical,  36 
potential  steady,  54 
radiation,  430 
rotational,  305 
stationary,  36 
steady-state  electric,  224 
strength,  electric,  225 
magnetic,  294 
uniform  electric,  225 
Fine-structure  constant,  664,  691 
First  radiation  constant,  694 
Fission  parameter,  critical,  654 
Fluctuation,  189 
absolute,  190 
mean  square,  190 
relative,  190 
Fluidity,  208 
Fluorescence,  570 
Flux,  305 

density,  magnetic,  290 
linkage,  310 

of  magnetic  induction  vector, 
305 

Focusing,  axial,  353 
double,  318 
radial,  353 
strong,  322 

Focusing  condition,  weak,  322 
Forbidden  band,  612 
Force(s),  35 
active,  37 
arm  of,  70 
of  attraction,  195 
induction,  198 
orientational,  197 
central,  221 
centre  of,  60 
Force(s), centrifugal, 118
centripetal,  42 
coercive,  260,  348 
conservative,  53 
Coriolis  inertial,  118 
Coulomb,  222 
couple,  73 
dispersion,  198 
dissipative,  55 
disturbing,  388 
driving,  388 
elastic,  583 
electromotive,  272 
external,  38 
extraneous,  269 
field,  conservative,  235 
function,  55 
gravitational,  104 
gravity,  120 
gyroscopic,  55,  119 
impact,  66 
impulse of, 40
independence of, 42
inertial, 118
internal,  38 
Lorentz,  310 
magnetic,  311 
magnetizing,  306 
magnetomotive,  306 
motive,  52 
nonconservative,  63 
nuclear,  631 
potential-field,  53 
quasi-elastic,  366,  584 
reactive,  44 
of  repulsion,  195 
of  resistance,  52 
resultant,  36 
retarding,  52 
short-range,  195 
steady,  59 

of  transportation,  inertial,  118 
van  der  Waals,  197 
Fourier  heat  conduction  law,  173 
Frame  of  reference,  20 
chronometry  of,  85 
heliocentric,  35 
inertial,  35 
laboratory,  35 
noninertial,  115 
Fraunhofer  diffraction,  465 
Free  body,  37 
Freedom,  degree  of,  162 
Free  fall  of  a  body,  121 
Free  rotation,  83 
Freezing  point,  587 


Frenkel’s  hole  theory,  208 
Frequency,  angular,  363 
beat,  377 

basic  angular,  378 
of  oscillation,  363 
plasma,  371 
resonance,  391 
of  vibration,  natural,  419 
wave,  402 
Fresnel 
biprism,  447 
diffraction,  465 
formulas,  438 
mirror  principle,  446 
zone,  464 
Fringe  (s) 

equal-inclination,  455 
equal-thickness,  456 
interference,  448 

Fundamental  physical  constants, 
681 

Fusion,  585 


Galilean  transformation,  48 
Galileo’s  principle  of  relativity, 
50 

Gamma  radiation,  435 
Gamma  ray,  642 
Gas,  dilute,  175 
electric  breakdown  of,  285 
electron,  242 
ideal,  132 
rarefied, 175
real,  194 

van  der  Waals,  136,  200 
Gas  constant,  133 
specific,  133 
universal,  134 
Gas  discharge,  283 
self-maintained,  284 
semi-self-maintained,  283 
Gauss,  688 

theorem,  see  Ostrogradsky- 
Gauss  theorem 

Gaussian  system,  absolute,  677 
Generator,  coherent  light,  576 
Gilbert,  689 
Grain,  crystal,  579 
Gram  equivalent,  279 
Gravitational  constant,  105,  694 
Gravitational  red  shift,  648 
Gravitation, relativistic theory
of, 111

Gravity,  zero,  123 
Grazing angle, 475
Greybody,  515 
Ground  state,  555 
Group  of  bands,  566 
Gyromagnetic  ratio,  nuclear,  628 
of  orbital  magnetic  moments, 
336 

of  spin  moments,  336 
Gyroscope,  77 
balanced, 78
regular  precession  of,  77 
symmetrical,  77 


Hall  effect,  314 

Harmonic  oscillator,  linear,  368 
Harmonics, 379
Heat
capacity, 143
molar, 143
quantum theory of, 165
specific, 143
death, 189
exchange, 138
convective, 138
radiative, 138, 512
flow, specific rate of, 173
of fusion, 586
latent, 586
quantity of, 137
reduced quantity of, 185
of vaporization, 219
of vaporization, internal, 219
of vaporization, specific latent, 219
Heat  sink,  179 
Heat  source,  179 
Heater,  179 
Hectare,  690 

Heisenberg  indeterminacy  rela¬ 
tions,  548 
Henry,  685 
Hertz,  682 

High-energy  physics,  662 
Hologram,  478 
Holography,  478 
Hooke’s  law,  583 
for  volume  deformation,  399 
Hopkinson  formula,  306 
Humphreys  series,  555 
Huygens-Fresnel  principle,  463 
Huygens  principle,  462 
Hybridization  of  energy  bands, 
615 

Hyperon,  662,  668 


Hysteresis,  dielectric,  259 
magnetic,  347 
Hysteresis  loop,  348 


Impact,  line  of,  66 
oblique,  66 
Impedance,  397 
of  oscillatory  circuit,  373 
Impulse  of  force,  40 
Incidence,  of  light,  inclined,  472 
plane  of,  436 

Indistinguishability  of  identical 
particles,  589 
Induction,  electric,  231 
magnetic,  290 
mutual,  330 

Inductive  reactance,  397 
Inertia,  centre  of,  39 
law  of,  34 
principal  axis  of,  76 
Infrared  radiation,  435 
Initial  instant,  23 
Instantaneous  acceleration,  27 
Instantaneous power, 56, 398
Instantaneous  velocity,  24 
Intensity,  electric,  226 
gravitational  field,  107 
of  ionization,  283 
of  light,  429 
magnetization,  340 
radiant,  522 
wave,  410 
Interaction 

coupling  constant,  663 
direct,  650 
electromagnetic,  664 
nuclear  strong,  628 
quantum-mechanical  exchange, 
565 

strong,  604 
time  constant  of,  663 
weak,  664 
Interference 
aperture  angle  of,  447 
wave,  414 
fringe,  448 

maximum,  order  of,  415 
minimum,  458 
Internal  friction,  171 
International  System  (SI),  678 
Interplanar spacing, 475
Interval,  spacelike,  95 
space-time, 96
timelike,  95 
Inverted state, 575
Ion 

drift,  281 
hydrogen-like,  554 
isoelectronic,  554 
Ionization,  of  atom,  282,  555 
collision,  283 
degree  of,  286 
intensity  of,  283 
surface,  285 
volume,  285 
Ionosphere,  286 
Irradiance, 522
Isobar,  626 
Isochromate, 508
Isolated  mechanical  system,  38 
Isopathic fringes, 455
Isoprocess, 132
Isospin, 669
Isotherm,  148,  202 
critical,  202 
Isotope, 626
Isotope effect, 601


Joule,  683 

Joule-Lenz law, 269, 275
Joule’s  law,  see  Joule-Lenz  law 
Junction,  276,  622 


Kaon,  668 
Kelvin,  130,  679 
Kepler’s  laws,  113 
Kerr 

constant,  509 
effect,  electrooptical,  509 
law,  509 
Kilogram,  679 

Kinematic  viscosity  coefficient, 
172 

Kinematics,  19 
relativistic,  97 

Kinetic  theory,  basic  equation 
of,  152 
of  gases,  151 
Kirchhoff’s  function,  515 
Kirchhoff’s  law  for  radiation,  515 
Kirchhoff’s  laws,  276 
for  branched  magnetic  circuits, 
307 

K meson, 668
Knudsen  effect,  176 
Koenig’s  theorem,  58 


Lambert  law,  482 
Langevin-Debye formula, see Debye-Langevin formula
Langmuir  oscillations  of  plas¬ 
ma,  371 

Langmuir  plasma  frequency,  371 
Larmor  precession,  angular  ve¬ 
locity  of,  338 
Larmor’s  theorem,  338 
Laser,  576 

Lattice  constant,  579 
Lattice  plane,  475 
Laue  condition,  474 
Laue  diffraction  pattern,  476 
Launching  mass,  45 
Left-hand  rule,  291 
Lenz  law,  324 
Lepton,  662 
Lifetime,  average,  635 
mean  proper,  92 
Light,  433 
natural,  495 
optical  path  of,  453 
partially  polarized,  495 
pressure,  529 
reflection  law,  462 
refraction  law,  462 
visible,  435 
Light-year,  93,  689 
Line  of  action,  36 
Lines,  of  force,  226 
of  magnetic  force,  290 
Linewidth,  natural,  645 
Liquid,  207 
boiling,  202 
nonwetting,  213 
superheated,  205 
wetting,  213 

Liquid-drop  model  of  nucleus, 
630 

Lissajous  figures,  383 
Litre,  690 

Lorentz-FitzGerald contraction,
90 

Lorentz 
force,  310 
generalized,  313 
formula,  313 
invariance,  90 
transformation,  89,  442 
Lumen,  686 

Luminance  temperature,  523 
Luminescence,  570 
Luminophor,  570 
Lux, 686

Lyman  series,  555 


Macrocurrent,  344 
Magic  number,  631 
Magnetic  constant,  293 
Magnetic  field,  290 
internal,  344 
uniform,  291 
Magnetic  flux,  305 
of  mutual  induction,  331 
quantum,  692 
of  self-induction,  327 
Magnetic  interaction,  force  of, 
311 

Magnetic  material,  soft,  348 
retentive,  348 

Magnetic  moment,  of  closed 
loop,  297 
nuclear,  627 
orbital,  335,  337 
spin,  336 

Magnetic  permeability,  relative, 
293 

Magnetic  resistance,  total,  307 
Magnetic  rotation,  511 
Magnetization,  remanent,  348 
saturation,  344,  348 
uniform  spontaneous,  349 
Magnetohydrodynamic  generator, 
287 

Malus  law,  496 
Maser,  576 
Mass 

centre  of,  39 

defect,  104,  629 

dependence  on  velocity,  99 

energy  relation,  102 

flow,  specific,  170 

initial,  45 

molar,  133 

number,  626 

of  a  particle,  38 

rest,  99 

spectrometry,  317 
transfer,  137 
Material  point,  20 
Maximum,  order  of,  459 
subordinate,  458 
Maxwell,  688 
rule,  290 

Maxwell-Boltzmann,  distribution,  160 
law,  160 


Maxwellian  distribution,  154 
Maxwell’s  equal  area  rule,  205 
Maxwell’s  equations,  351 
complete  set  of,  358 
Maxwell’s  theory,  350 
Mayer  equation,  146 
Mean  free  path,  161 
Measurement,  550 
Mechanical  motion,  19 
Mechanical  system,  20 
Mechanics,  19 
quantum,  19,  535 
statistical,  128 
Medium,  continuous,  400 
dispersive,  411 
isotropic,  401 
linear,  401 
uniform,  401 
Melting,  585 
point,  585 

Mendeleev-Clapeyron  equation, 
134 

Mendeleev’s  periodic  system,  562 
Meniscus,  213 
Mesh  rule,  276 
Meshchersky  equation,  44 
Meson,  662 
Metal,  580 
Metre,  678 
Metric,  111 
MHD  generator,  287 
Microcurrent,  344 
Microfield,  averaged,  351 
Mie  effect,  486 
Minimum,  additional,  471 
Mmf,  306 
Mobility,  ion,  281 
of  molecule,  211 
Modulation,  amplitude,  380 
frequency,  380 
phase,  380 

Modulus,  of  elasticity,  584 
of  elasticity,  bulk,  399 
shear,  585 

Molar  gas  constant,  693 
Molar  volume  of  ideal  gas,  693 
Mole,  680 

Molecular  action,  range  of,  127, 
200 

sphere  of,  200 

Molecular-kinetic  theory,  126 
Molecule,  563 
atomic,  565 
diatomic  gas,  163 
effective  diameter  of,  132 
heteropolar,  584 
homopolar,  565 
ionic,  564 
ionization  of,  282 
monatomic  gas,  162 
Moment  of  force,  about  fixed 
axis,  71 

about  fixed  point,  70 
Moment  of  inertia,  74 
principal,  77 
Moment,  resultant,  70 
Moments  of  inertia  of  regular 
bodies,  76 
Momentum,  40 
angular,  72 
moment  of,  72 
relativistic,  99 

Monatomic  gas  molecule,  162 
Monocrystal,  579 
Moseley’s  law,  573 
Mossbauer  effect,  647 
Motion 
absolute,  115 
accelerated,  25 
aperiodic,  388 
of  centre  of  mass,  43 
curvilinear,  22 
decelerated,  25 
differential  equation  of,  42 
equations  of,  21 

in  frame  of  reference  fixed  to 
earth,  relative,  119 
of  free  rigid  body,  80 
independent,  23 
mechanical,  19 
plane,  22,  34 
rectilinear,  22 
relative,  116 
rotary,  30 
thermal,  126 
translational,  29 
uniform,  25 

uniformly  accelerated,  28 
decelerated,  29 
variable,  28 
Mu  meson,  665 
Muon,  93,  665 
antineutrino,  665 
neutrino,  665 
Mutual  induction,  330 


Nernst  heat  theorem,  193 
Nernst  principle,  194 
Neutrino,  97 
Neutron,  626 


fission,  654 

multiplication  factor,  654 
prompt,  654 
rest  energy,  694 
rest  mass,  692 
slow,  652 
thermal,  652 
Newton,  684 
ring,  456 

Newtonian  mechanics,  19 
Newton’s  first  law  of  dynamics, 
34 

law  of  universal  gravitation, 
104 

second  law,  40 
third  law,  42 
viscosity  law,  172 
Node,  standing  wave,  417 
of  wave  function,  545 
Nonconservative  system,  55 
Nonwetting  liquid,  213 
Normal,  principal,  22 
Nozzle  velocity,  46 
N-type  conduction,  616 
Nuclear  fission,  spontaneous,  654 
Nuclear  forces,  exchange  interpretation  of,  633 
saturation  in,  632 
Nuclear,  magneton,  627,  693 
power  engineering,  654 
radius,  627 
reaction,  648 
reactor,  654 
recoil,  646 
time,  683 
Nucleon,  626,  662 
number,  628 
structure,  673 
Nucleus,  626 
compound,  650 
daughter,  635 
mirror,  631 
parent,  635 


Object  beam,  478 
Oersted,  689 
Ohm,  685 

Ohm’s  law,  for  closed  magnetic 
circuits,  306 

for  current  density,  268,  598 
of  current  density  in  electrolytes,  281 

in  differential  form,  268 
generalized,  274 
Optical  axis,  498 
Optical  homogeneity,  475 
Optically  active  substance,  510 
Optical-path  difference,  453 
Optical  pyrometer,  522 
Optical  radiation,  435 
Optics,  quantum,  524 
Orbit  quantization  rule,  557 
Order  in  liquids,  short-range,  208 
Order,  of  principal  maximum,  471 
Ordinary  ray,  499 
Oscillation(s),  362 
addition  of,  374 
antiphased,  370 
circularly  polarized,  382 
coherent,  370 
cophased,  370 
electromagnetic,  362,  374 
electromechanical,  362 
elliptically  polarized,  381 
forced,  302 
free,  362 

free  damped,  385 
harmonic,  363 
linearly  polarized,  382 
mechanical,  362 
quasi-periodic,  380 
Oscillator,  430 
linear  harmonic,  431,  543 
strength,  492 
Oscillatory  circuit,  372 
Ostrogradsky-Gauss  theorem,  233 
for  magnetic  fields,  306 


Packet,  wave,  411,  445 
Paramagnetic  material,  342 
Parsec,  689 
Particle(s),  20 
elementary,  660 
steady  state  of,  540 
Pascal,  683 
Paschen  series,  555 
Path,  22 
difference,  415 
length,  22 

Pendulum,  compound,  368 
simple,  370 
spring,  368 

Penetrability  factor,  551 
Perfectly,  elastic  body,  20 
inelastic  body,  20 
rigid  body,  20 
Period,  beat,  377 
of  oscillation,  363 


of  revolution,  32 
Permeability,  absolute  magnetic, 
293 

of  free  space,  293 
relative  magnetic,  293 
Permittivity,  absolute,  224 
of  free  space,  224 
relative,  222 

Perpetual  motion  machine,  of 
first  kind,  141 
of  second  kind,  183 
Pfund  series,  555 
Phase,  302 
of  oscillation,  363 
of  plane  wave,  403 
Phase,  stability  principle,  320 
transition,  first-order,  219 
second-order,  205 
Phase  velocity,  406,  423,  536 
Phasotron,  321 

Phenomenological  thermodynamics,  129 
Phonon,  603,  608 
Phosphorescence,  570 
Photoconduction,  525 
Photoelasticity,  508 
Photoelectric  effect,  barrier-layer, 
525 

in  gases,  525 
internal,  525 
multiple-photon,  528 
threshold,  526 

Photoelectron,  525 
Photo-emf,  525 
Photoemission,  525 
Photoionization,  525 
Photoluminescence,  571 
quantum  yield  of,  571 
Photon,  97,  527 
mass  of,  528 
momentum  of,  529 
Physical  quantity,  quantized,  542 
Physics,  classical  statistical,  151 
high-energy,  662 
molecular,  126 
statistical,  128 
Pi  meson,  666 
Pion,  666 
triplet,  669 
Planck 

constant,  521,  691 
quantum  hypothesis,  519 
radiation  formula,  521 
Plane,  osculating,  22 
of  polarization,  426 




Plane-parallel  motion,  34 
Plasma,  286 
fully  ionized,  286 
gas-discharge,  289 
gaseous,  287 
high-temperature,  286 
isothermal,  288 
weakly  ionized,  286 
Pleochroism,  604 
P-n  junction,  624 
Point,  lattice,  127 
Point  rule,  276 
Poise,  687 
Poisson 
equation,  148 
spot,  467 

Polarizability,  249 
Polarization,  252 
charge,  surface-bound,  254 
current  density,  355 
electronic,  252 
ionic,  252 
of  light,  495 
orientational,  252 
remanent,  259 
spontaneous,  258 
of  transverse  sinusoidal  waves, 
401 

Polarizer,  495 
Polarizing  prism,  503 
Pole,  70 
Polycrystal,  579 
Population  inversion,  575 
Positive  hole,  616 
Positron,  220 

Postulate  of  special  theory  of 
relativity,  84,  85 
Potential 
barrier,  551 
chemical,  590 
difference,  contact,  621 
internal  contact,  621 
electric  field,  234 
electrostatic  field,  238 
of  gravitational  field,  109 
ignition,  285 
ionization,  282,  555 
jump,  surface,  620 
retarding,  525 
well,  66,  541 
Power,  56 
active,  398 
average,  56 
factor,  398 
Poynting  vector,  428 


Precession,  regular,  77 
Prefixes  used  with  SI  units,  678 
Pressure,  129 
internal,  201,  212 
Probability  density,  538 
Process,  adiabatic,  132 
equilibrium  quasi-static,  131 
irreversible,  182 
isobaric,  132 
isochoric,  132 
isothermal,  132 
isovolumic,  132 
nonequilibrium,  132 
thermodynamic,  131 
Product  of  inertia,  75 
Proton,  220,  626 
magnetic  moment,  693 
proton  chain,  655 
proton  doublet,  669 
rest  energy,  694 
rest  mass,  691 
Pseudovector,  30 
Psi  function,  538 
P-type  conduction,  617 
P-wave,  437 
Pyrometry,  optical,  522 


Q  factor,  387 

Quadrupole  moment  of  nucleus,  628 

Quality  factor,  387 

Quantum,  of  circulation,  692 
of  energy,  520 
mechanics,  19,  535 
number,  543 
magnetic,  559 
orbital,  558 
principal,  555 
spin,  560 

spin  magnetic,  560 
optics,  524 

states,  number  of,  589 
statistics,  588 

theory  of  electrical  conduction 
in  metals,  598 
Quark,  663 

Quarter-wave  plate,  506 
Quasi-momentum,  608 
Quasi-particle,  608 


Radian,  681 
Radiance,  522 
Radiant  flux,  522 
Radiant  flux  density,  522 
Radiating  system,  430 
Radiation,  anti-Stokes  luminescent,  571 
dose  rate  of,  644 
absorbed,  645 
equilibrium,  512 
isotropic,  513 
gamma,  435 
infrared,  435 
luminescent,  570 
optical,  435 
pyrometer,  522 
stimulated,  573 
temperature,  523 
thermal,  511 
ultraviolet,  435 
Vavilov-Cherenkov,  430,  492 
X,  435 

Radiator,  cosine,  523 
Radioactive  decay  law,  636 
Radioactive  family,  637 
Radioactive  half-life,  636 
Radioactivity,  633 
artificial,  635 
induced,  635 
natural,  635 

Radio-frequency  bands,  434 
Radius  of  curvature,  22 
Raman 
effect,  568 

scattering  of  light,  568 
spectrum,  569 
Rationalized  equations,  293 
Rationalized  law,  224 
Ray,  402 
incident,  436 
reflected,  436 
refracted,  436 
Rayleigh-Jeans  law,  518 
Rayleigh 
criterion,  477 
scattering,  485 
Reactance,  397 
Reaction,  chain  fusion,  655 
constraint,  37 
endothermic,  649 
exothermic,  649 
fission,  653 
nuclear,  649 
thermonuclear,  657 
Reaction,  channel,  649 
cross  section,  649 
Recombination,  282 
Rectifying  effect,  622 


Rectilinear  light  propagation  law, 
464 

Reference,  beam,  478 
frame,  21 

Reflection,  angle  of,  436 
coefficient,  439 
law  of,  437 
total  internal,  440 
Refraction,  double,  498 
law  of,  437 
Refractive,  angle,  436 
index,  absolute,  435 
complex,  483 
relative,  436 

Relationship,  cause-and-effect,  88 
Relativistic,  dynamics,  basic  law 
of,  100 
effect,  84 
invariance,  90 
mechanics,  19 
theory,  84 

Relativity,  mechanical  principle 
of,  50 

principle  of,  85 
special  theory  of,  84 
theory,  restricted,  84 
Relaxation  time,  209 
Releasability,  principle  of,  37 
Reluctance,  307 
Resistance,  273,  397 
additional,  277 
coefficient,  385 
total  magnetic,  307 
Resistivity,  electric,  268 
residual,  601 
Resolution,  477 
limit,  angular,  477 
Resolving  power,  477 
Resonance 
curve,  391 

of  oscillatory  circuit,  399 
mechanical,  391 
particle,  663 
Resonon,  663 

Right-hand  screw  rule,  290 
Rochelle  salt,  259 
Roentgen  equivalent  man  (rem), 
645 

Roentgen  rays,  435 
Rotation,  30 
angle  of,  30 
about  fixed  axis,  32 
free,  83 

specific,  510,  511 
uniform,  31 
Rotational  speed,  32 
Rotatory  power,  510,  511 
Rule,  right-hand  screw,  31 
Rydberg  constant,  554,  692 


Satellite  line,  569 
Scattering,  combination,  568 
of  light,  485 
molecular,  485 

Schrodinger,  steady-state  equation,  557 

time-dependent  wave  equation, 
539 

wave  function,  538 
Screens,  complementary,  473 
Seat  of  emf,  394 
Second,  683,  687 
Second  radiation  constant,  694 
Seignette’s  salt,  259 
Self-inductance,  328 
Self-induction,  327 
Semiconductor(s),  615 
diode,  625 
Series  limit,  555 
Shear,  585 
modulus,  585 
plane,  585 
strain,  unit,  585 
Siemens,  685 
Sign  rule  for  emf,  274 
Sites,  lattice,  127 
Snell’s  law,  437 
Soddy’s  displacement  law,  636 
Solenoid,  300 
Solid,  579 
amorphous,  579 
solution,  588 

Sonic  waves,  phase  velocity  of, 
406 

Sound,  audible,  400 
Sources,  coherent,  414 
Space 

charge  layer,  623 
homogeneity  of,  46 
isotropy  of,  82 
lattice,  579 
quantization,  559 
Spectroscopy,  nuclear,  647 
Spectrum,  band,  566 
continuous  absorption,  484 
diffraction,  473 
dispersion,  487 

of  electrons,  band  energy,  611 
electron-vibrational,  568 
frequency,  379 


light  absorption,  483 
line  X-ray,  572 
mass,  317 
molecular,  566 
oscillation,  379 
rotation,  567 
rotational,  32 
vibrational,  567 
Spin,  electron,  336 
isotopic,  669 
nuclear,  627 

quantum  number,  inner,  627 
State 

function,  132 
inverted,  575 
stable,  65 

of  thermodynamic  equilibrium, 
129 

unstable,  65 
Statics,  19 
Statistical  laws,  128 
Steady,  field,  36 
state,  129 
postulate,  557 

Stefan-Boltzmann,  constant,  516, 
693 

law,  516 

Steiner’s  theorem  of  parallel 
axes,  75 
Steradian,  681 
Stokes,  687 
line,  568 
theorem,  356 
Stoletov  law,  526 
Storage  factor,  387 
Strain,  unit,  583 
Strangeness,  670 
Strength,  magnetic  field,  345 
ultimate,  584 
Stress,  583 

Stress-strain  diagram,  tensile, 
584 

Sublimation,  585 
Subshell,  562 
Superconductivity,  601 
Superfluidity,  206 
Superposition  of  fields,  principle  of,  108 
of  forces,  42 

Surface,  equipotential,  242 
tension,  212 
wave,  402 
of  wave,  ray,  500 
Surface-active  substance,  213 
Susceptibility,  complex  dielectric, 
490 

dielectric,  253 
electric,  253 
magnetic,  342 
Suspension,  centre  of,  78 
point  of,  369 
S-wave,  437 
Synchrocyclotron,  321 
Synchrotron,  321 
System,  linear,  384 
of  particles,  20 


Target  nucleus,  649 
Tautochronous  paths,  453 
Temperature,  129 
absolute,  130 
absolute  zero,  130 
scale,  absolute,  130 
empirical,  130 
thermodynamic,  130,  184 
characteristic,  607 
colour,  522 
critical,  202 

Debye  characteristic,  609 
degeneracy,  593 
electron,  289 
luminance,  523 
of  solidification,  587 
Temperature-indicating  body,  130 
Tensile  strength,  584 
Term,  555 
Tesla,  685 

Test  charge,  electric,  225 
Thermal  conductivity,  173 
Thermal  efficiency,  179 
Thermodynamic  method,  128 
Thermodynamic,  probability  of  a 
state,  188 
processes,  150 
reversible,  181 
system,  129 

adiabatically  isolated,  140 
closed,  139 

thermally  isolated,  140 
Thermodynamics,  129 
first  law  of,  140 
phenomenological,  129 
second  law  of,  183 
third  law  of,  193 
Thermonuclear  chain,  655 
Thomson  formula,  373 
Three-level  procedure,  576 


Time 

coherence,  of  nonmonochromatic  waves,  446 
vibrations,  449 
dilation,  92 
proper,  92 
reference,  zero,  63 
relaxation,  209 
slowing  of,  92 
uniformity  of,  63 
Tonne,  690 
Top,  78 
Toroid,  299 

Total  current  law,  for  a  magnetic  field,  344 
in  vacuum,  304 
Transformation,  Galilean,  48 
Lorentz,  89,  442 
Transistor,  625 
Transition,  elements,  563 
temperature  to  superconducting 
state,  601 
Translation,  29 
Transmission  factor,  440 
Transmittance,  440 
Transport,  equations,  174 
phenomena,  170 

Transuranic  chemical  elements, 
652 

Trapping  level,  619 
Travelling  speed,  25 
Travelling  wave  equation,  401 
Triode,  crystal,  625 
Tsiolkovsky’s  formula,  45 
Tunnel  effect,  551 
Twin  paradox,  94 
Two-fluid  theory,  206 
Tyndall  effect,  485 


Ultimate  strength,  584 
Ultraviolet,  catastrophe,  518 
radiation,  435 

Umov-Poynting  vector,  428 
Umov  vector,  416 
Unit,  base,  678 
derived,  682 
fundamental,  676 
off-system,  689 
of  physical  quantity,  676 
Units,  system  of,  676 
Universal  gravitation,  constant 
of,  105 




Vacuum,  175 
degrees  of,  175 
high,  175 

Van  de  Graaff  electrostatic  generator,  319 

Van  der  Waals  coefficient,  136 
Vaporization,  217 
Vapour,  194 
dry  saturated,  202 
supersaturated,  205 
wet,  202 

Variable,  external  state,  130 
internal,  131 
state,  129 

thermodynamic,  129 
Variation,  harmonic,  365 
Vavilov-Cherenkov  radiation,  430, 
492 

Vavilov’s  law,  571 
Vector,  axial,  30 
current  density,  266 
displacement,  23 
of  external  forces,  principal,  43 
flux  of  magnetic  induction,  305 
magnetic  induction,  290 
magnetization,  340 
nonlocalized,  37 
polar,  30 
polarization,  252 
potential  gradient,  241 
Poynting,  428 
radius,  21 
Umov,  410 
Umov-Poynting,  428 
Vector  diagram  method,  365 
Velocity,  24 
absolute,  116 
angular,  30 
areal,  26,  113 
arithmetic  mean,  157 
average  relative,  159 
bulk,  116 

circular  orbital,  114 
drift,  116 

energy  transfer,  409 
escape,  114 
exhaust  gas,  45 
group,  413 
instantaneous,  24 
of  light,  invariance  of,  85 
linear,  31 
most  probable,  156 
of  multistage  (combination) 
rocket,  characteristic,  45 
orbital,  114 
parabolic,  114 


of  pendulum,  391 
phase,  406,  423,  536 
radial,  26 
relative,  116 

of  rocket,  characteristic,  45 
solar  escape,  115 
of  sound,  406 
of  source,  ray,  422 
space,  112 
spin,  78 

of  transportation,  116 
transverse,  26 
wave,  404 
of  wave,  ray,  500 
Verdet  constant,  511 
Vibration,  362 
isochronous,  370 
zero,  546 

Vibrator,  elementary,  430 
Viscosity,  171,  172 
Visibility  of  fringes,  449 
Visible  radiation,  435 
Volt,  684 
Voltage,  272 
breakdown,  248,  285 
drop,  272 
retarding,  525 
source,  394 
Voltmeter,  277 
Volume,  molar,  133 
specific,  129 


Watt,  683 

Wave(s),  acoustic,  399 
centre,  404 
coherent,  414 
partly,  449 

circularly  polarized,  420 
elastic,  399 
electromagnetic,  422 
elliptically  polarized,  426 
equation,  405 
extraordinary,  499 
incident  plane,  436 
length  of  standing,  417 
linearly  polarized,  426 
matter,  536 
mechanics,  535 
monochromatic,  425 
ordinary,  499 
plane,  403 

plane  monochromatic,  425 
plane  quasi-sine,  412 
radio,  433 
reference,  478 
reflected  plane,  436 
refracted  plane,  436 
signal,  478 
sine,  406 
sonic,  399 
source,  399 
spherical,  404 
standing,  416 
superposition  of,  411 
surface,  400 
train,  411,  445 
length,  harmonic,  446 
travelling,  401 
vector,  404 
zone,  435 
Wavefront,  402 
Wavenumber,  404 
Weak  interaction  constant,  664 
Weber,  685 

Weight  of  a  body,  122 
Weightlessness,  123 
Wetted  perimeter,  213 
Wetting,  angle,  213 
liquid,  213 

Wiedemann-Franz  law,  269 
Wien,  constant,  517 


displacement  law,  517 
formula,  518 
Work,  137 
expansion,  137 
of  force,  element  of,  52 
function,  620 
of  ionization,  282 
Working  medium,  177 
Wulff-Bragg  condition,  476 


X  radiation,  435 
characteristic,  572 
X  ray(s),  572 
braked,  572 

powder  photograph,  476 
structure  analysis,  476 
X-unit,  689 


Yield  point,  584 
Young’s  modulus,  584 
Young’s  two-slit  interference 
method,  451 
one  plate,  465 